News

Open Source Dependency Management As A Balancing Act

During my career I have spent a lot of time packaging other people's code, writing my own, and working on large software frameworks. I have seen projects that still haven't released a stable version, never quite hitting 1.0, while others made 1.0 releases within months of beginning development, and then quickly moving on to 2.0, 3.0, etc. There is quite a variance in these release cycles, and this coupled with maintaining large projects can make things difficult. I will go through some of the decisions we have faced in projects I have worked on and the pressures on the project. On the one extreme, users would like to have a stable API that never changes, with dependencies that don't specify a minimum version so that they can choose whatever version works best...

How to Use Sphinx to Give an Old Book New Life

The Internet Archive, Project Gutenberg, and Google Books are wonderful sources of historical books, but the finished products of their digitization efforts, while thorough and functional, lack that last bit of polish. For example, one of my interests is historical cooking, specifically Georgian and Regency British cookery and the contemporary period in American cookery, but the PDF versions of the relevant cookbooks are usually just basic black and white scans with no features that aid findability or searchability. The plain text versions, while more searchable, are not aesthetically pleasing and often contain numerous optical character recognition errors...

4 Ways to Open Up Your Project's Infrastructure

Open source isn't just about opening up your code—it's also about building a supporting infrastructure that invites people to contribute. In order to create a vibrant, growing, and exciting project, the community needs to be able to participate in the governance, the documentation, the code, and the actual structures that keep the project alive. If the overall "hive" is doing well, it attracts more individuals with diverse skills to the project. Although many projects strive for "open everything," infrastructure is often closed to contribution. Usually, only a few people run the infrastructure and keep the lights on. They're sometimes unable to recruit help because, well, you can't really give the keys to the kingdom to everyone. A certain level of trust is needed before granting a contributor access to project infrastructure...

I'm OK -- You, Maybe Not So Much

It is widely agreed that competition, or lack thereof, in health care is a problem.  The Wall Street Journal recently showed how Viagra and Cialis prices seem to move -- up, of course -- in lockstep. USA Today found Walgreens charging 1237% more than Costco, for the same drug.  Economists like Martin Gaynor have been discussing problems with competition in health care for years. The Harvard Business Review just published a lengthy article on the problem. But, it turns out, we may be ignoring an important competition that has real impacts on our health: with each other...

Python Versus R for Machine Learning and Data Analysis

Machine learning and data analysis are two areas where open source has become almost the de facto license for innovative new tools. Both the Python and R languages have developed robust ecosystems of open source tools and libraries that help data scientists of any skill level more easily perform analytical work. The distinction between machine learning and data analysis is a bit fluid, but the main idea is that machine learning prioritizes predictive accuracy over model interpretability, while data analysis emphasizes interpretability and statistical inference. Python, being more concerned with predictive accuracy, has developed a positive reputation in machine learning. R, as a language for statistical inference, has made its name in data analysis...

Open Source FIWARE Platform Creates New IoT Business Opportunities

The European-funded IoT open source platform FIWARE has matured significantly in the past two years according to developers, and is now being used in industrial production cases, pilot smart city, and utilities projects. Two projects using the FIWARE platform include a city water quality pilot and an early warning system to identify and prevent pest risks to agricultural crops. To further support industry uptake, FIWARE has recently formalized a foundation to lead community efforts. The Foundation is expected to see a new wave of community participation in the open source platform, which already has significant links with other open source projects...

Why Keep Open States Going?

We announced earlier this month that Open States—a project covered on Opensource.com in 2011—is now being maintained by the original creators of the project, a community of Sunlight Foundation alumni and other volunteers. After a year of scant staffing culminating in the closure of Sunlight Labs, we expect that getting Open States fully operational again will take a significant effort, and we know from experience that maintaining the menagerie of scrapers into the future isn't easy either. (That's why we're looking for volunteers and donations.) So why, despite the challenges, are we taking on this project? Why do we think that Open States matters so much? And where do we plan on taking this tool going forward?...

How to Create an Internal Innersource Community

In recent years, we have seen more and more interest in a variance of open source known as innersource. Put simply, innersource is taking the principles of open source and bringing them inside the walls of an organization. As such, you build collaboration and community that may look and taste like open source, but in which all code and community is private within the walls of the organization. As a community strategy and leadership consultant, I work with many companies to help build their innersource communities. As such, I thought it could be fun to share some of the most important principles that map to most of my clients and beyond. This could be a helpful primer if you are considering exploring innersource inside your organization...

Read More »

Open Chemistry Project Upholds Mission of Unorganization, The Blue Obelisk

Chemistry is not the most open field of scientific endeavor; in fact, as I began working more in the area (coming from a background in physics), I was surprised with the norms in the field. As a PhD student way back in 2003, I simply wanted to draw a 3D molecular structure on my operating system of choice (Linux), and be able to save an image for a paper/poster discussing my research. This proved to be nearly impossible, and in 2005 a group of like-minded researchers got together at a meeting of the American Chemical Society and formed an unorganization: The Blue Obelisk (named after their meeting place in San Diego)...

Open Data Portals Should Be API [First]

Not long ago, I was speaking at the National Association of Government Web Professionals. At the same conference, Mark Headd was speaking. We were speaking on different open data topics. My discussion was about the difference between open government and open data and his talk was about API [First]. Luckily they had us scheduled at different times, so I had the opportunity to see him speak on the API [First] strategy for website development using open data. His subtitle was "Open Data as a Foundation for Better Websites"...