Black Duck Software has set out to debunk some of the leading “myths” about open-source software development, including the belief that open-source developers have created only a few billion lines of code.
Black Duck, which provides products and services for accelerating software development through the managed use of open-source and third-party code, released its research dispelling some of the more common myths about open-source development.
Black Duck officials said the company actively spiders the Internet collecting downloadable code into a giant repository that is known to Black Duck customers as the Black Duck KnowledgeBase. This core information repository contains more than 170,000 open-source projects from nearly 4,000 unique Web sites. The top myth findings were derived from actual analysis of open-source software rather than just user perception.
Myth No. 1 that Black Duck attempts to dispel is that open source is just source code. However, according to Black Duck, source code is actually only 15 percent of what is released by open-source developers. There are four times as many binary files as source files in open-source releases. In addition to binaries, open-source projects are packaged with scripts, markup language files, graphics images, documentation and many other artifacts, Black Duck officials said.
Open-source myth No. 2 is that open-source adoption is mostly application infrastructure. Black Duck officials said there is a tendency to focus on adoption of monolithic applications in IT infrastructure-for example, Linux or MySQL. However, the Black Duck KnowledgeBase shows that the open-source world is dominated by components, not fully formed applications, and these are being reused from project to project in hundreds and even thousands of instances.
Apache Log4j, for example, is reused by over 5,500 projects, according to the Black Duck Knowledgebase.
“Java developers, in particular, have taken tremendous advantage of code reuse,” the Black Duck study said. “There are 14 times more files distributed ending in the .class file suffix (binaries) than .java (source files). A major reason is that Java components are built once and reused and redistributed by many other projects in binary form.”
Tens of Billions of Lines of Code
The third myth Black Duck sets out to dispel is that there are only a few billion lines of open source code out there. However, “this figure is an order of magnitude too low,” the Black Duck study said. “There are tens of billions of lines of open source code available on the Internet,” the company said. In addition, 23 percent of all downloadable code was released or renewed in 2008. Over 90 percent of open source code is written in the major languages: C, C++, Java, JavaScript and C#; however, dozens of languages are used, the study said.
The fourth myth Black Duck takes a swipe at is that open-source programmers do not add comments about their code. Yet, according to the Black Duck findings, open-source developers create about one comment line for every four lines of source code they produce. Indeed, the findings showed that the most commented programming language is Java, with more than one comment line for every two lines of code. The least commented language is Boo; a python-inspired programming language that operates within the .NET Framework.
The fifth and final myth Black Duck takes on is that the GNU GPL (General Public License) Version 3 is being ignored. However, Black Duck research showed that GPLv3, which was initially released in June 2007, has grown from zero to over 6,300 projects today. In terms of project adoption, it has surpassed the CPL (Common Public License), Mozilla, MIT and Apache licenses.
In fact, according to Black Duck, GPLv3 is now the fifth most chosen license in the open-source community, and, if the current trend continues, it will surpass BSD (Berkeley Software Distribution) for the No. 4 spot in a year or two. About 70 percent of all open-source projects use a variant of the GPL license, Black Duck said.
Doug Levin, founder and director of Black Duck, called the company’s open-source myth-busting “interesting and useful” for enterprises.