Google Public Data Explorer Visualizes Statistics

 
 
By Nicholas Kolakowski  |  Posted 2010-03-08
 
 
 

Google announced the launch March 8 of Google Public Data Explorer, an experimental application offered through Google Labs. The tool allows everyone from students to policy wonks to create a wide variety of charts from public data, such as fertility or unemployment rates.

For example, a user who swings by the Website can select "Explore the Data," choose a data set such as "Unemployment in the U.S.," and then click on various options to see a visual state-by-state comparison of unemployment to the national average. A user could also create a colorful chart detailing life expectancy for various countries over the past few decades, and then embed that chart in a blog post.

The Website taps into a number of public data sources, including the Organization for Economic Co-operation and Development, California Department of Education, Eurostat, the U.S. Centers for Disease Control and Prevention, and the U.S. Bureau of Economic Analysis. Google search already incorporates statistical data from the World Bank, the U.S. Bureau of Labor and Statistics, and the U.S. Census Bureau-all of which is also being used in the Public Data Explorer application.

As part of the Public Data Explorer initiative, Google used search information to determine the types of public data most sought after by users.

"To help us better prioritize which data sets to include in our public data search feature, we've analyzed anonymous search logs to find patterns in the kinds of searches people are doing, similar to the patterns you can find on Google Trends and Insights for Search," Jurgen Schwarzler, a statistician with Google's Public Data team, wrote in a March 8 post on the official Google blog. "Some public data providers have asked us to share what we've learned, so we decided to put together an approximate list of the 80 most popular data and statistics search topics."

Those topics include school comparisons, unemployment, population, sales tax, salaries, exchange rates, health statistics and oil prices. Google examined those billions of queries from across a number of sources and filtered out spam and repeats in order to create the full public data list.

Google has drawn on its users' search data to provide a view of a larger societal trend before. In April 2009, Google.org set up a Flu Trends site in an effort to track the spread of the H1N1 flu virus.

"We've found that there is a close relationship between how many people search for flu-related topics and how many people actually have flu symptoms," read an April 27 note on the Flu Trends site. "Some search queries tend to be popular exactly when flu season is happening, and are therefore good indicators of flu activity. Our estimates, based on up-to-date aggregated Google search data, may indicate flu activity up to two weeks ahead of traditional flu surveillance systems."

Microsoft's Bing search engine also displays public data in charts and graphs through its partnership with Wolfram Alpha, a self-described "computational engine" Website that offers a definitive-often numerical-answer to queries in place of the traditional page of hyperlinked search results. Typing in a phrase such as "U.S. military vs. UK" will return a statistical comparison of the two armies in a chart, whereas typing in a simpler phrase such as "death rates in the United States" will return a single data point.

Wolfram Alpha is the product of Stephen Wolfram, founder and CEO of Wolfram Research and creator of Mathematica, a computation platform the symbolic code of which forms the core code base of Wolfram Alpha. Google co-founder Sergey Brin reportedly interned with Wolfram before creating his own search-based startup.

Rocket Fuel