Database Vendors Take a Page from Google for Data Analytics

Database Vendors Take a Page from Google for Data Analytics

Written By
Brian Prince
Brian Prince
Aug 26, 2008
2 minute read
eWeek content and product recommendations are editorially independent. We may make money when you click on links to our partners. Learn More

Aster Data Systems and data warehousing vendor Greenplum have added support for Google’s MapReduce framework in the name of data analytics for enterprises.

The move by Greenplum comes as Aster Data Systems unveiled In-Database MapReduce, which is available now for evaluation.

“Traditionally, massively parallel [processing] databases were able to parallelize ordinary SQL, but had limitations when parallelizing more general programs, whether written as user-defined functions or a database programming language such as PL/SQL,” said Mayank Bawa, CEO of Aster.

“In many cases, these capabilities simply ran on a single node of an MPP database. Now, analysts and developers can take advantage of the power of MapReduce from within ordinary SQL, by creating SQL/MR functions in Java, Python, R and more.”

Leveraging Greenplum MapReduce, companies can write MapReduce programs in a few lines of Perl or Python that can process and analyze huge volumes of unstructured data for a variety of applications, such as keyword analysis and content indexing, according to Greenplum President and co-founder Scott Yara. In addition, while SQL is expressive enough to allow some analysis and data mining, there is a range of powerful mining and machine learning tools that are not easily expressed via SQL, he argued.

“Good examples are Bayesian machine learning approaches, clustering algorithms and natural language processing,” Yara said.

The effectiveness of MapReduce as an answer to the analytical needs of enterprises is becoming an increasingly hot subject of discussion as cloud computing has gained steam. Gartner analyst Donald Feinberg described MapReduce as complex but added it can give enterprises the ability to process extremely large sets of data very fast.

“It’s very, very programming-intensive,” he said. “It’s not something that your average application programmer that writes programs in SQL using SQL in C or SQL in Java or something like that is going to do.”

Curt Monash, president of Monash Research, described MapReduce as a powerful tool for data manipulation and analysis.

“Companies that are integrating MapReduce and SQL are increasing its applicability and giving developers and DBAs [database administrators] the ability to work together on a common parallel data processing infrastructure,” Monash said in a statement.

eWeek Logo

eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site's focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.