MapR, Informatica Partner on New Hadoop Distribution

By Chris Preimesberger  |  Posted 2012-03-06

Sharing resources is the operative term here: This new analytics software is now accessible from four different vendors.

MapR Technologies and Apache Hadoop, which already partner for their own distribution of the popular open-source batch analytics software package, on March 5 announced that MapR Hadoop will become available with Informatica's data integration platform.

Informatica HParser Community Edition, already available for free download from the Informatica site, is now freely available as part of the MapR distribution from the MapR Website. Moreover, the MapR distribution is also available through the EMC Greenplum MR Edition and as part of the Cisco Unified Computing System.

By using MapR and Informatica in tandem, enterprises can perform high-performance data integration, replication and messaging and get reports quickly. Together the two products can move numerous terabytes per hour of transaction, interaction and streaming data into MapR, in addition to parsing and processing a range of structured and unstructured data natively in Hadoop, all without scripting.

Informatica HParser provides Hadoop developers with parsing capabilities to address data sources that include logs, call data records, industry standards, documents, and binary or hierarchical data.

Key features of the combination of MapR and Informatica include:

  • Bidirectional data integration with Informatica PowerCenter and Informatica PowerExchange.
  • Snapshot replication using Informatica FastClone.
  • Data streaming using Informatica Ultra Messaging.
  • Parallel parsing and transformation on MapR using Informatica HParser.

"As the big data ecosystem continues to grow more holistic, data management practices are needed to address the challenges of information volume, variety, velocity and complexity," said Gartner Research Vice President Merv Adrian.

"Coupling advanced data management practices is the next step organizations need to take as they build a platform that expands the breadth and depth of applications."

Chris Preimesberger was named Editor-in-Chief of Features & Analysis at eWEEK in November 2011. Previously he served eWEEK as Senior Writer, covering a range of IT sectors that include data center systems, cloud computing, storage, virtualization, green IT, e-discovery and IT governance. His blog, Storage Station, is considered a go-to information source. Chris won a national Folio Award for magazine writing in November 2011 for a cover story on Salesforce.com and CEO-founder Marc Benioff, and he has served as a judge for the SIIA Codie Awards since 2005. In previous IT journalism, Chris was a founding editor of IT Manager's Journal and was managing editor of Software Development magazine. His diverse resume also includes stints as a sportswriter for the Los Angeles Daily News, covering NCAA and NBA basketball; television critic for the Palo Alto Times Tribune; and Sports Information Director at Stanford University. He has served as a correspondent for The Associated Press, covering Stanford and NCAA tournament basketball, since 1983. He has covered a number of major events, including the 1984 Democratic National Convention, a Presidential press conference at the White House in 1993, the Emmy Awards (three times), two Rose Bowls, the Fiesta Bowl, several NCAA men's and women's basketball tournaments, a Formula One Grand Prix auto race, a heavyweight boxing championship bout (Ali vs. Spinks, 1978), and the 1985 Super Bowl. A 1975 graduate of Pepperdine University in Malibu, Calif., Chris has won more than a dozen regional and national awards for his work. He and his wife, Rebecca, have four children and reside in Redwood City, Calif. Follow on Twitter: editingwhiz
