Amazon Web Services brings MapR's M7 big data platform to Amazon Elastic MapReduce.
Amazon Web Services announced the availability of MapR M7 on Amazon Elastic MapReduce.
M7 is a big data platform, now available through Amazon Elastic MapReduce (EMR), that provides ease of use, dependability and greater performance for NoSQL and Apache Hadoop applications. Amazon EMR makes it easy and cost-effective to deploy and operate elastic Hadoop clusters.
Indeed, with just a few mouse clicks or a single line of code, customers can launch a dynamically scalable M7 cluster on Amazon EMR to store or process vast amounts of data, MapR officials said.
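As a rough illustration of what such a single-call launch might look like, the Python sketch below assembles the parameters for a cluster request without contacting AWS. The top-level field names are modeled on the Amazon EMR RunJobFlow API. The function name `build_m7_cluster_request`, the example cluster name, instance type and node count, and in particular the `mapr` product name and `--edition m7` arguments are assumptions made for illustration, not values confirmed in this announcement.

```python
def build_m7_cluster_request(name, instance_type, instance_count):
    """Assemble launch parameters for a hypothetical MapR M7 cluster on EMR.

    This only builds a dictionary; it does not call AWS.
    """
    if instance_count < 1:
        raise ValueError("instance_count must be at least 1")

    return {
        "Name": name,
        "Instances": {
            "MasterInstanceType": instance_type,
            "SlaveInstanceType": instance_type,
            "InstanceCount": instance_count,
            # Keep the cluster running after steps finish (long-lived HBase use).
            "KeepJobFlowAliveWhenNoSteps": True,
        },
        # Assumption: the product name and edition arguments below are
        # illustrative and should be checked against current EMR documentation.
        "NewSupportedProducts": [
            {"Name": "mapr", "Args": ["--edition", "m7"]},
        ],
    }


if __name__ == "__main__":
    # Hypothetical values: a five-node cluster on High Storage instances.
    request = build_m7_cluster_request("m7-demo", "hs1.8xlarge", 5)
    print(request)
```

A dictionary of this shape could then be handed to an EMR client as keyword arguments. A real launch would also need settings omitted here, such as credentials and a software version.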
In essence, MapR M7 delivers ease of use, dependability and performance for HBase and Hadoop. For customers looking to deploy HBase, M7 offers seamless splits, no compactions, instant recovery from failure, point-in-time recovery, full HA, mirroring and consistent low latency, AWS officials said.
Amazon EMR makes it easy to provision and manage Hadoop in the AWS Cloud. Hadoop is available in multiple distributions, and Amazon EMR gives you the option of using the Amazon Distribution or the MapR Distribution for Hadoop. MapR delivers on the promise of Hadoop with an enterprise-grade platform that supports a broad set of mission-critical and real-time production uses.
"AWS is pleased to offer Amazon EMR with MapR M7," said Peter Sirota, general manager for Amazon Elastic MapReduce. "Together, MapR M7 and Amazon EMR are a powerful combination for our customers."
M7 is supported on multiple AWS instance types, including the latest high-performance SSD-backed High I/O instances. It can scale horizontally to thousands of nodes per cluster. Standard YCSB benchmark tests on AWS High Storage instances have shown M7 delivering consistent performance of over 100,000 operations per second per node, demonstrating how M7 users can derive greater utilization from their cloud infrastructure.
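To put the per-node figure in context, the following back-of-the-envelope Python sketch multiplies it by a node count. It assumes throughput scales linearly with the number of nodes, which the benchmark cited above does not claim, so the result is best read as an upper bound. The function name and the example node counts are illustrative only.

```python
# Per-node figure reported for the YCSB tests on High Storage instances.
PER_NODE_OPS_PER_SEC = 100_000


def estimated_cluster_ops(nodes, per_node_ops=PER_NODE_OPS_PER_SEC):
    """Estimate aggregate operations per second for a cluster.

    Assumption: perfectly linear scaling, so this is an upper bound.
    """
    if nodes < 0:
        raise ValueError("nodes must be non-negative")
    return nodes * per_node_ops


if __name__ == "__main__":
    # Hypothetical cluster sizes, chosen only to show the arithmetic.
    for nodes in (10, 100, 1000):
        print(nodes, "nodes:", estimated_cluster_ops(nodes), "ops/s (upper bound)")
```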
"MapR is the only distribution that enables Linux applications and commands to access data directly in the cluster via the NFS interface
that is available with all MapR editions
," said Jeff Barr, chief evangelist for Amazon Web Services, in a July 12 blog post
. "MapR M7 was optimized for cloud deployments including high performing instances such as High Storage
and High I/O
The MapR enterprise-grade platform supports a broad set of mission-critical and real-time production uses and claims to bring dependability, ease of use and high speeds to Hadoop, NoSQL, database and streaming applications in one unified big data platform.
"MapR's latest technology accomplishment with the availability of the MapR M7 Distribution is providing ground breaking capabilities for Apache HBase applications to enhance Big Data operations," said John Schroeder, CEO and co-founder of MapR, in a statement. "Customers that want added flexibility, scalability and cost-effectiveness in the cloud can gain further benefits from MapR's technology via AWS."
"We joined forces with MapR Technologies
last June to deliver enterprise-grade Hadoop on EMR with their M5 and M3 Editions," Barr wrote in his post. "We're making MapR's M7 Edition available on EMR, enabling users to run 24x7 HBase
applications in addition to their Hadoop ones."
The M7 architecture provides the following advantages for HBase users, according to Barr:
- Up to 100K ops/s per node on hs1 instances
- Instant recovery from any failure
MapR is used across financial services, retail, media, health care, manufacturing, telecommunications and government organizations as well as by leading Fortune 100 and Web 2.0 companies. Investors include Lightspeed Venture Partners, Mayfield Fund, NEA and Redpoint Ventures.