MapR Technologies' M7 Takes Hadoop and HBase to the Next Level
November 19, 2012
By Rachel Ramsey, TMCnet Web Editor
Data becomes big data when your existing systems can no longer process, store and manage it efficiently. Apache Hadoop, an open source framework, is the best tool available today for processing and storing herculean amounts of big data. It makes data mining, analytics and processing of big data cheap and fast.
MapR Technologies makes managing and analyzing big data a reality for more business users. It provides a complete distribution for Apache Hadoop in three editions: M3, M5 and M7. The M3 Edition is free and available for unlimited production use; support is provided on a community basis and through MapR’s forums. The M5 Edition is a subscription software offering that includes features such as mirroring, snapshots, NFS HA and data placement control, among others. It’s ideal for the most demanding mission-critical environments and also offers full support, on-demand patches and online incident submission.
MapR M7 is a complete distribution for Apache Hadoop and HBase that includes Pig, Hive, Mahout, Cascading, Sqoop, Flume and more. HBase is a distributed column-oriented database built on top of the Hadoop Distributed File System (HDFS). HBase is the Hadoop application to use when you require real-time read/write random access to very large datasets. It’s modeled after Google’s Bigtable and targeted to support large tables, on the order of billions of rows and millions of columns.
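To make the Bigtable-style data model more concrete, here is a minimal in-memory sketch in Python. It is a toy written for illustration, not the HBase client API or anything specific to M7. The class name `ToyTable`, its methods and the sample row keys are all hypothetical. It shows only the basic idea: a sparse table keyed by row, then by a "family:qualifier" column name, with timestamped versions of each cell.

```python
from collections import defaultdict
import time


class ToyTable:
    """Toy sketch of a Bigtable-style sparse table (hypothetical, not the HBase API)."""

    def __init__(self):
        # row key -> "family:qualifier" -> list of (timestamp, value), newest first
        self._rows = defaultdict(lambda: defaultdict(list))

    def put(self, row, column, value, ts=None):
        """Store a new version of a cell; older versions are kept."""
        ts = time.time() if ts is None else ts
        versions = self._rows[row][column]
        versions.append((ts, value))
        versions.sort(key=lambda v: v[0], reverse=True)

    def get(self, row, column):
        """Return the newest value for (row, column), or None if absent."""
        if row not in self._rows or column not in self._rows[row]:
            return None
        return self._rows[row][column][0][1]

    def scan(self, start_row, stop_row):
        """Yield (row, {column: newest value}) for start_row <= row < stop_row, in key order."""
        for row in sorted(self._rows):
            if start_row <= row < stop_row:
                yield row, {c: v[0][1] for c, v in self._rows[row].items()}


table = ToyTable()
table.put("user#001", "info:name", "Ada", ts=1)
table.put("user#001", "info:name", "Ada L.", ts=2)
table.put("user#002", "info:email", "bob@example.com", ts=1)
print(table.get("user#001", "info:name"))  # prints "Ada L." (the newest version)
```

Rows only store the columns that were actually written, which is why such tables can be very wide yet sparse. A real deployment distributes sorted ranges of row keys across many machines, which this sketch does not attempt.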
The M7 Edition makes HBase easy, dependable and fast. M7 not only delivers enterprise-grade features such as instant recovery, snapshots and mirroring but also provides consistent performance while eliminating architectural complexity.
The M7 Edition offers a patent-pending architecture that provides a unified data platform for tables and files. It has no RegionServers, which eliminates complexity and unreliability, and it requires no manual administrative tasks such as table merges or splits. It also offers instant recovery from failures, full data protection and disaster recovery, consistent performance with no compactions, high throughput with low latency and unprecedented scale.
One of the core benefits of M7 is making HBase enterprise grade with instant recovery from hardware and software failures, disaster recovery and full data protection with snapshots and mirroring. Even with multiple hardware or software outages and errors, applications will continue running without any administrator actions required.
M7 increases the performance of HBase to unprecedented levels. First, by eliminating the need for compactions, M7 provides uniform and consistent performance. Second, by utilizing innovative data structures that minimize the read- and write-amplification factor, inserts and updates are much faster. M7 also supports in-memory columns, providing more options to increase database performance.
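For readers unfamiliar with the term, write amplification is the ratio of bytes physically written to storage to bytes the application logically wrote. The sketch below is a deliberately simplified toy model, not a description of how HBase or M7 is implemented. The function name, its parameters and the merge rule (rewrite everything once a fixed number of files accumulates) are assumptions chosen for illustration. It shows why a store that periodically rewrites its files in compactions writes each byte more than once.

```python
def write_amplification(num_flushes, flush_size, merge_threshold):
    """Toy model (hypothetical): each flush writes one file; once
    merge_threshold files exist, all are rewritten into a single file."""
    files = []      # sizes of files currently on disk
    physical = 0    # total bytes written to disk, including rewrites
    for _ in range(num_flushes):
        files.append(flush_size)
        physical += flush_size          # the flush itself
        if len(files) >= merge_threshold:
            merged = sum(files)
            physical += merged          # a compaction rewrites every byte
            files = [merged]
    logical = num_flushes * flush_size  # bytes the application wrote
    return physical / logical


# With no compaction triggered, every byte is written exactly once.
print(write_amplification(num_flushes=4, flush_size=10, merge_threshold=100))  # prints 1.0
# With a merge every time three files accumulate, bytes are rewritten repeatedly.
print(write_amplification(num_flushes=9, flush_size=1, merge_threshold=3))
```

In this toy model the second call writes 33 units to disk for 9 units of application data. The merge work also arrives in bursts, which is one intuition for why removing compactions can make performance more uniform.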
“M7 is taking Hadoop and HBase to the next level,” said Jan Gelin, vice president of Technical Operations, Rubicon Project, a real-time advertising platform that was recently named number one in advertising reach by comScore. “The enterprise-grade capabilities of M7 give us a more complete platform and the ability to do new things with data.”
“The Evaluator Group believes that enterprise Hadoop users will evaluate Hadoop and its supporting infrastructure using the same criteria that they would apply to other production data center-resident applications they are responsible for,” said John Webster, senior partner, Evaluator Group. “MapR’s support for an enterprise grade version of HBase responds to these demands with automated stateful failover, instant recovery and full data protection against user and application errors.”
Registration to participate in the MapR M7 beta program is now open.
Edited by Rachel Ramsey