Hadoop Channel Hadoop Channel Hadoop Channel

Hadoop Featured Articles

Welcome to the Hadoop channel! Apache Hadoop is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers. With data growing so rapidly and the rise of unstructured data accounting for 90 percent of the data today, the time has come for enterprises to reevaluate their approach to data storage, management and analytics. Hadoop was initially inspired by papers published by Google outlining its approach to handling an avalanche of data, and has since become the de facto standard for storing, processing and analyzing hundreds of terabytes, and even petabytes of data. Stay tuned to the Hadoop channel for the latest resources, industry trends and news.

Hadoop and Tweets - What It's Like to Analyze Twitter's Big Data

Twitter has become a platform for massive amounts of real-time updates for events, news, trends and stories. More than 400 million tweets are sent per day, and that number grows when there is breaking news or events - the recent Boston bombings are a testament to that.

Syncsort Provides a Smarter Approach to Hadoop ETL

Hadoop is a software framework that can be used to manage big data. It can also be used in the cloud to keep data flowing for companies and has the ability to produce a comprehensive report that incorporates data stored in countless records.

MapR Technologies and Concurrent Partner to Expand Hadoop in the Enterprise

As enterprises today utilize cloud computing, social media, the Internet of Things, location-based services and mobile devices, they are struggling with how to manage, analyze and process this little thing called big data. To harness big data, enterprises are turning to Hadoop, an open-source framework that allows for distributed processing of large data sets across clusters of computers using simple programming models.

Hadoop's Sexy Side: 'We're Not There Yet'

Hadoop may be good at big data analytics, but right now it has a more pedestrian use for the majority users: storage and ETL (extract, transform, load).

Video Showcase

Video: MapR-Introducing M7



Making HBase™ Easy, Dependable and Fast - M7 is a New Big Data Platform that Brings Enterprise-Grade Reliability and Performance to HBase™.

Video: Why MapR: The Most Advanced Distribution for Hadoop



MapR Customers discuss the benefits of MapR's Distribution for Apache Hadoop.
title= title= title=

Featured White Papers

Hadoop Channel

Evaluating Hadoop in the Data Center

What will make Hadoop and enterprise data center-grade analytics platform?


Hadoop Channel

Learn how MapR makes Hadoop Easy, Dependable and Fast.

High Availability in the Hadoop Ecosystem: MapR provides high availability with no single points of failure across the entire stack.


Hadoop Channel

High Availability in the Hadoop Ecosystem

The MapR Distribution for Apache™ Hadoop® provides high availability with no single points of failure across the entire stack. In the storage layer, MapR's Distributed NameNode HA™ architecture provides high availability with self-healing and support for multiple, simultaneous failures, with no additional hardware whatsoever.

Featured Datasheets

Hadoop Channel

MapR: M7 Edition

MapR M7 Edition is a complete distribution for Apache Hadoop and HBase™ that includes Pig, Hive, Mahout, Cascading, Sqoop, Flume and more. The M7 Edition makes HBase™ easy, dependable and fast. M7 not only delivers enterprise grade features such as Instant Recovery, Snapshots and Mirroring but also provides consistent performance while eliminating architectural complexity.


Hadoop Channel

MapR: M5 Edition

Subscription software offering that includes features such as mirroring, snapshots, NFS HA, data placement control, and many more. The M5 Edition also offers full support, on-demand patches and online incident submission.