SQL Layers

Stinger, Panthera, Impala, Drill, Pivotal

Stinger, Panthera, Impala, Drill, Pivotal

Hortonworks is 100% OS. Deep integration. Analytical DBMS, in memo, 3 for Hadoop, stream; all just like Cloudera but Horton offer a free desktop Hadoop sandbox. Slower release schedules choosing to only release when products are fully vetted by community. Fewer bugs, lower risk from conservative approach, but does have a couple of drawbacks …

Strategic partnerships with Teradata, SAP, Red Hat, Rackspace and Microsoft. Customers include AT&T, Bloomberg, and Cardinal Health.

MapR holds the third place slot in market share behind Cloudera and Hortonworks. Gives speed and reliability to the slow Hadoop. Not totally OS but some balance of proprietary and OS which actually provides some advantages, in the form of readymade capabilities that are somewhat lacking in Hortonworks and Cloudera. These include an optimized metadata management feature with strong distributed performance and protection from single point of failure; full support for random write processing; and a stable, node based job management system (MapR 2014).

MapR offers three distributions and a long list of integrations for big data applications including Hive, Stinger, Tez, Drill, Impala and Shark for SQL access; and Pig, Oozie, Storm, Zookeeper, Sqoop, Whirr, Spark, Flume and Mahout for just about any other capability users require.

EMC Pivotal is a product line resulting from the combination of VMware Cetas, Cloud Foundry, Gemfire, EMC Greenplum and Pivotal Labs plus $100 million investment from General Electric. Pivotal’s analysis technique moves the processing to the DB, similar to SAS and Alteryx.

Greenplum is an analytic DB that is part of the EMC Pivotal product line. Pivotal HD Community Edition provides integration of HBase, HDFS, Hive, MapReduce, SAS and Zookeeper. Two other Pivotal products Gemfire and Hawq (an acronym for Hadoop with query) combine with HD to form an in memory analysis Hadoop distribution.