Dell, EMC, Dell Technologies, Cisco,

Friday, December 18, 2015

How Businesses Can Make Sense of the Big Data Market

Businesses don't get any points for how efficiently their infrastructure runs or how high they can stack all the Big Data they collect. What does count is the quality of the analytics and intelligence that data produces.

Over the last several years, #Hadoop is the word that's become most synonymous with ingesting, processing, and transforming data. This open-source framework for distributed data storage and processing has spawned its own enterprise space and integrated its way into all the major cloud platforms. Hadoop is far from the only #BigData technology worth talking about, but it's become the one on which many others are built.

The problem for businesses is the Hadoop space is full of distributions and tooling options, and as Gartner Research Director Nick Heudecker explained, many of them look the same. Heudecker, whose research covers information management including the Big Data and NoSQL spaces, said if you're looking at the general data processing options, a lot of vendors offer very similar features.

Breaking Down the Market
There are three main pure-playHadoop start-ups—#Cloudera, #Hortonworks, and #MapR—and they've all grown steadily in 2015. According to Gartner, each has approximately 700 customers, give or take 10 percent, putting the global market between 2,100-2,400 Hadoop customers worldwide. All three offer both a free tier and an enterprise tier of their Hadoop distribution, and each makes significant open-source contributions to projects under the #Apache Software Foundation (ASF) banner.

"Our data indicates that 44 percent of Hadoop use is currently unpaid," said Heudecker. "Is there a clear leader? I don't think so. They're all grabbing market share because it's a very new space."

In the last few months, much of the competition between the three has come down to competition over data analytics capabilities and creative ways of integrating Apache Spark, an open-source Big Data processing engine with use cases from real-time data streams to machine learning. MapR recently announced MapR Streams as part of a "converged data platform" integrating Hadoop, Spark-based stream processing, and analytics. Hortonworks rolled out an update to the Hortonworks Data Platform (HDP) with in-memory Spark analytics, and Cloudera offers a variety of open-source Spark integrations through its One Platform Initiative, along with offering Spark training classes.


http://www.pcmag.com/article2/0,2817,2496736,00.asp

No comments:

Post a Comment