January 2020 Hadoop Apache Project Tracker


Until 2018, I published comparisons of the supported versions of components in the multiple available Hadoop distributions. Those older posts showed that the ebb and flow of new additions to the typical stack supported by “most” distributors had slowed to a halt after everyone (mostly) added Kafka. The relative version currency was fairly stable, with Hortonworks typically first to market with many of the newest Apache versions, and Cloudera often close behind and leading on some projects where it was more dominant on the project committee.

But as mergers, acquisitions, and the rise of the cloud-platform-as-Hadoop-provider dynamic played out over the past 2 years, I got away from the regular tracking cycle. This seems like a good time to revisit the question “Who supports what?”, since so many people seem to think “Hadoop” is HDFS and therefore “going away, because cloud object stores.” The players I include in this visit are AWS, Cloudera, Google and HPE. All of them support 9 pieces: Apache HDFS, MapReduce, YARN, Hive, Pig, Spark, Sqoop, Tez and ZooKeeper.

All but Google also support Apache HBase, Mahout and Oozie, as well as Hue. There is a nuance here: Google Cloud Dataproc allows you to perform initialization actions to add components (indicated on the table with “IA”) and offers scripts for dozens of installable components, but cautions that “the initialization actions provided in this repository are provided without support and you use them at your own risk.” The same applies to the various projects themselves; similar mechanisms apply to AWS, and of course Microsoft Azure, whose HDInsight begins with the Cloudera distribution, adds its own pieces and also permits you to add yours. Zeppelin is listed as “soon” for Cloudera, which would move it into the “3 supporters” category as well, but for now it appears below.
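For readers who want to keep their own running tally, the grouping by number of supporters can be sketched in a few lines of Python. This is only an illustrative sketch: it encodes just the support facts stated so far in this post (the 9 pieces supported by all four vendors, and the four supported by all but Google), and the names `build_support_matrix` and `group_by_supporter_count` are hypothetical helpers, not part of any vendor tool.

```python
from collections import defaultdict

VENDORS = ("AWS", "Cloudera", "Google", "HPE")

# Only the support facts stated in the post are encoded here.
ALL_FOUR = ["HDFS", "MapReduce", "YARN", "Hive", "Pig",
            "Spark", "Sqoop", "Tez", "ZooKeeper"]
ALL_BUT_GOOGLE = ["HBase", "Mahout", "Oozie", "Hue"]


def build_support_matrix():
    """Map each component to the set of vendors that support it."""
    matrix = {}
    for component in ALL_FOUR:
        matrix[component] = set(VENDORS)
    for component in ALL_BUT_GOOGLE:
        matrix[component] = set(VENDORS) - {"Google"}
    return matrix


def group_by_supporter_count(matrix):
    """Group component names by how many vendors support them."""
    groups = defaultdict(list)
    for component, vendors in matrix.items():
        groups[len(vendors)].append(component)
    return {count: sorted(names) for count, names in groups.items()}


if __name__ == "__main__":
    groups = group_by_supporter_count(build_support_matrix())
    for count in sorted(groups, reverse=True):
        print(f"{count} supporters: {', '.join(groups[count])}")
```

Components added through Dataproc initialization actions are deliberately not counted as Google-supported here, in line with the caution quoted above; a fuller version might track them as a separate status.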


Apache Flume, Impala, Kafka, Phoenix, Presto, Sentry, Storm and Zeppelin are supported by two vendors apiece. Note that some of these are used infrequently and some will be deprecated soon, but continue to get support because many users have them in their stacks.


There are many additional pieces supported by only one of these vendors: Apache Accumulo, Ambari, Atlas, Avro, Crunch, Drill, Druid, Flink, Knox, Kudu, Livy, Lucene, Myriad, NiFi, Ozone, Parquet, Ranger, Solr, TensorFlow and others. Many more are not directly supported by any of them; we’ll save that for the next post. Meanwhile, this is a blog post, not the result of a lengthy review process, and it is likely to have a few things that need updating or correcting. Please let me know what you spot.


Sourced from: Gartner Blog.
View the original article here.
