TrustRadius: an HG Insights company

Hortonworks Data Platform

Score5 out of 10

37 Reviews and Ratings

What is Hortonworks Data Platform?

Hortonworks Data Platform (HDP) is an open source framework for distributed storage and processing of large, multi-source data sets. HDP modernizes IT infrastructure and keeps data secure—in the cloud or on-premises—while helping to drive new revenue streams, improve customer experience, and control costs. Hortonworks merged with Cloudera in eary 2019.

Categories & Use Cases

Experience with Hortonworks data Platform

Pros

  • Monitoring console
  • Spark data processing
  • UI application
  • Acl with ranger in service and hdfs data

Cons

  • Upgrade service version require OS competence
  • Search features like cloudera search

Return on Investment

  • 3 projects in production environemts
  • Use to check distributed algorithm performance without cloud costs

Other Software Used

Cloudera Data Platform, Azure Data Factory, Apache Kafka, Confluent Platform

Hortonworks: based in the open, close to success

Pros

  • Hortonworks two main pillars are HDP (Hortonworks Data Platform) and HDP (Hortonworks Data Flow). The former applies to the infrastructure required for building and deploying a data lake, and the latter is about ingestion, in batch or realtime.
  • Both HDP and HDF rely entirely on opensource projects, this is a distinctive point about Hortonworks.
  • In the last year new improvements like Data Plane and Stream Analytics Manager (SAM) take HDP and HDF several steps further into management and governance.

Cons

  • As an open source project collection, it relies strongly on community activity. You still have the option to contract premium consulting or training services.
  • Altough it is quickly evolving into Data Science tools availability (eg. Tensorflow incorporate in HDP 3), it can be cumbersome from a developer transitioning from a traditional IDE, into the notebook vs. datalake metaphore.
  • As expected for a big data infranstructure, the resource requirements base line is rather high. This means that if used on premise, you need to think of about 10 machines for a minimal reasonable deploy.

Return on Investment

  • It is difficult to have a negative impact, because the required investment is not that high.
  • The big open community behind Hortonworks and related Apache Project makes it easy to put 'the wheel to meet the road' quite quickly.
  • We have seen management meetings where the attendants were impressed by the results achieved with the datalake built on HDP.

Other Software Used

KNIME Analytics Platform, Cloudera Manager, Apache Spark

Hortonworks is at the leading edge

Pros

  • Security with Ranger. Cell level security. Integration with Kerberos and LDAP.
  • Ambari management console. UI and API that allows complete management of the platform.
  • PLUS components have metadata, streaming, monitoring and other tools that flesh out the offering.

Cons

  • Installation is highly complex and needs to be streamlined.
  • The platform itself needs to be a little more stable. It might help to actually slow down releases to make sure they are more stable.
  • Upgrading from lower versions should be easier.

Return on Investment

  • Positive ROI for streaming use cases.
  • Negative impact on staffing, it will require 1-2 FTEs more than Cloudera or a BI operations team.
  • Positive ROI for Ambari and the ELK stack. You will get great capabilities for IT Operations.

Alternatives Considered

Cloudera Enterprise

Other Software Used

Tableau Server, Oracle Database, Attunity Managed File Transfer

Great Partner for Apache Hadoop

Pros

  • Knowledge base of the original committers
  • SmartSense of HDP is a great way of understanding the problems before they happen
  • Integrates very well with Apache Phoenix (SQL on Hadoop)
  • Very good support team
  • Provides more features for the money

Cons

  • Licensing cost is high when compared to other distribution partners
  • VM setup - It's not as good as what Cloudera provides
  • Monitoring isn't that great. Ambari Management interface on HDP is just a basic one and does not have many rich features
  • Version upgrades are more challenging than anticipated. Each upgrade has it's own quirks and compatibility issues that need to be resolved manually

Return on Investment

  • Improved data analysis (especially for our size of data) time by over 150%
  • Increased revenue by 60% by better predicting customer needs/way they interact with the system
  • Reduced developer time in crunching numbers significantly

Other Software Used

Teradata Enterprise Data Warehousing, Apache Solr, Apache Hive

Hortonworks HDP makes Hadoop easier

Pros

  • HDP keeps up-to pace with the Apache Hadoop.
  • HDP's Ambari is intuitive and easy to use.
  • HDP remains similar to most open source tools and hence makes the learning curve gentler.

Cons

  • The HDP community is relatively new and could get better.
  • Tools like Falcon which are not so much used in general tend to remain dormant in the HDP when it comes to development.

Return on Investment

  • We have developed a strong partnership with Hortonworks.

Other Software Used

Tableau Desktop, Cloudera Enterprise, MapR