TrustRadius Insights for Cloudera Enterprise Data Hub are summaries of user sentiment data from TrustRadius reviews and, when necessary, third party data sources.
Pros
Excellent Management Capabilities: Many users have praised Cloudera Manager for its excellent management capabilities, making it easy to effectively manage a Hadoop cluster. They appreciate the user-friendly interface and robust features that allow them to efficiently monitor and control their cluster's performance, resource allocation, and configuration.
Comprehensive Solution: Several reviewers appreciate that Cloudera provides all the necessary components to build a data hub, making it a comprehensive solution for their needs. With Cloudera, they can seamlessly integrate different tools and technologies required for data processing, storage, and analysis. This eliminates the need for piecemeal solutions and simplifies the overall setup and maintenance of their infrastructure.
Data Discovery and Automation: Users have mentioned that Cloudera helps them discover data across workloads and enables them to automate workflows. Some users specifically mention customer experience and marketing automation as areas where Cloudera excels in automating processes. They highlight how these capabilities enhance efficiency by reducing manual efforts in data exploration and streamlining repetitive tasks. This empowers businesses to harness the full potential of their data while saving time and resources.
Ever since I joined my current organization Cloudera Enterprise Data Hub has been a vital solution for data visualization. It offers analytics tools ranging from database, machine learning, data warehouse, and processing in one place.
Pros
It is an integrated suite with all analytics engines in one place.
Help discover data across workloads.
Grow business by automating workflow: customer experience and marketing automation.
Protect business through risk modeling and analysis.
Cons
For newbies, it can be challenging to customize some features since there’s not enough documentation.
Improve on version control.
Likelihood to Recommend
Cloudera Enterprise Data Hub offers the best data set in one solution for real-time data analytics and reporting. I would highly recommend it since it makes management of clusters much easier than other alternatives which are more complex.
Cloudera is useful for working with data center architects to optimize
the effect of data and uncover its hidden value.
Cloudera offers solutions to formalize the enterprise standard database,
including data governance, management, and security operations.
Cloudera is versatile in terms of combining data management and analytics
throughout the lifecycle to collect data where it's needed.
Cloudera's adaptability extends to a wide range of scenarios, including
security, compliance, migration, and metadata management.
By detecting anomalies and reducing cyber dangers, Cloudera helps
research customer behavior, churn reduction, and business concerns.
Pros
Cloudera enables the use of options to automate, build, and deploy machine learning and artificial intelligence.
Cloudera helps build the self-service machine learning workflow by providing technologies that make data science easy.
Cons
Some of Cloudera's documentation is not accurate or informative enough for newcomers.
In terms of membership pricing, I would like Cloudera to be more profitable.
Likelihood to Recommend
Cloudera excels at seamless migrations and upgrades.
Cloudera supports self-healing and data center
replacement of failed cloud instances while maintaining the state.
Cloudera is essential to increase or decrease
capacity through the user interface or API.
Cloudera is great at simplifying big data analytics
by providing the technology and tools needed to gain insights from IoT and
connected devices to help monitor and condition our assets.
Cloudera's cybersecurity platform option offers
stronger anomaly detection, visibility, and prevention, as well as faster
behavioral analysis.
Cloudera is beneficial for enabling and utilizing
the platform's machine learning and ad-hoc queries while securely storing,
retrieving, and analyzing any volume of data at scale.
Useful for creating, channeling, and visualizing
data from all sectors of the business.
It facilitates data collection, campaign
management, audience segmentation, predictive analytics, and data cleansing.
Supports high-volume processing, data storage,
and data visualization.
It helps teams automate complicated data
pipelines and build data-driven applications while ensuring governance,
security, and control throughout the data lifecycle.
Pros
Cloudera excels at consolidating existing data clusters without interfering with application performance.
Cloudera simplifies operations by providing separate compute containers that can be individually configured or upgraded based on the needs of each department in a common cluster.
Cons
Cloudera membership fees are expensive; I think they could be more profitable than competing technologies.
Likelihood to Recommend
Cloudera is critical for constructing an organizational data center
while maximizing the value of that volume of data.
Cloudera is great for comprehending data and querying for valuable
replies.
Cloudera supports data transfer from a variety of external databases and
third-party platforms.
We used Cloudera Enterprise for a data lake in order to generate value from big data. Big data stored in the data lake is used for different data science projects like churn prediction, recommendation engines etc.
Pros
Stability
Support
Documentation
Cons
More Up to date technologies(e.g. Spark 2.2)
Data Science Workbench improvement
Data quality tools
Likelihood to Recommend
CDH is not a replacement of RDBMS, we should think it as complementary technologies to DWH systems. CDH can be used for data lake and reduces costs too much.
VU
Verified User
Engineer in Information Technology (10,001+ employees)
We used Cloudera to implement our big data service platform. Our team manages a central Hadoop platform where various departments were subscribed in a pay as you go model where they used it to solve various BI and analytical problems. On a day to day basis application teams used it for running Mapreduce/Tez/Spark jobs against their data set which is stored in HDFS.
Pros
One of the oldest distributors of enterprise standard Hadoop.
Distribution is based on open source Hadoop even though customizations are done on top of that.
Faster updates and bug fixes to the products as they have Apache committers.
Central configuration and control of your Hadoop platform (but still needs improvements).
Cons
Not fully Open Source, couple of components of the distributions are privately owned, meaning with public contributions are not welcome
Improvements to Cloudera manager can only be recommended. its very hard to get it done once recommended as the full control is with them.
Should make components more aligned to Open Source rather than making it closed sourced.
Custom Features of open source software tools supported only by Cloudera are tricky. Cant commit changes to tools like Hue.
Improvements to Cluster Management tool is required, which are already available to its competitors.
Likelihood to Recommend
I personally started my big data career by learning Cloudera Distribution and obtaining certification years ago. Comparatively, it was the only distribution worth mentioning back then, but competitors may have outgrown them as of now, the rate at which Cloudera advances in terms of quality of the product and services could be at a lower pace than they used to be. Cluster Management Interface used to be of high quality. Cloudera never supported distribution on a Windows platform where competitors do. I am not a big fan of privately owned repos mixed with open source tools. I will try to stay away from those as that may create a dependency later on.
VU
Verified User
Engineer in Information Technology (501-1000 employees)
The ability to monitor and manage numerous Hadoop clusters from one tool. It is an easy way to learn and use REST API and Python client library. It is also easy to use the graphical interface to manage the health of our clusters. It also makes the install process on numerous nodes easy.
Pros
Massive amounts of data consumption. It is fairly easy to implement and understand how to manage a Hadoop cluster.
Cloudera makes the management of a cluster much easier than installing and managing it with the pure Apache version.
Cons
Lots of documentation that is easy to find if you know exactly the right version of what you are looking for. Lots of "knob" turning.
Likelihood to Recommend
It is useful for monitoring and managing numerous clusters.
VU
Verified User
Analyst in Information Technology (1001-5000 employees)
I used the Cloudera Enterprise in my previous company wherein I had to used the Cloudera Impala product. I have also used Cloudera Hadoop, Hive, Sqoop, Pig, Mahout etc. Since my project had requirement for trying different tools available in market, we were doing research on which tool is the best, thats where we landed on Cloudera Enterprise products. It is used only in my department. To address the shareholder data in files from different suppliers which needs to be processed via MapReduce framework.
Pros
Wide range of Products available on Cloudera which runs on MapReduce framework.
Easy to install on Virtual Machines. All the products are available on the website and steps for installation are easily available and well documented with examples and case scenarios.
All the products with latest versions are available which can be easily installed via yum.
Cloudera Impala which the fastest way to query the HDFS without using MapReduce framework. This is freely available on their website while with Amazon EC2 machines and Windows Azure you have to shell some money for using their machine.
Cons
Some of the previous versions have compatibility issues with latest CentOS Virtual machines. So you have to take care of the Cloudera product's versions as well as CentOS version.
Compatibility issues with installation on Mac. Everything is Linux based so user has to have good knowledge of Linux commands.
GUI's are missing
I ended up spending time on matching the exact Cloudera product version with CentOS version. So had invested a lot of time in installtion/uninstallation of these products until u match/find the exact version. I had issues while installing Impala maybe it was new when i had a chance to work on Cloudera Impala.
Likelihood to Recommend
It is well suited when you locally want all the Cloudera products (MapReduce/Hive/Pig/Impala/Sqoop etc) on your machine. The best thing it is free. Though Amazon EC2 and Windows Azure Virtual machines which have these products are installed are more friendly but you have to shell a lot of money on hourly basis.
VU
Verified User
Engineer in Information Technology (10,001+ employees)
I first worked with Hadoop before there was Cloudera Enterprise. I remember quite well editing files across systems and devising methods to watch for changes, the pain of working out user access privileges, and the search for a sane package management. Cloudera Manager provides tools for all of this and more. The BI engineering team bootstrapped the Hadoop ecosystem, and I have just recently come in to support expansion. Cloudera Enterprise is great, and I appreciate supporting a company like Cloudera which is firmly rooted in the community.
Pros
Gives a view of the entire ecosystem
Enables the management of a cluster as a cluster, not individual servers
Eliminates the possibility of manual error executing complex administration tasks
Cons
I have no suggestions...yet.
Likelihood to Recommend
A really small, sandbox installation might be better off without Cloudera Enterprise at the start. If you subsequently move to Cloudera Enterprise, the benefits will be very obvious.
Quaero’s data management
platform (QDMP) and its AdVantage platform are built upon Cloudera’s Distribution of Hadoop (Cloudera Enterprise). The AdVantage platform is targeted for clients in the media industry to better
understand their audience, enhance engagement, create richer experiences, and
increase overall audience value. Quaero has deployed
the platform across several clients in the media industry (ranging in ingestion
volume from ~3MM to ~1.5 Billion records
per day).
Pros
Excellent management capabilities via Cloudera Manager.
Open source and does not restrict our data to be bound by a proprietary format.
Offers excellent support for data governance and auditing.
Has all the components that would help us build a data hub.
Excellent platform support offered by Cloudera.
Cons
More flexible pricing of the support subscription.
Looking forward to the integration of Oozie and Impala
Likelihood to Recommend
Cloudera Enterprise offers the right mix of components to build a robust data platform which supports both reporting and analytics which can deal with all sorts of data.