Databricks Data Intelligence Platform vs. IBM StreamSets

Databricks Data Intelligence Platform

Databricks Data Intelligence Platform

90 Reviews and Ratings

IBM StreamSets

IBM StreamSets

8 Reviews and Ratings

Overview
Product	Rating	Most Used By	Product Summary	Starting Price
Databricks Data Intelligence Platform	Score 8.5 out of 10	N/A	Databricks in San Francisco offers the Databricks Lakehouse Platform (formerly the Unified Analytics Platform), a data science platform and Apache Spark cluster manager. The Databricks Unified Data Service aims to provide a reliable and scalable platform for data pipelines, data lakes, and data platforms. Users can manage full data journey, to ingest, process, store, and expose data throughout an organization. Its Data Science Workspace is a collaborative environment for practitioners to run…	$0.07 Per DBU
IBM StreamSets	Score 8.2 out of 10	N/A	IBM® StreamSets enables users to create and manage smart streaming data pipelines through a graphical interface, facilitating data integration across hybrid and multicloud environments. IBM StreamSets can support millions of data pipelines for analytics, applications and hybrid integration.	N/A

Pricing

Databricks Data Intelligence Platform

IBM StreamSets

Editions & Modules

Standard: $0.07
Per DBU
Premium: $0.10
Per DBU
Enterprise: $0.13
Per DBU

No answers on this topic

Offerings

Pricing Offerings
Databricks Data Intelligence Platform	IBM StreamSets
Free Trial
No	No
Free/Freemium Version
No	No
Premium Consulting/Integration Services
No	No

Entry-level Setup Fee

No setup fee

No setup fee

Additional Details

—

—

More Pricing Information

Community Pulse
	Databricks Data Intelligence Platform	IBM StreamSets
Considered Both Products	Databricks Data Intelligence Platform Austin Franchino Senior Data and Security Engineer Chose Databricks Data Intelligence Platform Databricks is a true all-in-one platform, and at the time of implementation, it had more features available to us, making it a clear choice over Snowflake. Moving our workloads from local computing to the servers in Databricks gave our start-up staff a great quality of life … Incentivized Helpful? Axel Richier Tech Lead Data Engineer Chose Databricks Data Intelligence Platform Compared to Synapse & Snowflake, Databricks provides a much better development experience, and deeper configuration capabilities. It works out-of-the-box but still allows you intricate customisation of the environment. I find Databricks very flexible and resilient at the same … Incentivized Helpful? Verified User Anonymous Chose Databricks Data Intelligence Platform NA Incentivized Helpful? Verified User Anonymous Chose Databricks Data Intelligence Platform The most important differentiating factor for Databricks Lakehouse Platform from these other platforms is support for ACID transactions and the time travel feature. Also, native integration with managed MLflow is a plus. EMR, Cloudera, and Hortonworks are not as optimized when … Incentivized Helpful? Verified User Anonymous Chose Databricks Data Intelligence Platform Databricks has a much better edge than Synapse in hundred different ways. Databricks has Photon engine, faster available release in cloud and databricks does not run on Open source spark version so better optimization, better performance and better agility and all kind of … Incentivized Helpful? GM Gilrod Maerina Analyst Chose Databricks Data Intelligence Platform Databricks [Lakehouse Platform (Unified Analytics Platform)] can work with all data types in their original format while Snowflake requires additional structures to fit the data before loading it. Databricks is open source so potential is far greater. Incentivized Helpful? Surendranatha Reddy Chappidi Senior Data Engineer Chose Databricks Data Intelligence Platform Databricks provides support for CURD operations by introducing Delta Lake file format. Cloudera doesn't have support for the same. Incentivized Helpful? Verified User Anonymous Chose Databricks Data Intelligence Platform Databricks was picked among other competitors. Closest competition in our organization was H2O.ai and Databricks came out to be more useful for ROI and time to market in our internal research. We could have used AWS products, however Databricks notebooks and ability to launch … Incentivized Helpful? Verified User Anonymous Chose Databricks Data Intelligence Platform When we started using it, only the notebook experience was mature. However, DB was very helpful giving us direct support to get onto their platform. Really there was little in the way to compare to them at the time. AWS has services but not the same low-cost angle. Incentivized Helpful? Ann Le Freelance Translator Chose Databricks Data Intelligence Platform I also use Microsoft Azure Machine Learning in parallel with Databricks. They use different file formats which teach me to be flexible and able to write different programs. They are equally useful to me and I would like to master both platforms for any future usage. I do prefer … Incentivized Helpful? Verified User Anonymous Chose Databricks Data Intelligence Platform Easier to set up and get started. Less of a learning curve. Incentivized Helpful?	IBM StreamSets Verified User Anonymous Chose IBM StreamSets First advantage is that this software is particularly new and it keeps updating according to the needs of the user. Other advantage is the it organises and produces conclusions on the basis of data without leaving any relevant information. Other softwares lack in data … Incentivized Helpful? Verified User Anonymous Chose IBM StreamSets Before, we were using Informatica since most of our applications were running on on-prem servers. Later, when we started moving to the cloud, we tried Informatica Cloud, but it's more useful for batch-oriented than streaming. That's why one of our tech architects suggested IBM … Incentivized Helpful? Verified User Anonymous Chose IBM StreamSets the IBM solution can be considered a good player in the specific perimeter of application because its main functionalities are working well, are easy to use, and complete. it allows also a good degree of freedom when it comes to personalization of pipelines and streams, and … Incentivized Helpful? Verified User Anonymous Chose IBM StreamSets We chose IBM StreamSets because we used to own the product before selling it to IBM, so we have a tremendous amount of folks who are familiar with the product. Incentivized Helpful? Verified User Anonymous Chose IBM StreamSets IBM StreamSets works well when compared to some of the other tools in the same category. They are easy to set up, development can be fast paced as the in-built / out of the box connectors that come along with the product. Incentivized Helpful? AK Abhishek Katara Assistant Consultant Chose IBM StreamSets StreamSets is a one-stop solution to design Data engineering Pipelines and doesn't require deep Programming knowledge, It's so user-friendly that anyone in Team can contribute to the Idea of pipeline design. In Hadoop One has to be programming proficient to use its various … Incentivized Helpful?

Best Alternatives
	Databricks Data Intelligence Platform	IBM StreamSets
Small Businesses	No answers on this topic	Skyvia Score 9.9 out of 10
Medium-sized Companies	Snowflake Score 8.9 out of 10	Astera Data Pipeline Builder (Centerprise) Score 8.7 out of 10
Enterprises	Snowflake Score 8.9 out of 10	Control-M Score 9.3 out of 10
All Alternatives	View all alternatives	View all alternatives

User Ratings
	Databricks Data Intelligence Platform	IBM StreamSets
Likelihood to Recommend	10.0 (0 ratings)	9.0 (0 ratings)
Usability	10.0 (0 ratings)	- (0 ratings)
Support Rating	8.7 (0 ratings)	- (0 ratings)

User Testimonials
	Databricks Data Intelligence Platform	IBM StreamSets
Likelihood to Recommend	If you need a managed big data megastore, which has native integration with highly optimized Apache Spark Engine and native integration with MLflow, go for Databricks Lakehouse Platform. The Databricks Lakehouse Platform is a breeze to use and analytics capabilities are supported out of the box. You will find it a bit difficult to manage code in notebooks but you will get used to it soon. Incentivized Verified User Anonymous Read full review	Because real-world sources often change (new fields get added, formats get tweaked, etc.), StreamSets helps detect and adapt to those "schema drifts" or changes automatically, or with minimal manual intervention. That makes pipelines more resilient and significantly reduces the maintenance burden. Therefore, data sets with constantly changing sources/formats are great for StreamSets. Incentivized Verified User Anonymous Read full review
Pros	There is databricks community, which is a free version. It is available for beginners to have an easy start with a big data platform. It does not have every feature of the full version but is still adequate for extremely new coders. There are many resourceful training elements that are available to developers, data scientists, data engineers and other IT professionals to learn Apache Spark. Incentivized Ann Le Freelance Translator Read full review	it connects to many data sources and helps catch issues early with built-in alerts and monitoring tools it supports real-time and batch processing, handles data drift well, and makes pipeline debugging easier with the updated UI Incentivized AS Arseniy Seroka CEO Read full review
Cons	Connect my local code in Visual code to my Databricks Lakehouse Platform cluster so I can run the code on the cluster. The old databricks-connect approach has many bugs and is hard to set up. The new Databricks Lakehouse Platform extension on Visual Code, doesn't allow the developers to debug their code line by line (only we can run the code). Maybe have a specific Databricks Lakehouse Platform IDE that can be used by Databricks Lakehouse Platform users to develop locally. Visualization in MLFLOW experiment can be enhanced Incentivized Verified User Anonymous Read full review	Where the person's skillsets in data analysis is not of an expert. Data monitoring and analysis. Customer data for better customer acquisition Incentivized Verified User Anonymous Read full review
Likelihood to Renew	No answers on this topic	IBM Stream sets has been a wonderful addition to our technology stack. It has helped in some of our initiatives such as data engineering, data integration for not only external customers but also for internal purposes. The tool has also helped on our use cases related to streaming data. Moving to another tool would require significant amount of work and time. Incentivized Verified User Anonymous Read full review
Usability	Because it is an amazing platform for designing experiments and delivering a deep dive analysis that requires execution of highly complex queries, as well as it allows to share the information and insights across the company with their shared workspaces, while keeping it secured. in terms of graph generation and interaction it could improve their UI and UX Incentivized Verified User Anonymous Read full review	because i think that overall the solution is having a positive impact on the business, it allows multiple benefits in simplification of the tasks and is capable of doing multiple process that are usually done by a combination of man and systems, reducing the time and effort required to have the data. Incentivized Verified User Anonymous Read full review
Support Rating	One of the best customer and technology support that I have ever experienced in my career. You pay for what you get and you get the Rolls Royce. It reminds me of the customer support of SAS in the 2000s when the tools were reaching some limits and their engineer wanted to know more about what we were doing, long before "data science" was even a name. Databricks truly embraces the partnership with their customer and help them on any given challenge. Jonatan Bouchard Director Data Science Read full review	Streamsets support has improved a lot in the last couple of years. We had some challenges in the beginning with support, but now the quality of the support and the responsiveness to tickets are better. We have contacted support multiple times when it came to scenarios where the system was slow or the output as not as we expected Incentivized Verified User Anonymous Read full review
Implementation Rating	No answers on this topic	I was not involved in the implementation Incentivized Verified User Anonymous Read full review
Alternatives Considered	Databricks is a true all-in-one platform, and at the time of implementation, it had more features available to us, making it a clear choice over Snowflake. Moving our workloads from local computing to the servers in Databricks gave our start-up staff a great quality of life boost. Incentivized Austin Franchino Senior Data and Security Engineer Read full review	Before, we were using Informatica since most of our applications were running on on-prem servers. Later, when we started moving to the cloud, we tried Informatica Cloud, but it's more useful for batch-oriented than streaming. That's why one of our tech architects suggested IBM StreamSets for our real-time data streaming. During the POC stage, we were happy that the data streaming was way better with IBM StreamSets compared to the Informatica Cloud way of doing. Incentivized Verified User Anonymous Read full review
Return on Investment	ROI for us has been tremendous. Time to market by processing raw data in our big data infrastructure has been pretty fast. Non engineers can easily use Databricks, hence helping business customers. Thousands of different data combinations can easily be joined and used by our data teams. Incentivized Verified User Anonymous Read full review	Reduced manual handling, cutting down operational costs for our team. It also accelerated our time to Insight, which has eventually led to faster decision making. Data quality is improved. Incentivized Sarthak Chopra Specialist Read full review
ScreenShots