One Stop Shop for Data Professionals.
Use Cases and Deployment Scope
Databricks is the primary data platform where we land, standardize, clean, transform, and clean our data sources. We utilize the Workflows feature to automate reoccurring tasks and have built internal applications around the reusable workflows. We use the dashboard feature internally to allow customer success teams and business analysts to keep tabs on the performance and outputs of our products. The workloads are orchestrated in Databricks but executed within our own AWS accounts, allowing us to stay compliant with our stringent security requirements.
Pros
- Thoughtful application of AI assistants during the coding and analysis steps.
- Intuitive UI for users of varying skill sets.
- Frequently updated documentation.
Cons
- Greater support for non spark workloads.
- Ability to host JAR files on serverless endpoints.
Return on Investment
- Greater democratization to data sources.
- Migration took a while, as we were largely a Pandas shop.
Usability
Alternatives Considered
Snowflake
Other Software Used
Notion, Datadog



