Databricks logo

Databricks

A unified analytics platform, powered by Apache Spark
225
374
+ 1
0

What is Databricks?

Databricks Unified Analytics Platform, from the original creators of Apache Spark™, unifies data science and engineering across the Machine Learning lifecycle from data preparation to experimentation and deployment of ML applications.
Databricks is a tool in the General Analytics category of a tech stack.

Who uses Databricks?

Companies
24 companies reportedly use Databricks in their tech stacks, including QuintoAndar, Core Banking, and www.autotrader.co.uk.

Developers
200 developers on StackShare have stated that they use Databricks.

Databricks Integrations

Kafka, TensorFlow, Apache Spark, Hadoop, and Keras are some of the popular tools that integrate with Databricks. Here's a list of all 13 tools that integrate with Databricks.
Decisions about Databricks

Here are some stack decisions, common use cases and reviews by companies and developers who chose Databricks in their tech stack.

Vamshi Krishna
Data Engineer at Tata Consultancy Services · | 4 upvotes · 61.4K views

I have to collect different data from multiple sources and store them in a single cloud location. Then perform cleaning and transforming using PySpark, and push the end results to other applications like reporting tools, etc. What would be the best solution? I can only think of Azure Data Factory + Databricks. Are there any alternatives to #AWS services + Databricks?

See more

Databricks's Features

  • Built on Apache Spark and optimized for performance
  • Reliable and Performant Data Lakes
  • Interactive Data Science and Collaboration
  • Data Pipelines and Workflow Automation
  • End-to-End Data Security and Compliance
  • Compatible with Common Tools in the Ecosystem
  • Unparalled Support by the Leading Committers of Apache Spark

Databricks Alternatives & Comparisons

What are some alternatives to Databricks?
Snowflake
Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS)—no infrastructure to manage and no knobs to turn.
Azure Databricks
Accelerate big data analytics and artificial intelligence (AI) solutions with Azure Databricks, a fast, easy and collaborative Apache Spark–based analytics service.
Domino
Use our cloud-hosted infrastructure to securely run your code on powerful hardware with a single command — without any changes to your code. If you have your own infrastructure, our Enterprise offering provides powerful, easy-to-use cluster management functionality behind your firewall.
Confluent
It is a data streaming platform based on Apache Kafka: a full-scale streaming platform, capable of not only publish-and-subscribe, but also the storage and processing of data within the stream
Apache Spark
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive queries, and machine learning.
See all alternatives

Databricks's Followers
374 developers follow Databricks to keep up with related blogs and decisions.