What is dbt?
dbt is a transformation workflow that lets teams deploy analytics code following software engineering best practices such as modularity, portability, CI/CD, and documentation. Anyone who knows SQL can use it to build production-grade data pipelines.
What is Apache Spark?
Spark is a fast, general-purpose processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or in Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed for both batch processing (similar to MapReduce) and newer workloads such as streaming, interactive queries, and machine learning.