It is an open-source data infrastructure to keep customer data in-sync between your data warehouse and 3rd party tools. | It is an orchestrator that's designed for developing and maintaining data assets, such as tables, data sets, machine learning models, and reports. |
Run it on your computer today or install it on your company’s servers. Your data stays private on your servers. No usage or storage limits;
Pull data from trusted sources like data warehouses instead of instrumenting Event streams;
Marketing, Sales, Customer Support and other teams are trying new tools all the time. Instead of building and maintaining all of these integrations, add our pre-built connectors;
Maintain control over your data and infrastructure so you’re never locked in. While a dedicated team is building and maintaining it, we welcome the developer community to contribute and improve it as well | Manage your data assets with code;
A single pane of glass for your data platform;
From pull request to production. Effortlessly;
Monitor runs across all your jobs in one place with the run timeline view |
Statistics | |
GitHub Stars 760 | GitHub Stars 14.3K |
GitHub Forks 120 | GitHub Forks 1.9K |
Stacks 3 | Stacks 29 |
Followers 6 | Followers 17 |
Votes 0 | Votes 0 |
Integrations | |

Segment is a single hub for customer data. Collect your data in one place, then send it to more than 100 third-party tools, internal systems, or Amazon Redshift with the flip of a switch.

Tag Manager gives you the ability to add and update your own tags for conversion tracking, site analytics, remarketing, and more. There are nearly endless ways to track user behavior across your sites and apps, and the intuitive design lets you change tags whenever you want.

RudderStack allows you to easily build pipelines connecting your whole customer data stack, then make them smarter by pulling analysis from your data warehouse to trigger enrichment and activation in customer tools.

Astro is the modern data orchestration platform, powered by Apache Airflow. Astro enables data engineers, data scientists, and data analysts to build, run, and observe pipelines-as-code.

A code-generated, type-safe tracking library to accurately implement analytics events that are defined and maintained in a single-source-of-truth web app. Built to optimize the experience of maintaining and version controlling complicated event schemas.

The leader in collaborative data cataloging, it empowers analysts & information stewards to search, query & collaborate for fast and accurate insights.

Codelessly connect your site to your stack. Automate tedious work so engineering can focus on product. It integrates your marketing and analytics tools with one click.

Iteratively helps teams capture reliable product analytics they can trust. It eliminates the most common causes of error during the definition and implementation of tracking plans, and cuts down on the time it takes to correctly instrument the product. As a result, folks that consume product analytics get exactly what they spec'd out and can rely on the incoming data knowing it is trustworthy and accurate.