What is DVC?
It is an open-source Version Control System for data science and machine learning projects. It is designed to handle large files, data sets, machine learning models, and metrics as well as code.
DVC is a tool in the Version Control System category of a tech stack.
DVC is an open source tool with 7.4K GitHub stars and 696 GitHub forks. Here’s a link to DVC's open source repository on GitHub
Who uses DVC?
18 developers on StackShare have stated that they use DVC.
GitHub, Python, Git, Google Drive, and GitLab are some of the popular tools that integrate with DVC. Here's a list of all 11 tools that integrate with DVC.
- Storage agnostic
- Low friction branching
- Metric tracking
- ML pipeline framework
- Language- & framework-agnostic
- HDFS, Hive & Apache Spark
- Track failures
DVC Alternatives & Comparisons
What are some alternatives to DVC?
See all alternatives
Pachyderm is an open source MapReduce engine that uses Docker containers for distributed computations.
MLflow is an open source platform for managing the end-to-end machine learning lifecycle.
Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency.
Subversion exists to be universally recognized and adopted as an open-source, centralized version control system characterized by its reliability as a safe haven for valuable data; the simplicity of its model and usage; and its ability to support the needs of a wide variety of users and projects, from individuals to large-scale enterprise operations.
Mercurial is dedicated to speed and efficiency with a sane user interface. It is written in Python. Mercurial's implementation and data structures are designed to be fast. You can generate diffs between revisions, or jump back in time within seconds.