GitHub - CodeCutTech/dvc-demo
A demonstration of Data Version Control (DVC) for managing ML pipelines and data versioning.
DVC is an open-source version control system for machine learning projects. It helps you:
.
├── data/ # Raw and processed data files
│ └── raw.dvc # DVC file for raw data
├── src/ # Source code for data processing and model training
├── config/ # Configuration files
├── .dvc/ # DVC internal files
├── dvc.yaml # DVC pipeline definition
├── dvc.lock # DVC lock file for reproducible pipelines
└── .dvcignore # Files/directories to be ignored by DVC