CodeCutTech/dvc-demo

A demonstration of Data Version Control (DVC) for managing ML pipelines and data versioning.

DVC is an open-source version control system for machine learning projects. This repository is organized as follows:

.
├── data/              # Raw and processed data files
│   └── raw.dvc        # DVC metadata file that tracks the raw data
├── src/               # Source code for data processing and model training
├── config/            # Configuration files
├── .dvc/              # DVC internal files
├── dvc.yaml           # DVC pipeline definition
├── dvc.lock           # DVC lock file for reproducible pipelines
└── .dvcignore         # Files/directories to be ignored by DVC
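
To make the role of a file like `data/raw.dvc` more concrete, here is a minimal Python sketch of the general idea behind content-addressed data tracking: a small metadata record stores a hash of the data, so a change to the data shows up as a change to the record. This illustrates the concept only and is not DVC's implementation. The helper names (`md5_of_file`, `make_pointer`, `demo`), the sample file `raw.csv`, and the simplified record layout are all assumptions made for this example.

```python
import hashlib
import os
import tempfile


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_pointer(path):
    """Build a small metadata record for a data file.

    The layout is a simplified, hypothetical stand-in for a pointer file;
    it is not guaranteed to match what DVC writes.
    """
    return {
        "outs": [
            {
                "md5": md5_of_file(path),
                "size": os.path.getsize(path),
                "path": os.path.basename(path),
            }
        ]
    }


def demo():
    """Create a sample file, record it, modify it, and record it again."""
    with tempfile.TemporaryDirectory() as tmp:
        data_path = os.path.join(tmp, "raw.csv")  # hypothetical sample data
        with open(data_path, "wb") as fh:
            fh.write(b"id,value\n1,10\n")
        before = make_pointer(data_path)

        # Appending a row changes the content, and therefore the hash.
        with open(data_path, "ab") as fh:
            fh.write(b"2,20\n")
        after = make_pointer(data_path)
    return before, after


if __name__ == "__main__":
    before, after = demo()
    print("before:", before)
    print("after: ", after)
```

Because the record is small and text-based, it can be committed to Git while the data itself is stored elsewhere; comparing the two records is enough to tell that the data changed.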