File Cleaning & Archiving

src.pipeline.clean_pipeline.clean_pipeline_workspace(data_pipeline_dir: Path, master_dataset_dir: Path)[source]

Archives labeled results and cleans the workspace except for label_studio.

  • Copies JSON label files from data_pipeline/labeled to a timestamped folder in master_dataset.

  • Copies matching images from data_pipeline/input (regardless of extension).

  • Cleans all folders in data_pipeline except ‘label_studio’.