I have a large dataset made up of multiple files on my local file system that I would like to track. It is not part of a GitHub repository because the files are quite large (around 30 GB in total, with each file being about 0.5 GB).
I added references to these files in W&B using the command:
artifact.add_reference(name='data_folder', uri='file://path/to/directory')
Now, if I change these files and log them in a run, I can see in the web UI that the version of the artifact changes. In the future, if I want to use an older version of this dataset, is there a way to do so?
I’m assuming not, because W&B is only tracking the references and not the file contents, so there would be no way of going back to the old dataset.
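To make my assumption concrete, here is a minimal sketch in plain Python (not W&B code) of what I think reference tracking amounts to: each version records only a path and a content hash, so a change can be detected but the old bytes cannot be recovered from the record. The file name, the use of SHA-256, and the record layout are my own placeholders for illustration and may not match what W&B actually stores.

```python
import hashlib
import tempfile
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_record(path: Path) -> dict:
    """Build a hypothetical 'reference' record: path and hash, no contents."""
    return {"path": str(path), "digest": file_digest(path)}


def simulate_reference_tracking() -> dict:
    """Record a file, overwrite it in place, then record it again."""
    with tempfile.TemporaryDirectory() as tmp:
        data_file = Path(tmp) / "part_000.bin"  # hypothetical file name

        data_file.write_bytes(b"original contents")
        v0 = make_record(data_file)  # first "version"

        data_file.write_bytes(b"modified contents")
        v1 = make_record(data_file)  # second "version", same path

        # Only the modified bytes remain on disk at this point.
        current_bytes = data_file.read_bytes()

    return {"v0": v0, "v1": v1, "current_bytes": current_bytes}


result = simulate_reference_tracking()
print("v0 digest:", result["v0"]["digest"])
print("v1 digest:", result["v1"]["digest"])
print("bytes on disk after the change:", result["current_bytes"])
```

If this picture is right, the v0 record still exists but points at a path whose contents no longer match its hash, which is why I suspect I would need to keep my own copy of the old files to go back to them.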
Any help would be appreciated.
Thanks,
Chaitanya