We are using W&B to manage our artifact versions, but the files themselves live in a Google Cloud Storage bucket.
I have noticed that downloading with the wandb Python library, i.e.:
api = wandb.Api()
artifact = api.artifact("name")
path = artifact.download()
runs at 500-600 Mbit/s, whereas downloading directly from the bucket:
gcloud storage cp --recursive gs://<bucket>/<artifact_folder>/* /path
runs at 4-5 Gbit/s, the maximum throughput of the VM's disk.
Any suggestions on how to speed it up?
Since you are storing your files in a Google bucket, are you using Reference Artifacts? This changes downloads to come from the bucket itself instead of from our servers, which should be quite a bit faster.
Thanks for your reply.
Yes, we are already using reference artifacts.
After looking around, the difference may be due to the recursive download that gcloud uses. We are using the defaults for the artifact download, so the gap appears to come purely from how the API calls fetch the files, but I am investigating the differences further. As of right now, however, there isn't a faster way to download artifacts, since we download reference artifacts with only one method.
Thanks for looking into this.
As you can imagine, it is a bit annoying not to be able to take advantage of the full bandwidth.
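One possible workaround, sketched below under some assumptions: for reference artifacts, each entry in the artifact manifest carries the original gs:// URI in its ref field, so you could collect those URIs and hand them to gcloud storage cp in a single invocation, bypassing the library's download path. This assumes your wandb version exposes artifact.manifest.entries with a .ref attribute on each entry; the artifact name, alias, and destination below are placeholders.

```python
import subprocess

def gcs_refs(entries):
    """Collect gs:// reference URIs from a manifest mapping (path -> entry).

    Entries without a ref (or with a non-GCS ref) are skipped.
    """
    return [e.ref for e in entries.values() if e.ref and e.ref.startswith("gs://")]

def gcloud_cp_cmd(uris, dest):
    """Build a single 'gcloud storage cp' command copying all URIs into dest."""
    return ["gcloud", "storage", "cp", *uris, dest]

# Usage (requires W&B credentials; names are placeholders):
# import wandb
# art = wandb.Api().artifact("entity/project/name:alias")
# uris = gcs_refs(art.manifest.entries)
# subprocess.run(gcloud_cp_cmd(uris, "/data/artifact"), check=True)
```

Note this trades away the library's checksum verification and local cache bookkeeping, so it is only worthwhile when raw throughput matters more than those guarantees.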
This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.