How to distinguish resumed runs during sweeps?

cschell · March 16, 2022, 9:10am

I’m looking into W&B’s Sweeps feature for my next project and am currently trying to implement a resume mechanism.

I use the following code to restore my model:

import wandb

wandb.init(resume=True)

if wandb.run.resumed:
    # Resumed run: fetch the checkpoint saved by the previous attempt.
    model = wandb.restore("last.ckpt")
else:
    model = ...  # instantiate new model

However, wandb.run.resumed is apparently always True, since the wandb agent sets the WANDB_RUN_ID environment variable, so wandb.restore fails for new runs, which have no checkpoint yet. What is a good way to handle this?
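
To check what the agent has actually set for a given run, a small diagnostic like the one below can be logged at startup. This is only a sketch: the helper name sweep_environment is made up for illustration, and it simply reads the WANDB_RUN_ID, WANDB_SWEEP_ID and WANDB_RESUME environment variables without interpreting them.

```python
import os
from typing import Dict, Mapping, Optional


def sweep_environment(
    environ: Mapping[str, str] = os.environ,
) -> Dict[str, Optional[str]]:
    """Collect the W&B environment variables relevant to resuming.

    Hypothetical helper for debugging: a value of None means the
    variable is not set in the given environment.
    """
    keys = ("WANDB_RUN_ID", "WANDB_SWEEP_ID", "WANDB_RESUME")
    return {key: environ.get(key) for key in keys}


if __name__ == "__main__":
    # Print the values seen by this process, e.g. when launched by `wandb agent`.
    for key, value in sweep_environment().items():
        print(f"{key}={value}")
```

If WANDB_RUN_ID is already set before wandb.init is called, that would be consistent with the behaviour described above.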

_scott · April 20, 2022, 11:44am

Hi,
Sorry this was missed; I have forwarded it to support.

ramit_goolry · April 21, 2022, 10:15pm

Hi @cschell,

I just tested this on my end: wandb.run.resumed is only True when the last run executed in the directory exited with a nonzero exit code. When the previous run exits with a zero exit code, wandb.run.resumed is False.

I suspect you might always be getting True because the previous run crashes on wandb.restore. Could you try instantiating a new run which creates “last.ckpt” and then try resuming?
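
Independently of the resumed flag, one way to keep a fresh run from crashing is to treat a missing checkpoint as "start from scratch". The sketch below is not an official recipe. It assumes that wandb.restore either returns None or raises ValueError when the file does not exist for the run (this may depend on the wandb version), and the helper name restore_checkpoint_path is made up for illustration. The restore function is passed in as an argument so the logic can be tried without a W&B connection.

```python
from typing import IO, Callable, Optional


def restore_checkpoint_path(
    restore: Callable[[str], Optional[IO]],
    name: str = "last.ckpt",
) -> Optional[str]:
    """Return the local path of a restored checkpoint, or None if there is none.

    `restore` is expected to behave like `wandb.restore` (assumption):
    it returns an open file object on success, and returns None or
    raises ValueError when the file cannot be found for the run.
    """
    try:
        handle = restore(name)
    except ValueError:
        # No such file for this run: treat it as a fresh start.
        return None
    if handle is None:
        return None
    path = handle.name
    handle.close()  # Only the path is needed; the caller loads the file itself.
    return path


# Usage inside a sweep run (build_model and load_weights are placeholders):
#   import wandb
#   wandb.init(resume=True)
#   ckpt_path = restore_checkpoint_path(wandb.restore)
#   model = build_model() if ckpt_path is None else load_weights(ckpt_path)
```

With this approach the decision depends on whether a checkpoint actually exists rather than on wandb.run.resumed.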

Thanks,
Ramit

ramit_goolry · April 27, 2022, 7:16pm

Hi @cschell,

We wanted to follow up with you regarding your support request as we have not heard back from you. Please let us know if we can be of further assistance or if your issue has been resolved.

Best,
Weights & Biases

ramit_goolry · May 2, 2022, 4:22pm

Hi Christian, since we have not heard back from you, we are going to close this request. If you would like to re-open the conversation, please let us know!

system · June 20, 2022, 10:16pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.
