Error calling wandb.watch

samlapp · August 7, 2023, 12:19am

After calling wandb.watch while training, I saved my model object by pickeling with torch.save(). Then, in a new script, I reloaded the model object and ran a similar training script to continue training for more epochs. I get an error that says:

ValueError: You can only call `wandb.watch` once per model.  Pass a new instance of the model if you need to call wandb.watch again in your code.

I can’t figure out what attribute of the model I should delete to avoid wandb seeing it as a model that has already been watched. Or perhaps, its more that on the server side, wandb recognizes this object as the same object as before and already has its hash or something?

Either way, I would like advice on how to either (a) have a conditional block like

if not [somehow check if the model has been watched]:
    wandb.watch(model)

or a way to “clean” the model of its wandb.watch history, like

del model._hidden_property_created_by_wandb

Thanks for any advice
Sam

uma-wandb · August 10, 2023, 6:48pm

Hey @samlapp, in your wandb.watch() call, are you specifying any arguments? It is intended behavior to only call wandb.watch once per model.

I recommend calling wandb.unwatch() on the model before pickling it. This would remove the hooks that are being saved. You can do this with a line similar to the following:

wandb.unwatch(model)

Please let me know if this helps!

uma-wandb · August 15, 2023, 5:46am

Hi @samlapp,

We wanted to follow up with you regarding your support request as we have not heard back from you. Please let us know if we can be of further assistance or if your issue has been resolved.

Best,

Uma

uma-wandb · August 17, 2023, 5:56pm

Hi samlapp, since we have not heard back from you we are going to close this request. If you would like to re-open the conversation, please let us know!

samlapp · August 21, 2023, 10:01pm

Apologies for the delay. The .unwatch() solution hasn’t solved the issue for me. For instance, if I
model.train(...)
then
wandb.unwatch(model)
then model.train(...) again, I still get the same error: ValueError: You can only call wandb.watch once per model. Pass a new instance of the model if you need to call wandb.watch again in your code.

samlapp · August 21, 2023, 10:09pm

Here’s how I’m calling it:

wandb_session = wandb.init(...)
wandb_session.watch(
                   model, log="all", log_freq=log_freq, log_graph=(True)
)

and

wandb_session.unwatch(model)

samlapp · August 21, 2023, 10:14pm

I think the issue is specific to using log_graph=True. When I set it to False, I no longer have the error.

system · October 20, 2023, 10:14pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Wandb.watch not logging parameters W&B Help	19	2009	February 5, 2022
Why does wandb.watch() monitor some parameters' gradients twice? W&B Help wandb	3	392	February 16, 2024
Wandb.watch() when using mixed precision and torch.cuda.amp.GradScaler() W&B Help	4	466	April 9, 2023
Why the weights for my model are not logged while I can see the gradients? W&B Help questions	8	1070	June 17, 2023
Wandb.watch doesnt log anything for me W&B Help	4	1581	May 2, 2023

Error calling wandb.watch

Related topics