Wandb.watch() when using mixed precision and torch.cuda.amp.GradScaler()

I have a PyTorch project where I'm using mixed-precision training with gradient scaling. When using wandb.watch() to log model gradients, is it possible to unscale them with something like scaler.unscale_() at some point in the code prior to logging? My code looks something like the below.

import torch
import wandb

wandb.init(project="my_project", name='my_run', config=config, mode='online')
model = Net()
wandb.watch(model, log='all')
optimiser = my_optim(model.parameters(), lr=lr)

scaler = torch.cuda.amp.GradScaler(enabled=use_amp)

for epoch in range(epochs):
    for input, target in train_loader:
        with torch.autocast(device_type='cuda', dtype=torch.float16, enabled=use_amp):
            pred = model(input)
            loss = loss_fn(pred, target)
        scaler.scale(loss).backward()
        scaler.step(optimiser)
        scaler.update()
        optimiser.zero_grad(set_to_none=True)

Hi @dt_90, thanks for writing in! I wanted to follow up on this request and see whether you've already tried it and ran into any issues. Also, I was wondering what the use case would be for logging the unscaled gradients instead?

Hi @dt_90, just checking in to see whether you're still experiencing any issues with this, and whether you could provide some more information about the errors you get. If possible, sharing a minimal code example would greatly help us reproduce the issue and further assist you with this. Thanks!

Hi @dt_90, since we haven't heard back from you with any additional information regarding this issue, we will close this ticket for now. However, please let us know if the issue still persists for you and we will be happy to keep investigating.