Debug error with wandb

1060111768 · March 30, 2023, 4:23pm

Hi,
I met with a debug error when tuning hyperparams with sweep.

wandb: ERROR Run c3yfj87h errored: RuntimeError('cuDNN error: CUDNN_STATUS_INTERNAL_ERROR')

wandb: ERROR Run 542e421i errored: RuntimeError('false INTERNAL ASSERT FAILED at "../c10/cuda/CUDAGraphsC10Utils.h":73, please report a bug to PyTorch. Unknown CUDA graph CaptureStatus32522')

When I directly run it with a terminal, there is no such error. It only occurs when I debug. Could someone give some clues about the reason why?

raphael-sanandres · April 6, 2023, 8:49pm

According to a Stack Overflow post, the error RuntimeError('cuDNN error: CUDNN_STATUS_INTERNAL_ERROR') normally indicates that this is an out of memory problem.

This is likely this is an issue with PyTorch but there may be information in the debug logs of the run. They should be located in the wandb folder in the same directory as where the script was run. The wandb folder has folders formatted as run-DATETIME-ID associated with a single run. Could you retrieve the debug.log and debug-internal.log files from one of these folders specifically from the run that is having issues?

raphael-sanandres · April 11, 2023, 11:31pm

Hi there, I wanted to follow up on this request. Please let us know if we can be of further assistance or if your issue has been resolved.

1060111768 · April 19, 2023, 8:44pm

Hi,
sorry for delay of my reply. Unfortunately I cannot remember which run is related with this issue. For now I have not seen this error for several days. If the same problem occurrs again, I will turn to here then. Thanks for your help

system · June 18, 2023, 8:45pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Error handling W&B Help sweeps , wandb	6	187	June 27, 2024
Wanb.watch(model) causing CUDA OOM W&B Help wandb	5	1376	April 20, 2022
RuntimeError: max must be larger than min SCALER W&B Help wandb	3	512	September 26, 2022
Wandb: ERROR Error while calling W&B API: An internal error occurred. Please contact support W&B Help wandb	2	39	August 28, 2024
Issue with W&B Sweeps and Lightning W&B Help sweeps , wandb	3	44	September 4, 2024

Debug error with wandb

Related topics