I prepared a template project to configure wandb and slurm to support job arrays.
Related Topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Wandb sweep using slurm and multi gpu setting | 1 | 437 | April 30, 2024 | |
SLURM and Launch-agent | 5 | 264 | October 30, 2024 | |
Distributed data parallel with pytorch lightning | 6 | 217 | August 21, 2024 | |
Jobs keep failed from second one | 4 | 548 | July 25, 2023 | |
Has anyone used wandb sweeps and torch.distributed before? | 2 | 394 | June 3, 2022 |