Parallelizing runs with multiple logical GPUs

kprybol · November 4, 2021, 12:31am

With Google Colab (or similar large-GPU setups such as JupyterHub) you can create multiple logical/virtual GPUs and parallelize training runs, assuming your models are small enough.

import tensorflow as tf

gpus = tf.config.list_physical_devices('GPU')
if gpus:
  # Create 2 virtual GPUs with 1 GB (1024 MB) of memory each
  try:
    tf.config.set_logical_device_configuration(
        gpus[0],
        [tf.config.LogicalDeviceConfiguration(memory_limit=1024),
         tf.config.LogicalDeviceConfiguration(memory_limit=1024)])
    logical_gpus = tf.config.list_logical_devices('GPU')
    print(len(gpus), "Physical GPU,", len(logical_gpus), "Logical GPUs")
  except RuntimeError as e:
    # Virtual devices must be set before GPUs have been initialized
    print(e)
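Once the logical GPUs exist, each training run still has to be placed on one of them. A minimal sketch of that step, assuming the runs are spread round-robin over the device names reported by tf.config.list_logical_devices('GPU'); the helper assign_runs_to_devices and the run names are hypothetical, and the TensorFlow calls are left as comments so the snippet runs without a GPU:

```python
from itertools import cycle


def assign_runs_to_devices(run_names, device_names):
    """Round-robin each run onto a logical device name such as '/device:GPU:0'."""
    if not device_names:
        raise ValueError("no logical devices available")
    return dict(zip(run_names, cycle(device_names)))


# Hypothetical usage in the notebook, after the configuration above:
#   devices = [d.name for d in tf.config.list_logical_devices('GPU')]
#   plan = assign_runs_to_devices(["run-a", "run-b", "run-c"], devices)
#   with tf.device(plan["run-a"]):
#       model.fit(...)  # ops created in this block are placed on that logical GPU

# Stand-in device names so the sketch can be run anywhere.
plan = assign_runs_to_devices(
    ["run-a", "run-b", "run-c"],
    ["/device:GPU:0", "/device:GPU:1"],
)
print(plan)
```

The tf.device scope only controls placement; whether the runs actually execute concurrently depends on how they are launched (threads, processes, or sequentially).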

Is it possible to train multiple sweep runs in parallel with logical GPUs within a Colab-like environment?

lesliewandb · November 4, 2021, 1:52pm

Hi Kevin,

Yes, it is possible. To do so, run CUDA_VISIBLE_DEVICES=2 wandb agent followed by your sweep ID, once for each of the GPUs on each of the machines, changing the device index each time.
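The same per-GPU launch can be sketched from Python with the standard library. This is only an illustration under assumptions: the helper names build_agent_jobs and launch_agents and the sweep ID are hypothetical placeholders, and CUDA_VISIBLE_DEVICES selects physical GPUs, so it knows nothing about TensorFlow logical devices created in another process.

```python
import os
import subprocess


def build_agent_jobs(sweep_id, gpu_ids, base_env=None):
    """Return one (command, environment) pair per physical GPU index."""
    base_env = dict(os.environ if base_env is None else base_env)
    jobs = []
    for gpu in gpu_ids:
        env = dict(base_env)
        # Each agent process sees only the GPU listed here.
        env["CUDA_VISIBLE_DEVICES"] = str(gpu)
        jobs.append((["wandb", "agent", sweep_id], env))
    return jobs


def launch_agents(sweep_id, gpu_ids):
    """Start one `wandb agent` process per GPU; requires the wandb CLI on PATH."""
    return [
        subprocess.Popen(cmd, env=env)
        for cmd, env in build_agent_jobs(sweep_id, gpu_ids)
    ]


# Placeholder sweep ID; substitute the one printed when the sweep was created.
jobs = build_agent_jobs("my-entity/my-project/abc123", [0, 1], base_env={})
for cmd, env in jobs:
    print(env["CUDA_VISIBLE_DEVICES"], " ".join(cmd))
```

Calling launch_agents instead of build_agent_jobs would actually start the agents; each one is a separate process with its own TensorFlow runtime.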

kprybol · November 4, 2021, 6:46pm

I guess, more specifically, can this be done from a notebook environment? It looks like you’re referencing a shell command. I believe that would break the virtual devices created by TF, since you’re leaving the environment in which the virtual devices were created.

lesliewandb · November 5, 2021, 7:18pm

Yes, you can use multiple GPUs in both Jupyter and Google Colab with wandb.

system · January 3, 2022, 6:46pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.
