Hi @erikk , happy to help. At a high level, the RL trainers keep track of how many steps have been taken during training: `global_step` is incremented every time a batch is processed. When logging training metrics to wandb, `global_step` is used as the x-axis. The wandb SDK also has its own internal step counter, which follows a different increment rule, so the trainer's `global_step` and wandb's step can end up being updated at different times.
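To illustrate the divergence, here is a minimal sketch (not the actual trainer code, and `logging_steps` is an assumed config value): the trainer-side counter advances once per batch, while wandb's internal counter advances once per `wandb.log()` call, so if you only log every N batches the two drift apart.

```python
# Hypothetical illustration of the two counters diverging.
logging_steps = 10          # assumed: log metrics every 10 batches
global_step = 0             # trainer-side counter, +1 per batch
wandb_step = 0              # stand-in for wandb's internal counter, +1 per log call

logged_pairs = []
for batch in range(50):     # pretend we process 50 batches
    global_step += 1        # updated on every batch
    if global_step % logging_steps == 0:
        # In real code this is where wandb.log(...) would be called;
        # without an explicit step=, wandb uses its own counter.
        logged_pairs.append((global_step, wandb_step))
        wandb_step += 1     # wandb only increments when something is logged

print(logged_pairs)  # → [(10, 0), (20, 1), (30, 2), (40, 3), (50, 4)]
```

Passing `step=global_step` to `wandb.log()` pins the x-axis to the trainer's counter, which is one way to keep the two in sync.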
Regarding the TensorBoard behavior you are seeing, could you provide a link to your workspace, or screenshots of what you are seeing? That will help me understand the issue better.