Sweep: how is the optimisation metric selected in Bayesian optimisation?

felix_quinton · September 16, 2022, 1:48pm

Hi everyone,

When using a sweep with Bayesian optimisation, you are required to select a metric to optimise.

I wanted to know whether, when choosing the hyperparameters for the next runs, the Bayesian optimisation uses the value of that metric at the end of the run (last epoch) or the highest value the metric reached during the run.

In the first case, if I choose a metric calculated over my validation dataset, its value may decrease during training because of overfitting, so the value at the end of the run would not reflect the best performance of my model.
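To make the concern concrete, here is a minimal sketch of one possible workaround, not a statement of how the sweep controller behaves: keep a "best so far" value next to the raw validation metric, so that its value at the last epoch equals the peak of the run. The function name `running_best` and the sample accuracies are hypothetical and only for illustration.

```python
def running_best(values):
    """Return the best-so-far (running maximum) of a metric logged once per epoch."""
    best = float("-inf")
    history = []
    for value in values:
        best = max(best, value)  # never decreases, even if the raw metric drops
        history.append(best)
    return history


# Hypothetical validation accuracies: rises, peaks, then drops from overfitting.
val_acc = [0.30, 0.50, 0.60, 0.45, 0.40]
best_val_acc = running_best(val_acc)

# The raw metric ends below its peak; the running best ends at the peak.
print("final raw value:", val_acc[-1])
print("final best-so-far value:", best_val_acc[-1])
```

If such a best-so-far value were logged as its own metric and chosen as the metric to optimise, the question of "last value or best value" would no longer matter for that metric, since both are the same.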

Thanks for your help

mohammadbakir · September 19, 2022, 10:16pm

Hi @felix_quinton, please see this detailed article on the specifics of how Bayesian optimization works. If you still have any questions, please let us know.

felix_quinton · September 21, 2022, 3:30pm

Hi,

Thanks for your time. I read the article, but it doesn't seem to answer my question; I might have been unclear.

Let's take an example:

I want to find the best hyperparameter configuration for my model over 10 runs with Bayesian search.
I choose the accuracy on my validation dataset as the metric to maximise, and I train every run for 1000 epochs.

For run 1:

The run achieves its best validation accuracy at epoch 700, with a value of 0.60.
After that, the run starts to overfit and the validation accuracy decreases to 0.40 at epoch 1000.

For run 2:

The run achieves its best validation accuracy at epoch 700, with a value of 0.50.
After that, the run starts to overfit and the validation accuracy decreases to 0.45 at epoch 1000.

In my eyes, run 1 is the better run, since its configuration reached the highest result, with a maximum score of 0.60 compared to 0.50 for run 2.

But the process seems to consider only the value reached at the end of training when judging the quality of a run. In this case, run 2 would appear to be the better run, with a score of 0.45 compared to 0.40 for run 1, and so the hyperparameter combination of run 2 would have more influence than that of run 1 in the Bayesian optimisation process.

Am I right?
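The two readings of the example can be compared in a few lines. This is only an illustration of the arithmetic using the numbers from the two runs, not the sweep controller's actual logic; the `runs` dictionary and the `preferred_run` helper are hypothetical names.

```python
# Peak and end-of-training validation accuracy for the two runs in the example.
runs = {
    "run 1": {"best": 0.60, "final": 0.40},
    "run 2": {"best": 0.50, "final": 0.45},
}


def preferred_run(runs, key):
    """Return the name of the run with the highest value under the given reading."""
    return max(runs, key=lambda name: runs[name][key])


print("judged by final value:", preferred_run(runs, "final"))
print("judged by best value:", preferred_run(runs, "best"))
```

Judged by the final value, run 2 comes out ahead; judged by the best value, run 1 does. The two readings therefore rank the same pair of configurations in opposite order.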

Thanks.

system · November 20, 2022, 3:30pm

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.
