Hi,
I want to build a multiclass classifier using a BERT model. For this, I would like to compare the performance of (at least) two domain-specific BERT models. But before comparing model performance, I would like to find the best hyperparameters using wandb sweeps and the simpletransformers API (simpletransformers has an easy wandb integration).
Currently I'm a bit confused about how to select a good dataset for
- the hyperparameter optimization
- the training with the best hyperparams.
So for the hyperparameter search, should I create n cross-validation splits and then, for each candidate hyperparameter configuration, run a training cycle on each of the n splits?
E.g., I create 2 train/test splits and only want to find the best number of epochs out of [1, 2]:
For both train/test splits, training is run for 1 epoch, and in the next cycle for 2 epochs?
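Roughly what I have in mind is something like this sketch (based on the wandb sweep example from the simpletransformers docs; the file name train.csv, the model name, and the fold count are just placeholders):

```python
import pandas as pd
import sklearn.metrics
import wandb
from sklearn.model_selection import StratifiedKFold
from simpletransformers.classification import ClassificationModel, ClassificationArgs

# Hypothetical training data: a DataFrame with "text" and "labels" columns
train_df = pd.read_csv("train.csv")

sweep_config = {
    "method": "grid",  # or "random" / "bayes"
    "metric": {"name": "mean_eval_acc", "goal": "maximize"},
    "parameters": {
        "num_train_epochs": {"values": [1, 2]},
        "learning_rate": {"values": [2e-5, 4e-5]},
    },
}
sweep_id = wandb.sweep(sweep_config, project="bert-multiclass-sweep")

model_args = ClassificationArgs()
model_args.overwrite_output_dir = True
model_args.no_save = True  # no need to keep checkpoints while sweeping
model_args.wandb_project = "bert-multiclass-sweep"

def train():
    wandb.init()
    skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)  # fold count is arbitrary here
    fold_acc = []
    for train_idx, val_idx in skf.split(train_df, train_df["labels"]):
        fold_train, fold_val = train_df.iloc[train_idx], train_df.iloc[val_idx]
        model = ClassificationModel(
            "bert",
            "bert-base-german-cased",  # placeholder: swap in the domain-specific model
            num_labels=train_df["labels"].nunique(),
            args=model_args,
            sweep_config=wandb.config,  # simpletransformers applies the sweep hyperparameters here
        )
        model.train_model(fold_train)
        result, _, _ = model.eval_model(fold_val, acc=sklearn.metrics.accuracy_score)
        fold_acc.append(result["acc"])
    # The sweep optimizes the metric averaged over all folds
    wandb.log({"mean_eval_acc": sum(fold_acc) / len(fold_acc)})
    wandb.finish()

wandb.agent(sweep_id, train)
```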
And once I have found the best hyperparameters, should I then train the final model on my full dataset?
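I.e., something like this (again only a sketch; the hyperparameter values are placeholders for whatever the sweep reports as best):

```python
import pandas as pd
from simpletransformers.classification import ClassificationModel, ClassificationArgs

# Hypothetical full training set with "text" and "labels" columns
train_df = pd.read_csv("train.csv")

# Placeholder values: copy in whatever the sweep reported as best
best_args = ClassificationArgs()
best_args.num_train_epochs = 2
best_args.learning_rate = 2e-5
best_args.overwrite_output_dir = True

final_model = ClassificationModel(
    "bert",
    "bert-base-german-cased",  # placeholder: the domain-specific model under comparison
    num_labels=train_df["labels"].nunique(),
    args=best_args,
)
final_model.train_model(train_df)  # train on the full dataset, no held-out fold
```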
Hope my questions are reasonably clear.