Transfer Learning in Plain PyTorch Using fastai Insights

Hey good people. Here is a post about tips and tweaks you can use to make transfer learning work just right.

Most blog posts I have read on this topic just suggest replacing the last linear layer and freezing all the other parameters, but there are more tweaks you can try, for example:

- Discriminative learning rates
- Not freezing the BatchNorm layers
- Unfreezing the rest of the model after a few epochs
- Using a custom, better head (classifier)
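Two of these tweaks, discriminative learning rates and a custom head, can be sketched in plain PyTorch like this. The tiny backbone, layer sizes, and learning rates below are illustrative assumptions, not recommendations; in practice the body would be a pretrained model such as a ResNet:

```python
import torch
import torch.nn as nn

# "body" stands in for a pretrained backbone, "head" for a custom
# classifier (pool -> flatten -> linear), both illustrative.
body = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.BatchNorm2d(16),
    nn.ReLU(),
)
head = nn.Sequential(
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(16, 10),  # 10 target classes, as an example
)
model = nn.Sequential(body, head)

# Discriminative learning rates: one optimizer, two parameter groups.
# The pretrained body gets a small LR, the fresh head a larger one.
optimizer = torch.optim.AdamW([
    {"params": body.parameters(), "lr": 1e-5},
    {"params": head.parameters(), "lr": 1e-3},
])
```

You can extend this to more than two groups (e.g. one learning rate per stage of the backbone), which is essentially what fastai's `slice(lr)` spreads across layer groups.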


Unfreezing just batchnorm makes a ton of sense – we usually expect the distribution in the domain to shift when we move to a new task (especially when it’s a much narrower task, rather than just a different task), and being able to capture at least the changes in mean and std seems like it’d help a ton without incurring too much additional cost.
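A minimal sketch of that idea in plain PyTorch: freeze everything, then unfreeze only the BatchNorm affine parameters (gamma/beta). The small backbone here is a stand-in for any pretrained model:

```python
import torch.nn as nn

# Illustrative stand-in for a pretrained backbone.
model = nn.Sequential(
    nn.Conv2d(3, 16, kernel_size=3, padding=1),
    nn.BatchNorm2d(16),
    nn.ReLU(),
    nn.Conv2d(16, 32, kernel_size=3, padding=1),
    nn.BatchNorm2d(32),
)

# Freeze everything first...
for p in model.parameters():
    p.requires_grad = False

# ...then unfreeze only the BatchNorm layers' learnable scale/shift.
bn_types = (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)
for m in model.modules():
    if isinstance(m, bn_types):
        for p in m.parameters():
            p.requires_grad = True
```

With the model kept in `train()` mode, the running mean/variance also keep updating on the new domain's data, which is exactly the distribution-shift adaptation you describe.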

Thanks for sharing!


Yes, I’m surprised most tutorials suggest the opposite: keeping BatchNorm frozen so as not to disturb its previous running statistics.
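Worth noting that "freezing BatchNorm" can mean two different things, and the tutorials usually mean the second. A sketch of both, using a single standalone layer for illustration:

```python
import torch
import torch.nn as nn

bn = nn.BatchNorm2d(16)  # illustrative standalone layer

# 1) Freeze the learnable affine parameters (gamma/beta) only;
#    in train() mode the running mean/var would still update.
for p in bn.parameters():
    p.requires_grad = False

# 2) Freeze the running statistics too: eval mode makes the layer
#    normalize with the stored running_mean/running_var and stop
#    updating them. Note a later model.train() call re-enables the
#    updates unless you set the BN layers back to eval() afterwards.
bn.eval()
```

The distribution-shift argument above is a case for doing neither, at least for the affine parameters, when the new domain differs from the pretraining one.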
