#2 PyTorch Book Thread: Sunday, 5th Sept 8AM PT

girijesh · September 5, 2021, 3:49pm

1st row is header!!!

matt24 · September 5, 2021, 3:49pm

usually the first row contains the headers

jc17 · September 5, 2021, 4:04pm

Simplest – Consider each word having an ID – and do one hot encoding…

girijesh · September 5, 2021, 4:04pm

simples is One hot encoding as we do in Bag of Words !
then we can use word2vec

yuvraj · September 5, 2021, 4:04pm

by finding a way to convert the representation in numbers, for example creating a dictionary of all words and representing a word with its index in that dictionary.

girijesh · September 5, 2021, 4:05pm

for punctuation and other things we can we can remove as preprocessing steps

simplysumanth · September 5, 2021, 4:09pm

This is Off-topic:
In one of the Kaggle project for Image Classification → Evaluation metric was f1_score Macro. So during training do we need to use the same metric (other than accuracy) …if so this is not in pytorch (it’s in sklearn), So how can we use it in GPU based model?

girijesh · September 5, 2021, 4:28pm

What does exactly mean by non-linearity here, as here we took y = mx + c is a linear equation!!

bhutanisanyam1 · September 5, 2021, 4:28pm

girijesh · September 5, 2021, 4:30pm

Home work

Try ImageIO and TorchVision
Read hd5py module
Make a cheat sheet for all handy function used in today’s class
Think about making a documentation for permute function.
Time series

yuvraj · September 5, 2021, 4:32pm

Great stuff! Thanks.

nandeshwar · September 6, 2021, 7:19pm

Transfer Learning is the way of using already trained models for different data or tasks. Transfer Learning can be applied in several ways. Also known as Model Adaptation.

Fine-Tuning is one of the ways of Transfer Learning only. In this, the already trained neural network is further trained on the new dataset. The benefits are 1. Good Neural-Net architecture to start with 2. Weights Initialization is done using the Pre-Trained weights hence the model converges faster.

tarkanaguner · September 7, 2021, 12:28pm

Hi, trying to read and catch up… At page 90 of the book, at 4.4.2, the author talks about reshaping the bikes data. It says: We see that the rightmost dimension is the number of columns in the original
dataset. Then, in the middle dimension, we have time, split into chunks of 24 sequential hours. In other words, we now have N sequences of L hours in a day, for C channels. To get to our desired N × C × L ordering, we need to transpose the tensor
My question is, why the author strictly wants the data in N x C x L format, whereas N x L x C seems more natural…Where each row is hourly data and 17 features are in the columns? Isn’t the NxLxC, the ‘normal’ setup? So what does the auther want to achieve to get features into the rows, rather than keeping them in the columns, so that each hour is in the rows as a sepearate data point??

ukamath · September 7, 2021, 8:48pm

The YouTube video shows Jupyter notebook with more code and tests than what is in the https://github.com/deep-learning-with-pytorch/dlwpt-code, is there a fork or a separate code base for this?

bhutanisanyam1 · September 7, 2021, 10:34pm

Thanks for checking! No, this was just via the code on there, which notebook did you notice, has a difference? If I’m on and older version that has more context, I can point you to the version then

sahiljuneja · September 8, 2021, 12:00pm

Based on what I have seen, it honestly seems like a personal preference.

N x C gives a window that shows the data across all features during a particular hour of the day. So, (N, C, 3) would be the data for the 3rd hour of the day.

So, if I wanted to get the average of the temp for the first date I could do -

daily_bikes[0, 10 , :].mean()

If instead, it was N x L x C, then we look at it differently. N x L would be a window that shows the data for an entire day for a particular feature. So, (N, L, 10) would be the data for the entire day for the temp feature.

In this case, the average would be -

daily_bikes[0, :, 10].mean()

As long as we are consistent with what operations we perform and on what dimension, I don’t think there’s much difference here. Alludes more to the discussion on Named Tensors in the 3rd chapter.

It probably only matters how it’s shaped when trying to input into a NN. That’s something I have not tried yet, and choosing either of the above could potentially have an impact at that point, I think.

deep_learner_007 · October 3, 2021, 10:59pm

I was getting a bit confused with respect to offset and stride. I request you to share a small example if possible.

deep_learner_007 · October 3, 2021, 11:03pm

I found the following link explaining stride: Pytorch tensor stride - how it works - PyTorch Forums

bhutanisanyam1 · October 4, 2021, 9:21am

Sure! I’ll cover this in the beginning of the next call. Thanks for asking

Topic		Replies	Views
#1 PyTorch Book Thread: Sunday, 29 Aug 8 AM PT PyTorch Book Reading Group	66	4355	September 17, 2021
#3 PyTorch Book Thread: Sunday, 12th Sept 8AM PT PyTorch Book Reading Group	38	3192	October 7, 2021
Special Session w Thomas Viehmann : How to make the most out of community resources PyTorch Book Reading Group	53	3184	September 21, 2021
#5 PyTorch Book Thread: Sunday, 26th Sept 8AM PT PyTorch Book Reading Group	32	2305	September 26, 2021
#6 PyTorch Book Thread: Sunday, 3rd Oct 8AM PT PyTorch Book Reading Group	23	2349	October 7, 2021

#2 PyTorch Book Thread: Sunday, 5th Sept 8AM PT

Related topics