DenseNet Paper: [1608.06993] Densely Connected Convolutional Networks
Blog post: DenseNet Architecture Explained with PyTorch Implementation from TorchVision | Committed towards better future
Livestream on YouTube here: W&B Paper Reading Group: DenseNet - YouTube
Yes, I can hear you very well, Aman!
From the abstract: "Whereas traditional convolutional networks with L layers have L connections — one between each layer and its subsequent layer — our network has L(L+1)/2 direct connections."
I couldn’t comprehend the above line from the abstract! Can we discuss it in detail?
This is because the first layer has 1 connection, the second layer has 2 connections, and so on… so if we add all these connections, 1 + 2 + 3 + … + L = L(L+1)/2.
Yeah, got it, like an arithmetic series.
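A tiny sanity check of the arithmetic-series argument above, counting one incoming connection per preceding layer (plus the input) for each of the L layers:

```python
# Counting direct connections in a densely connected network of L layers.
# Layer l receives the feature maps of all l-1 preceding layers plus the
# input, i.e. l incoming connections, so the total is 1 + 2 + ... + L.

def dense_connections(L: int) -> int:
    return sum(layer for layer in range(1, L + 1))

for L in (1, 2, 3, 5, 100):
    assert dense_connections(L) == L * (L + 1) // 2

print(dense_connections(5))  # 15
```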
Do we need to keep the size intact with padding while doing the convolution?
Strided (with a really large stride)
1 x 1 convolution?
This means we'll never be adding input from the first block to the last block, or more generally, there are no skip connections across blocks, only within blocks, right?
How are 32 features getting added in Dense Block 1?
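One way to see the "32 features added" point: each dense layer outputs growth_rate = 32 new feature maps, and the block concatenates them onto everything that came before. A minimal channel-count sketch, assuming torchvision's DenseNet-121 defaults (64 stem channels, growth rate 32, 6 layers in Dense Block 1):

```python
def dense_block_channels(num_input, growth_rate, num_layers):
    """Channel count after each layer of a dense block: every layer
    concatenates growth_rate new feature maps onto its input."""
    channels = [num_input]
    for _ in range(num_layers):
        channels.append(channels[-1] + growth_rate)
    return channels

# Dense Block 1 of DenseNet-121: 64 -> 96 -> 128 -> ... -> 256
print(dense_block_channels(64, growth_rate=32, num_layers=6))
```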
What is the reason/intuition behind using more layers in the deeper blocks (Dense Blocks 3 and 4)?
Will this architecture not overfit, given so many densely connected conv layers and an FC layer at the end?
I am confused about the 1x1 conv with 128 filters as the bottleneck. Is it the same across all blocks and all layers within each block?
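On the 128-filter 1x1 above: in the torchvision implementation the bottleneck width is bn_size * growth_rate = 4 * 32 = 128 output channels, so it is indeed the same for every layer in every block; only its *input* channel count grows as the block concatenates features. A sketch of that channel math, assuming torchvision's defaults:

```python
BN_SIZE, GROWTH_RATE = 4, 32  # torchvision DenseNet defaults

def dense_layer_shapes(num_input_features):
    """Channel counts through one dense layer:
    input -> 1x1 bottleneck (always 128) -> 3x3 conv (always 32)."""
    bottleneck_out = BN_SIZE * GROWTH_RATE
    layer_out = GROWTH_RATE
    return num_input_features, bottleneck_out, layer_out

# Input channels grow layer by layer, but the 1x1 always emits 128
for c_in in (64, 96, 128, 224):
    print(dense_layer_shapes(c_in))  # (c_in, 128, 32)
```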
In the transition block, could we have only the pooling layer and avoid the 1x1 layer? What is the advantage of the 1x1 layers where we go from 64x56x56 to 128x56x56?
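On the transition question above: pooling alone would halve the height/width but leave the channel count untouched, so concatenated channels would keep piling up across blocks. The 1x1 conv in the transition is what compresses channels (by a factor theta = 0.5 in DenseNet-BC) before pooling. A channel/size sketch, assuming the DenseNet-121 figures (the 64x56x56 -> 128x56x56 step in the question is a conv inside a block, not a transition):

```python
def transition(channels, height, width, theta=0.5):
    """DenseNet-BC transition: 1x1 conv compresses channels by theta,
    then 2x2 average pooling with stride 2 halves the spatial size."""
    return int(channels * theta), height // 2, width // 2

# After Dense Block 1 in DenseNet-121: 256x56x56 -> 128x28x28
print(transition(256, 56, 56))  # (128, 28, 28)
```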
Fundamental doubt: how do we differentiate between filters? Each filter is a stack of kernels, and the number of output channels depends on the number of filters we apply. Since every filter in a layer has the same number of kernels, how exactly do the filters end up different? Are the kernel values arbitrary/random initial values, so that each filter gets different weights and learns something different?
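On the doubt above: all filters in a conv layer share the same shape, but each one starts from its own random draw of weights, and training then pushes them toward different features. A minimal plain-Python sketch of independently initialized filters (toy shapes, no framework):

```python
import random

def init_filters(num_filters, in_channels, k):
    """Each filter is an independent (in_channels x k x k) stack of kernels,
    drawn from the same distribution but with different random samples."""
    return [
        [[[random.gauss(0.0, 0.1) for _ in range(k)] for _ in range(k)]
         for _ in range(in_channels)]
        for _ in range(num_filters)
    ]

random.seed(0)
filters = init_filters(num_filters=4, in_channels=3, k=3)
# Same shape, different values: training then specializes each filter
assert filters[0] != filters[1]
```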
Fundamental question: when we go deeper with layers, are the features that get extracted called low-level features or high-level features?
Thanks @amanarora . that makes sense. trying it out
That makes sense. But the first layer inside the DenseBlock has only 32 outputs, so the 1x1 that follows it is not really a bottleneck if it increases the feature maps to 128. The term bottleneck only makes sense in the later layers of the DenseBlock.
The ResNet architectures also add skip connections, and DenseNet likewise adds connections from earlier layers to later layers inside its dense blocks. How do you form your intuition about why this helps overall performance? How do these skip-connection-style connections improve on vanilla ResNets?
Can you please quickly explain how a 1x1 conv changes the number of channels? I thought it would be the same number… what don't I get?
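A 1x1 conv has no spatial extent, but it still mixes *channels*: at each pixel it takes a weighted sum over all input channels, once per filter, so the output channel count equals the number of filters, not the input count. A plain-Python sketch (hypothetical toy data, no framework):

```python
def conv1x1(x, weights):
    """x: [C_in][H][W]; weights: [C_out][C_in]. Returns [C_out][H][W].
    Each output channel is a per-pixel weighted sum over input channels."""
    c_in, h, w = len(x), len(x[0]), len(x[0][0])
    return [
        [[sum(weights[co][ci] * x[ci][y][xx] for ci in range(c_in))
          for xx in range(w)]
         for y in range(h)]
        for co in range(len(weights))
    ]

x = [[[1.0, 2.0]], [[3.0, 4.0]]]          # 2 channels, 1x2 "image"
w = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # 3 filters over 2 input channels
y = conv1x1(x, w)
print(len(y))  # 3 output channels from 2 input channels
```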
Are there any ablation studies for DenseNet? Like, do we know how much each of the connections inside a dense block contributes to performance? Are skip connections more valuable than direct connections?