Skip to content

Commit 5b8557a

Browse files
authored
Merge pull request huggingface#306 from Drakrig/unit2-convnextr-fix
Fixes for Unit 2 - ConvNext
2 parents 4890225 + 1b395d9 commit 5b8557a

File tree

1 file changed

+4
-5
lines changed

1 file changed

+4
-5
lines changed

chapters/en/unit2/cnns/convnext.mdx

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -12,9 +12,9 @@ The key improvements are:
1212
- Training techniques
1313
- Macro design
1414
- ResNeXt-ify
15-
- Inverted bottleneck
16-
- Large kernel sizes
17-
- Micro design
15+
- Inverted Bottleneck
16+
- Large Kernel Sizes
17+
- Micro Design
1818

1919
We will go through each of the key improvements.
2020
These designs are not novel in itself. However, you can learn how researchers adapt and modify designs systematically to improve existing models.
@@ -28,10 +28,9 @@ The researchers first discerned that, while architectural design choices are cru
2828
Inspired by DeiT and Swin Transformers, ConvNext closely adapts their training techniques. Some of the notable changes are:
2929
- Epochs: Extending the epochs from the original 90 epochs to 300 epochs.
3030
- Optimizer: Using AdamW optimizer instead of Adam optimizer, which differs in how it handles weight decay.
31-
- Regularization: Using Stochastic Depth and Label Smoothing as regularization techniques.
3231
- Mixup (generates a weighted combination of random image pairs), Cutmix (cuts part of an image and replace it with a patch from another image), RandAugment (applies a series of random augmentations such as rotation, translation, and shear), and Random Erasing (randomly selects a rectangle region in an image and erases its pixels with random values) to increase training data.
33-
Modifying these training procedures has improved ResNet-50's accuracy from 76.1% to 78.8%.
3432
- Regularization: Using Stochastic Depth and Label Smoothing as regularization techniques.
33+
3534
Modifying these training procedures has improved ResNet-50's accuracy from 76.1% to 78.8%.
3635

3736

0 commit comments

Comments
 (0)