ResNet on CIFAR10

Configuration

Imports

Configuration

Data

Trivial Augment, arXiv:2103.10158 [cs.CV]

Random Erasing, arXiv:1708.04896 [cs.CV]

Batch Augmentation, arXiv:1901.09335 [cs.LG]

Model

Improvements of ResNet from arXiv:1812.01187 [cs.CV]

SiLU activation $x\cdot\sigma(x)$, arXiv:1702.03118 [cs.LG]; more general Swish activation $x\cdot\sigma(\beta x)$, arXiv:1710.05941 [cs.NE]

Loss

Label smoothing introduced in arXiv:1512.00567 [cs.CV]

CutMix

CutMix, arXiv:1905.04899 [cs.CV]

Lookahead optimizer

Lookahead optimizer, arXiv:1907.08610 [cs.LG]

Code from https://github.com/alphadl/lookahead.pytorch

Training

History

Trainer Setup

AdamW optimizer, arXiv:1711.05101 [cs.LG]

"1cycle" leraning rate policy, arXiv:1803.09820 [cs.LG]

Training