How to Train Neural Networks Fast and Efficiently | Tutorial

Science and Technology

0:00 Multi-GPU Training
2:15 Cyclic Learning Rate Schedules
3:07 Mixup: Beyond Empirical Risk Minimization
3:44 Label Smoothing
4:28 Deep Double Descent
5:55 Transfer Learning
6:18 Mixed Precision Training (Theory)
10:00 Mixed Precision Training (Tutorial)
@CodeEmporium (Ajay's channel)
/ @codeemporium
Code for the video
gist.github.com/ajhalthor/140...
Code behind the DCGAN with Apex
github.com/NVIDIA/apex/blob/m...
Mixed Precision Training
arxiv.org/abs/1710.03740
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
arxiv.org/abs/1706.02677
Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes
arxiv.org/abs/1807.11205
Don't Decay the Learning Rate, Increase the Batch Size
openreview.net/pdf?id=B1Yy1BxCZ
On the Variance of the Adaptive Learning Rate and Beyond
arxiv.org/abs/1908.03265
Cyclical Learning Rates for Training Neural Networks
arxiv.org/abs/1506.01186
The 1cycle policy
sgugger.github.io/the-1cycle-...
Super-Convergence: Very Fast Training of Neural Networks Using Large Learning Rates
arxiv.org/pdf/1708.07120.pdf
Bag of Tricks for Image Classification with Convolutional Neural Networks
arxiv.org/pdf/1812.01187.pdf
mixup: Beyond Empirical Risk Minimization
arxiv.org/abs/1710.09412
Deep Double Descent
openai.com/blog/deep-double-d...
Deep Double Descent: Where Bigger Models and More Data Hurt
arxiv.org/abs/1912.02292
Reconciling modern machine learning practice and the bias-variance trade-off
arxiv.org/abs/1812.11118

Comments: 9

  • @kemchobhenchod
    4 years ago

    Your videos are getting better and better. If you put any courses available online I will definitely subscribe.

  • @Success_Unlimited_
    1 year ago

    Nice work! Can you recommend material that would help me understand, in practice, how to build a neural network for energy management? Something with examples.

  • @pablorodrigogantiercadena4047
    4 years ago

    It seems you've all been watching Jeremy Howard and fast.ai. That's a good thing, don't get me wrong; these techniques are really good 👍

  • @leoisikdogan
    4 years ago

    I didn't know they had videos but I've been reading some of their blog posts and papers they recommended :) Fast.ai is indeed a proponent of Leslie Smith's research on learning rate schedules that we covered in the video. I have a list of the original papers and blog posts in the description.

  • @sergey_koryagin
    2 years ago

    cool

  • @habtelejebo895
    1 year ago

    Can you do Particle Swarm Optimization for ANNs in Python?
