ML Paper Challenge Day 28 — On the importance of initialization and momentum in deep learning | by Chun-kit Ho | Medium

Member-only story
ML Paper Challenge Day 28 — On the importance of initialization and momentum in deep learning
Chun-kit Ho
·Follow
3 min read·
May 10, 2020
--
Papers with Code - On the importance of initialization and momentum in deep learningDeep and recurrent neural networks (DNNs and RNNs respectively) are powerful models that were considered to be almost…
paperswithcode.com
Day 28: 2020.05.09
Paper: On the importance of initialization and momentum in deep learning
Category: Model/Deep Learning/Optimization
both the initialization and the momentum are crucial
poorly initialized networks cannot be trained with momentum
well-initialized networks perform markedly worse when the momentum is absent or poorly tuned
carefully tuned momentum methods suffice for dealing with the curvature issues in deep and recurrent network training objectives without the need for sophisticated second-order methods.
Momentum and Nesterov’s Accelerated GradientNesterov’s Accelerated Gradient (NAG) can be viewed as a simple modification of Classical Momentum (CM)which increases stability, and can sometimes provide a…
--
--
Written by Chun-kit Ho134 Followers
·463 Following
cloud architect@ey | full-stack software engineer | social innovation | certified professional solutions architect in aws & gcp
No responses yet
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams