ML Paper Challenge Day 23 — Layer Normalisation
Day 23: 2020.05.04
Paper: Layer Normalisation
Category: Model/Deep Learning/Technique (Layer Normalisation)
Layer Normalisation
Background
- Batch normalisation requires running averages of the summed-input statistics.
- However, the summed inputs to the recurrent neurons in a recurrent neural network (RNN) often vary with the length of the sequence, so applying batch normalisation to RNNs appears to require different statistics for different time-steps.
-> Not really feasible to apply to recurrent neural networks.
-> The effect of batch normalisation also depends on the mini-batch size, so it cannot be applied to online learning tasks or to extremely large distributed models where the mini-batches have to be small (illustrated in the sketch after this list).
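A minimal NumPy sketch (illustrative, not from the paper) of this dependence: batch normalisation takes its mean and variance over the batch axis, so the normalised output of a given example changes when the mini-batch around it changes.

```python
import numpy as np

rng = np.random.default_rng(0)

def batch_norm(x, eps=1e-5):
    # Statistics are computed over the batch axis (axis 0):
    # one mean/variance per feature, shared by every example in the batch.
    mean = x.mean(axis=0, keepdims=True)
    var = x.var(axis=0, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

features = rng.normal(size=(32, 4))      # batch of 32 examples, 4 features each
full = batch_norm(features)[0]           # first example, normalised within a batch of 32
tiny = batch_norm(features[:2])[0]       # same example, normalised within a batch of 2
print(full)  # differs from `tiny`: the output depends on the rest of the mini-batch
print(tiny)
```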
How
- transpose batch normalisation into layer normalisation by directly computing…
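For reference, a minimal sketch of the layer-normalisation statistics described in the paper: the mean and variance are computed over the hidden units of a single example, so the result is independent of the batch size and of the other examples. The gain and bias are assumed here to be simple learnable per-unit vectors.

```python
import numpy as np

def layer_norm(x, gain, bias, eps=1e-5):
    # Statistics are computed per example over the feature axis (axis -1):
    # each example is normalised using only its own summed inputs.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return gain * (x - mean) / np.sqrt(var + eps) + bias

hidden = np.random.default_rng(1).normal(size=(3, 8))  # 3 examples, 8 hidden units
gain, bias = np.ones(8), np.zeros(8)                   # learnable per-unit parameters

out = layer_norm(hidden, gain, bias)
# Example 0 is normalised identically whether it is processed alone or in a batch,
# which is what makes the technique usable for RNNs and online learning.
print(np.allclose(out[0], layer_norm(hidden[:1], gain, bias)[0]))  # True
```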