Neural networks are commonly initialized so that the theoretical variance of the hidden pre-activations stays constant, in order to avoid the vanishing and exploding gradient problem. Although this condition is necessary to train very deep networks, numerous analyses have shown that it is not sufficient. We explain this fact by analyzing the behavior of the empirical variance, which is more meaningful in practice for data sets of finite size, and show that its discrepancy with the theoretical variance grows with depth. We study the output distribution of neural networks at initialization in terms of its kurtosis, which we find grows to infinity with increasing depth even when the theoretical variance stays constant. As a consequence, the empirical variance vanishes: its asymptotic distribution converges in probability to zero. Our analysis, which traces this effect to the increasing dependence between outputs, focuses on fully-connected ReLU networks with He initialization, but we hypothesize that many more random weight initialization methods suffer from either vanishing or exploding empirical variance. We support this hypothesis experimentally and demonstrate the failure of state-of-the-art random initialization methods in very deep regimes.
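The phenomenon can be observed in a simple simulation. The following sketch (not taken from the paper; width, depth, and batch size are illustrative choices) propagates a finite batch of inputs through a fully-connected ReLU network with He initialization and prints the empirical variance of the pre-activations across the batch, which shrinks with depth even though the theoretical per-layer variance is constant.

```python
# Illustrative sketch (assumed setup, not the paper's experiment):
# deep fully-connected ReLU network with He initialization,
# tracking the empirical variance over a finite batch of inputs.
import numpy as np

rng = np.random.default_rng(0)

width = 256        # hidden width (illustrative)
depth = 200        # number of layers (illustrative)
batch_size = 64    # finite data set of inputs

# Finite sample of inputs with unit variance per coordinate.
x = rng.standard_normal((batch_size, width))

h = x
for layer in range(1, depth + 1):
    # He initialization: Var(W_ij) = 2 / fan_in keeps the *theoretical*
    # pre-activation variance constant under ReLU.
    W = rng.standard_normal((width, width)) * np.sqrt(2.0 / width)
    z = h @ W.T
    h = np.maximum(z, 0.0)  # ReLU

    if layer % 50 == 0:
        # Empirical variance across the batch, averaged over units.
        emp_var = z.var(axis=0).mean()
        print(f"depth {layer:4d}: mean empirical variance = {emp_var:.4f}")
```

For a single draw of the weights, the printed values decay with depth, consistent with the claim that the empirical variance over a finite data set vanishes even when the theoretical variance is preserved by the initialization.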