804. Vanishing Gradient Problem
medium

What is the vanishing gradient problem in the context of gradient descent for deep networks?