6. Activation Function for Vanishing Gradients
easy

Which activation function most directly addresses the vanishing gradient problem in deep networks?
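
For context, the problem the question refers to can be sketched numerically. The snippet below is only an illustration under simplifying assumptions: it uses the sigmoid as an example of a saturating activation, ignores the weight matrices, and follows a single path through the network, so the backpropagated gradient is scaled by the product of the per-layer activation derivatives. The helper names (`sigmoid_grad`, `backprop_factor`) are hypothetical and not taken from any library.

```python
import math


def sigmoid(x: float) -> float:
    """Logistic sigmoid."""
    return 1.0 / (1.0 + math.exp(-x))


def sigmoid_grad(x: float) -> float:
    """Derivative of the sigmoid; it never exceeds 0.25 (reached at x = 0)."""
    s = sigmoid(x)
    return s * (1.0 - s)


def backprop_factor(pre_activations):
    """Product of local sigmoid derivatives along one path through the layers.

    Assumption: weights are ignored, so this isolates the activation's
    contribution to how the gradient is scaled during backpropagation.
    """
    factor = 1.0
    for z in pre_activations:
        factor *= sigmoid_grad(z)
    return factor


# Best case for the sigmoid: every pre-activation sits at 0, where the
# derivative is largest. The factor still shrinks geometrically with depth.
shallow = backprop_factor([0.0] * 2)
deep = backprop_factor([0.0] * 20)
print(f"2 layers:  {shallow:.3e}")
print(f"20 layers: {deep:.3e}")
```

Even in this best case the factor is at most 0.25 per layer, so it decays geometrically as layers are added. An activation whose derivative does not shrink toward zero over its active region avoids this particular source of decay.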