Gradient descent

Batch gradient descent.

Gradient Descent Oscillation (difficulty: medium)
8/10

A model trained with gradient descent oscillates between high and low loss values without converging. What is the most likely cause?
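
A frequent cause of this pattern is a learning rate that is too large, so each update overshoots the minimum and lands on the opposite side of it. The sketch below reproduces the behaviour on a toy one-dimensional problem. The loss function, the starting point, and the names `loss`, `grad`, and `run_gd` are all hypothetical choices made for illustration, not part of the original question.

```python
# Toy illustration (assumed setup): an asymmetric quadratic bowl whose
# negative side is steeper than its positive side.
def loss(w):
    return w * w if w >= 0 else 2.0 * w * w


def grad(w):
    # Derivative of the piecewise loss above.
    return 2.0 * w if w >= 0 else 4.0 * w


def run_gd(lr, w0=1.0, steps=6):
    """Run plain gradient descent and return the loss after each step."""
    w = w0
    losses = [loss(w)]
    for _ in range(steps):
        w = w - lr * grad(w)  # standard update: w <- w - lr * dL/dw
        losses.append(loss(w))
    return losses


large = run_gd(lr=0.75)  # step overshoots the minimum on every update
small = run_gd(lr=0.1)   # step is small enough to stay on one side

print(large)  # loss alternates between a higher and a lower value
print(small)  # loss shrinks on every step
```

With the larger step the iterate jumps back and forth across the minimum at `w = 0` and the loss never settles, while the smaller step decreases the loss monotonically. On a real model, other factors (for example noisy mini-batches or poorly scaled features) can produce similar curves, so lowering the learning rate is a first thing to try rather than a guaranteed fix.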