719. SGD and Saddle Point Escape
medium

How does the gradient noise in SGD help it escape saddle points, compared with full-batch gradient descent?
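
To make the question concrete, here is a minimal sketch on a toy problem. It is an illustration under stated assumptions, not a reference answer. The objective f(x, y) = 0.5*x^2 + 0.25*y^4 - 0.5*y^2 is a hypothetical choice with a saddle at (0, 0) and minima at (0, +1) and (0, -1). Isotropic Gaussian noise added to the exact gradient stands in for minibatch noise, which is a simplification: real SGD noise depends on the data and is generally not isotropic. The names `grad` and `descend` and all hyperparameters are made up for this example.

```python
import numpy as np

def grad(p):
    # Exact gradient of the assumed toy objective
    # f(x, y) = 0.5*x^2 + 0.25*y^4 - 0.5*y^2.
    x, y = p
    return np.array([x, y**3 - y])

def descend(noise_std, steps=2000, lr=0.05, seed=0):
    # Gradient descent with optional Gaussian gradient noise.
    # noise_std = 0.0 gives plain (full-batch) gradient descent;
    # noise_std > 0 is a crude stand-in for SGD's minibatch noise.
    rng = np.random.default_rng(seed)
    # Start exactly on the saddle's stable manifold (y = 0).
    p = np.array([1.0, 0.0])
    for _ in range(steps):
        g = grad(p) + noise_std * rng.standard_normal(2)
        p = p - lr * g
    return p

gd_final = descend(noise_std=0.0)    # noise-free run
sgd_final = descend(noise_std=0.01)  # noisy run

print("GD  final point:", gd_final)
print("SGD final point:", sgd_final)
```

In the noise-free run the y-component of the gradient is exactly zero along y = 0, so the iterate slides into the saddle and stays there. In the noisy run, any perturbation in y is amplified by a factor of roughly (1 + lr) per step along the negative-curvature direction, so the iterate leaves the saddle and settles near one of the two minima.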