538. Noise in SGD as a Benefit
medium

SGD estimates the gradient from a mini-batch rather than the full dataset, so each update uses a noisy version of the true gradient. In non-convex optimization, how can this noise be beneficial?
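
One commonly cited benefit is that gradient noise helps iterates escape saddle points, where the exact gradient vanishes. The following is a minimal sketch, not a definitive demonstration. The toy function f(x, y) = x^2/2 + y^4/4 - y^2/2, the starting point, the step size, and the isotropic Gaussian noise (a stand-in for mini-batch noise) are all assumptions chosen for illustration.

```python
import random

# Hypothetical toy objective: f(x, y) = x**2/2 + y**4/4 - y**2/2.
# It has a saddle point at (0, 0) and two minima at (0, +1) and (0, -1).
def grad(x, y):
    return x, y**3 - y

def descend(noise_std, steps=2000, lr=0.05, seed=0):
    """Gradient descent with additive Gaussian gradient noise.

    The Gaussian noise is an assumed stand-in for mini-batch noise;
    noise_std = 0.0 recovers plain (full-batch) gradient descent.
    """
    rng = random.Random(seed)
    # Start exactly on the line y = 0, which leads straight into the saddle.
    x, y = 1.0, 0.0
    for _ in range(steps):
        gx, gy = grad(x, y)
        if noise_std > 0.0:
            gx += rng.gauss(0.0, noise_std)
            gy += rng.gauss(0.0, noise_std)
        x -= lr * gx
        y -= lr * gy
    return x, y

# Without noise the y-gradient is exactly zero, so the iterate never leaves y = 0.
print("no noise:  ", descend(noise_std=0.0))
# With noise, y is nudged off the unstable direction and rolls toward a minimum.
print("with noise:", descend(noise_std=0.1))
```

In this sketch the noiseless run settles at the saddle, while the noisy run ends near one of the two minima and keeps jittering around it because the noise never decays. Which minimum it reaches depends on the seed.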