723. SGD Non-Convergence to Precise Minimum
medium

Mini-batch SGD does not converge to a precise minimum even with enough iterations. Why?