13. Adaptive Learning Rates in Adam
medium

An adaptive optimizer like Adam adjusts the learning rate per parameter. What information does it use to do this?
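
As a hint toward the answer, here is a minimal sketch of one Adam update in plain Python. It is not taken from any particular library: the function name `adam_step`, the list-of-floats representation, and the argument layout are assumptions made for illustration, and the default hyperparameters are the commonly cited ones. The point to notice is which per-parameter quantities are tracked from step to step.

```python
import math

def adam_step(params, grads, m, v, t,
              lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one Adam update (hypothetical helper, illustration only).

    params, grads, m, v are equal-length lists of floats;
    t is the 1-based step count used for bias correction.
    Returns (new_params, new_m, new_v).
    """
    new_params, new_m, new_v = [], [], []
    for p, g, m_i, v_i in zip(params, grads, m, v):
        # Running average of the gradient (first moment).
        m_i = beta1 * m_i + (1 - beta1) * g
        # Running average of the squared gradient (second moment).
        v_i = beta2 * v_i + (1 - beta2) * g * g
        # Bias correction: both averages start at zero.
        m_hat = m_i / (1 - beta1 ** t)
        v_hat = v_i / (1 - beta2 ** t)
        # Per-parameter step: scaled by the root of the second moment.
        p = p - lr * m_hat / (math.sqrt(v_hat) + eps)
        new_params.append(p)
        new_m.append(m_i)
        new_v.append(v_i)
    return new_params, new_m, new_v

# Two parameters whose gradients differ by four orders of magnitude.
params, m, v = [1.0, 1.0], [0.0, 0.0], [0.0, 0.0]
params, m, v = adam_step(params, [100.0, 0.01], m, v, t=1)
```

In this sketch both parameters move by roughly `lr` on the first step even though their gradients differ greatly in scale, because each update is divided by that parameter's own gradient-magnitude estimate.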