Cyclical learning rates vary the learning rate periodically between a minimum and a maximum value during training. What is the intended benefit?
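For concreteness, here is a minimal sketch of one common form of such a schedule, the triangular policy, in which the rate rises linearly from the minimum to the maximum and then falls back. The question does not say which policy is meant, so the function name `triangular_lr`, the parameter names, and the default values are all illustrative assumptions.

```python
import math


def triangular_lr(step, base_lr=0.001, max_lr=0.006, step_size=2000):
    """Learning rate at a given training step under a triangular cycle.

    All names and defaults are hypothetical, chosen for illustration.
    step_size is the number of steps in half a cycle (min -> max).
    """
    # Index of the current cycle, starting at 1.
    cycle = math.floor(1 + step / (2 * step_size))
    # Distance from the peak of the current cycle, scaled to [0, 1].
    x = abs(step / step_size - 2 * cycle + 1)
    # Linear interpolation between base_lr (x = 1) and max_lr (x = 0).
    return base_lr + (max_lr - base_lr) * max(0.0, 1.0 - x)


if __name__ == "__main__":
    for s in (0, 1000, 2000, 3000, 4000):
        print(s, triangular_lr(s))
```

With these assumed defaults, the rate starts at `base_lr`, reaches `max_lr` after `step_size` steps, and returns to `base_lr` after `2 * step_size` steps, at which point the cycle repeats.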