210. Depth vs Width in Neural Networks
medium

Why are deep networks (many layers) often preferred over wide shallow networks for complex tasks?
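
One way to make the question concrete is depth efficiency for ReLU networks on a 1-D input. The sketch below is an illustration under stated assumptions, not a complete answer: it stacks a two-unit ReLU layer that acts as a tent map on [0, 1] and counts how many linear pieces the composed function has. The names `tent_layer`, `deep_sawtooth`, and `count_linear_pieces`, and the grid size, are hypothetical choices made for this example.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def tent_layer(x):
    # One hidden layer with two ReLU units. On [0, 1] this equals the
    # tent map: 2x for x <= 0.5, and 2(1 - x) for x > 0.5.
    return 2.0 * relu(x) - 4.0 * relu(x - 0.5)

def deep_sawtooth(x, depth):
    # Compose the same two-unit layer `depth` times (2 * depth hidden units).
    for _ in range(depth):
        x = tent_layer(x)
    return x

def count_linear_pieces(x, y):
    # Count linear pieces by counting slope changes between adjacent
    # grid intervals. Assumes every breakpoint lies on a grid point.
    slopes = np.diff(y) / np.diff(x)
    changes = ~np.isclose(slopes[1:], slopes[:-1])
    return int(np.count_nonzero(changes)) + 1

# Dyadic grid on [0, 1], so breakpoints at multiples of 1/2**depth
# fall exactly on grid points for depth <= 12.
x = np.arange(4097) / 4096.0

for depth in range(1, 7):
    pieces = count_linear_pieces(x, deep_sawtooth(x, depth))
    deep_units = 2 * depth
    # A one-hidden-layer ReLU net with n units on a scalar input has at
    # most n + 1 linear pieces, so it needs at least pieces - 1 units.
    shallow_units = pieces - 1
    print(f"depth={depth}  deep units={deep_units}  "
          f"pieces={pieces}  shallow units needed>={shallow_units}")
```

In this construction the number of linear pieces doubles with each added layer, while the unit count grows by only two. A single hidden layer would need a number of units that grows exponentially in the depth to represent the same function exactly. This covers representational efficiency only; it does not address optimization, generalization, or the cases where wider networks train more easily.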