30. Attention Mechanism Computation
easy

What does the attention mechanism compute for each token in a transformer?
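
For reference, here is a minimal sketch of the computation the question is asking about. It assumes the standard single-head scaled dot-product form with no masking. The function names `softmax` and `attention` are illustrative, not from any particular library. The learned projections that produce the queries, keys, and values from the token embeddings are omitted, so the three inputs are taken as given.

```python
import math

def softmax(scores):
    # Subtract the max for numerical stability before exponentiating.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Single-head scaled dot-product attention over lists of vectors (illustrative)."""
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Score this token's query against every token's key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        # Normalize the scores into weights that are non-negative and sum to 1.
        weights = softmax(scores)
        # The token's output is the weighted average of all value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values))
               for j in range(len(values[0]))]
        outputs.append(out)
    return outputs
```

In this sketch each token's output is a weighted average of the value vectors, with weights determined by how strongly that token's query matches each key.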