Standard softmax attention costs O(n²) in the sequence length n, because each of the n tokens is compared with every other token, giving n² pairwise scores.
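To make the quadratic cost concrete, here is a minimal pure-Python sketch of single-head attention. It assumes the common scaled dot-product form (scores divided by the square root of the feature dimension), with no masking or batching; the names `softmax` and `attention` and the list-of-lists layout are illustrative choices, not taken from any particular library.

```python
import math

def softmax(row):
    # Subtract the row maximum before exponentiating for numerical stability.
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]

def attention(Q, K, V):
    # Q, K, V: lists of n vectors (assumed layout; Q and K share dimension d).
    d = len(Q[0])
    scale = 1.0 / math.sqrt(d)
    # One score per (query, key) pair: this n x n table is the O(n^2) part.
    scores = [
        [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in K]
        for q in Q
    ]
    # Normalize each query's scores into weights that sum to 1.
    weights = [softmax(row) for row in scores]
    # Each output vector is a weighted average of the value vectors.
    out = [
        [sum(w * v[j] for w, v in zip(row, V)) for j in range(len(V[0]))]
        for row in weights
    ]
    return out, weights

# Toy input: n = 4 tokens, d = 2 features, used as Q, K, and V alike.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
out, weights = attention(X, X, X)
print(len(weights) * len(weights[0]))  # number of pairwise scores, n * n
```

With n = 4 the weight table holds 16 entries; doubling n to 8 gives 64, which is the quadratic growth described above.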
Systematic debugging beats guesswork: always re-read the statement, re-check constraints, and verify the output format before touching code.