Large language models usually line words up in fixed order slots, which can waste mental energy and make it harder to find the important parts of a long or noisy text.
Big language models use RoPE to remember word order, but it throws away the imaginary half of a complex number during attention.