RePo: Language Models with Context Re-Positioning
IntermediateHuayang Li, Tianyu Zhao et al.Dec 16arXiv
Large language models usually line words up in fixed order slots, which can waste mental energy and make it harder to find the important parts of a long or noisy text.
#context re-positioning#positional encoding#self-attention