Groups
A Mixture of Experts (MoE) routes each input to a small subset of specialized models called experts, enabling conditional computation.
Key-Value memory systems store information as pairs where keys are used to look up values by similarity rather than exact match.