MiMo-V2-Flash Technical Report
Intermediate · Xiaomi LLM-Core Team et al. · Jan 6 · arXiv
MiMo-V2-Flash is a large but efficient language model that uses a Mixture-of-Experts ("team of experts") design to reason well while staying fast.
#Mixture-of-Experts #Sliding Window Attention #Global Attention