Groups
Category
Mamba uses a state-space model whose parameters are selected (gated) by the current input token, letting the model adapt its memory dynamics at each step.
Spectral analysis studies the distribution of eigenvalues and singular values of neural network weight matrices during training.