Groups
Category
Temporal (causal) convolution computes each output at time t using only the current and past inputs, ensuring no future information leakage.
Mamba uses a state-space model whose parameters are selected (gated) by the current input token, letting the model adapt its memory dynamics at each step.