COMPOT is a training-free way to shrink Transformer models while keeping their smarts.
ROCKET is a fast, training-free way to shrink big AI models while keeping most of their smarts.