ROCKET: Rapid Optimization via Calibration-guided Knapsack Enhanced Truncation for Efficient Model Compression
Intermediate | Ammar Ali, Baher Mohammad et al. | Feb 11 | arXiv
ROCKET is a fast, training-free way to shrink large AI models while preserving most of their performance.
#model compression #training-free compression #sparse factorization