Sparse autoencoders (SAEs) are popular for explaining what large language models are doing, but this paper shows they often don't learn real, meaningful features.
ROCKET is a fast, training-free way to shrink big AI models while keeping most of their smarts.