People thought big AI models were all learning the same overall picture of the world, but those measurements were secretly biased by model size and depth.
Similarity tells you if two models seem to think about things the same way, but it doesn’t tell you if that thinking is sturdy when the world wiggles.