Different transformers may have very different weights, yet they often hide the same tiny "engine" inside: a small internal circuit that actually does the task.
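One standard way to check whether two differently weighted models compute the same thing is to compare their activations rather than their weights. Below is a sketch using linear CKA, a common representational-similarity probe; using CKA here is my choice of illustration, not necessarily the paper's method.

```python
import numpy as np

def linear_cka(X: np.ndarray, Y: np.ndarray) -> float:
    """Linear CKA between two activation matrices of shape (samples, features).

    Feed the SAME batch of inputs through two models, grab a hidden layer
    from each, and compare. Values near 1 mean the two models represent
    those inputs similarly even if their raw weights look nothing alike.
    """
    # Center each feature column before comparing.
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    num = np.linalg.norm(X.T @ Y, "fro") ** 2
    den = np.linalg.norm(X.T @ X, "fro") * np.linalg.norm(Y.T @ Y, "fro")
    return float(num / den)
```

The feature dimensions of the two models need not match, since the comparison happens through the (samples x samples) structure implied by the cross-products.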
The paper finds that a simple trick, randomly skipping some parameter updates, can train large language models better than fancy optimizers.
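A minimal sketch of what "randomly skipping some parameter updates" could look like, written as a PyTorch optimizer. The per-element Bernoulli mask, the drop rate, and the plain-SGD base are assumptions for illustration, not the paper's exact recipe.

```python
import torch

class DropUpdateSGD(torch.optim.Optimizer):
    """Plain SGD, except each parameter element's update is randomly
    skipped with probability `drop_rate`. Illustrative only: the paper's
    masking granularity, rate, and schedule may differ."""

    def __init__(self, params, lr=1e-3, drop_rate=0.1):
        super().__init__(params, dict(lr=lr, drop_rate=drop_rate))

    @torch.no_grad()
    def step(self, closure=None):
        for group in self.param_groups:
            for p in group["params"]:
                if p.grad is None:
                    continue
                # Keep-mask: 1 applies this element's update, 0 skips it.
                keep = (torch.rand_like(p) >= group["drop_rate"]).to(p.dtype)
                p.add_(keep * p.grad, alpha=-group["lr"])
```

It drops in like any other optimizer: `opt = DropUpdateSGD(model.parameters(), lr=0.05, drop_rate=0.2)`, then the usual `loss.backward(); opt.step()` loop.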
This survey asks how close AI memory systems are to human memory and organizes the answer into three parts: implicit memory (knowledge inside the model's weights), explicit memory (external storage you can look up), and agentic memory (what an AI agent keeps over time to plan and act).
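To make the three-way split concrete, here is a toy sketch of an agent-memory object; the names, the key-value store, and the episode log are illustrative structure, not the survey's formal definitions.

```python
from dataclasses import dataclass, field

@dataclass
class AgentMemory:
    """Toy illustration of the survey's taxonomy (structure assumed)."""
    # Implicit memory: knowledge baked into frozen model weights;
    # represented here only as an opaque handle.
    model: object = None
    # Explicit memory: external storage the agent can look up directly.
    store: dict[str, str] = field(default_factory=dict)
    # Agentic memory: a running log of experience kept across steps.
    episodes: list[str] = field(default_factory=list)

    def remember(self, key: str, fact: str) -> None:
        self.store[key] = fact          # write to explicit memory

    def recall(self, key: str) -> str | None:
        return self.store.get(key)      # explicit lookup

    def log(self, event: str) -> None:
        self.episodes.append(event)     # accumulate agentic memory
```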