Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
IntermediateSeijin Kobayashi, Yanick Schimpf et al.Dec 23arXiv
The paper shows that big sequence models (like transformers) quietly learn longer goals inside their hidden activations, even though they are trained one step at a time.
#hierarchical reinforcement learning#temporal abstractions#autoregressive models