daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently
Intermediate · Mohan Jiang, Dayuan Fu et al. · Feb 2 · arXiv
Long tasks trip up most AI models: they lose track of their goals and make small mistakes that snowball over many steps.
#long-horizon agency #pull request chains #software evolution