The paper introduces RPG-Encoder, a way to turn a whole code repository into one clear map that mixes meaning (semantics) with structure (dependencies).
Long tasks trip up most AI agents because they lose track of their goals and make small mistakes that snowball over many steps.
ABC-Bench is a new benchmark that checks whether AI coding agents can really do backend work from start to finish, not just write a few lines of code.
This paper is the first big map of how AI can fix real software problems, not just write short code snippets.
This paper builds an open, end-to-end ecosystem (ALE) that lets AI agents plan, act, and fix their own mistakes across many steps in real computer environments.
The paper introduces Nemotron-Cascade, a step-by-step (cascaded) reinforcement learning recipe that trains an AI model across domains such as alignment, instruction following, math, coding, and software engineering, one domain at a time.