SWE-RM: Execution-free Feedback For Software Engineering Agents
IntermediateKaShun Shum, Binyuan Hui et al.Dec 26arXiv
Coding agents used to fix software rely on feedback; unit tests give only pass/fail signals that are often noisy or missing.
#execution-free feedback#reward model#software engineering agents