On Data Engineering for Scaling LLM Terminal Capabilities
IntermediateRenjie Pi, Grace Lam et al.Feb 24arXiv
This paper shows that you can vastly improve a modelβs command-line (terminal) skills by carefully engineering the training data, not just by using a bigger model.
#Terminal-Bench 2.0#terminal agents#synthetic task generation