TMax: The Open Recipe for Terminal Agents That Challenges Claude and Kimi
AllenAI unveils TMax, an open dataset of RL environments and a training recipe that yields compact terminal agents up to 27B parameters. The 9B model beats all open sub-10B contenders on Terminal Bench 2.0 and approaches closed systems like Claude Ha...