Show HN: Terminal-Bench-RL: Training long-horizon terminal agents with RL

(github.com)

124 points | by Danau5tin 2 days ago

12 comments