SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI

(arxiv.org)

123 points | by mpweiher 4 days ago

48 comments