Combining LLMs Rarely Beats the Best Single Model, I tested 67 frontier models

(arxiv.org)

1 points | by josefchen 8 hours ago

No comments yet.