GDPVal: Measuring the performance of our models on real-world tasks

(openai.com)

39 points | by BGyss 2 days ago

11 comments