Reinforcement learning towards broadly and persistently beneficial models

(alignment.openai.com)

2 points | by gmays 8 hours ago

No comments yet.