GPT-OSS Reinforcement Learning

(docs.unsloth.ai)

180 points | by vinhnx 4 days ago

44 comments