R-Zero: Self-Evolving Reasoning LLM from Zero Data

(arxiv.org)

90 points | by lawrenceyan 14 hours ago

42 comments