R-Zero: Self-Evolving Reasoning LLM from Zero Data

(arxiv.org)

3 points | by Anon84 3 days ago

1 comments