Understanding RL for model training, and future directions with GRAPE

(arxiv.org)

33 points | by sonabinu 3 days ago

1 comments

goerz 2 days ago
Don’t people google their newly coined acronyms? GRAPE is already Gradient-Ascent-Pulse-Engineering, which is arguably “machine learning” (optimal control)