Lion-K CCWD: Corrected Cautious Weight Decay and Hyperparameter Transfer

(jiha-kim.github.io)

1 point | by ibobev 8 hours ago