Alignment whack-a-mole: Finetuning activates recall of copyrighted books in LLMs

(github.com)

146 points | by reconnecting 9 hours ago

106 comments