Language Model Contains Personality Subnetworks

(arxiv.org)

42 points | by PaulHoule 7 hours ago

26 comments