When Models Examine Themselves: Vocabulary-Activation Correspondence in LLMs

(zenodo.org)

2 points | by patternmatcher 11 hours ago

1 comment