Towards unlimited contexts: faster-than-GPU sparse logarithmic attention on CPU [video]

(youtube.com)

3 points | by mfiguiere 14 hours ago

No comments yet.