Two different tricks for fast LLM inference

(seangoedecke.com)

128 points | by swah 9 hours ago

57 comments