JetSpec Enables Up to 9.64x Lossless LLM Inference Speedup with Up to 1000TPS

(haoailab.com)

4 points | by snyhlxde 11 hours ago

1 comments