Surpassing vLLM with a Generated Inference Stack

(infinity.inc)

17 points | by lukebechtel 5 hours ago

4 comments