Show HN: Docker Model Runner Integrates vLLM for High-Throughput Inference

(github.com)

6 points | by ericcurtin 7 hours ago

1 comments

ericcurtin 7 hours ago
I'm one of the devs, happy to answer any questions