HN
New
Show
Ask
Jobs
Built with Paraglide and Solid
en
pl
Cutting LLM Batch Inference Time by Half with Dynamic Prefix Bucketing
(daft.ai)
2 points | by
DISCURSIVE
11 hours ago
No comments yet.
No comments yet.