Show HN: FlashQwen – A from-scratch CUDA inference engine for Qwen3

(github.com)

5 points | by langtang1996 4 days ago

No comments yet.