Hello, I'm Sylar.
I build things, break things, and write about what I learn along the way.
All Posts
vLLM-Omni Quantized Inference in Practice
5 June 2026 • 3 minute read
For cutting the memory and latency costs of large diffusion Transformers without retraining, post-training quantization is the main...
That's all the posts so far!
Contact
You can find me on any of the following platforms: