b
Discover
Models
Search
About
Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration
2 weeks ago
·
NeurIPS