bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration | Read Paper on Bytez