bytez
Search
Feed
Models
Agent
Devs
Model API
docs
Speculative Decoding with CTC-based Draft Model for LLM Inference Acceleration | Read Paper on Bytez