bytez
Search
Feed
Models
Agent
Devs
Plan
docs
CAS-Spec: Cascade Adaptive Self-Speculative Decoding for On-the-Fly Lossless Inference Acceleration of LLMs | Read Paper on Bytez