Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding