EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization
4 months ago · arXiv