b
Discover
Models
Search
About
Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exiting
4 weeks ago
·
NeurIPS