b

DiscoverModelsSearch
About
Kangaroo: Lossless Self-Speculative Decoding for Accelerating LLMs via Double Early Exiting
4 weeks ago
·
NeurIPS