NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache
2 weeks ago · arXiv