Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Models
6 months ago · NeurIPS