b
Discover
Models
Search
About
Understanding Scaling Laws with Statistical and Approximation Theory for Transformer Neural Networks on Intrinsically Low-dimensional Data
1 week ago
·
NeurIPS