b
Discover
Models
Search
About
On Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability
1 month ago
·
NeurIPS