b

DiscoverModelsSearch
About
The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit
2023
·
NeurIPS