On the Convergence of Encoder-only Shallow Transformers
2023 · NeurIPS