bytez
Search
Feed
Models
Agent
Devs
Plan
docs
HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization | Read Paper on Bytez