Multiclass Loss Geometry Matters for Generalization of Gradient Descent in Separable Classification | Read Paper on Bytez