Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation