Understanding and Minimising Outlier Features in Neural Network Training | Read Paper on Bytez