Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers