Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers
2018·Arxiv