Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks