Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks

Devs

Multi-teacher knowledge distillation as an effective method for compressing ensembles of neural networks | Read Paper on Bytez