On the Surprising Effectiveness of Attention Transfer for Vision Transformers