Attention Tree: Learning Hierarchies of Visual Features for Large-Scale Image Recognition