MAttNet: Modular Attention Network for Referring Expression Comprehension | Read Paper on Bytez