On transfer learning using a MAC model variant | Read Paper on Bytez