Stories in the Eye: Contextual Visual Interactions for Efficient Video to Language Translation | Read Paper on Bytez