Uncovering Temporal Context for Video Question and Answering | Read Paper on Bytez