Video captioning with recurrent networks based on frame- and video-level features and visual content classification | Read Paper on Bytez