Learning Joint Representations of Videos and Sentences with Web Image Search

Devs

Learning Joint Representations of Videos and Sentences with Web Image Search | Read Paper on Bytez