Learning Video Representations From Large Language Models