Espresso: High Compression For Rich Extraction From Videos for Your Vision-Language Model | Read Paper on Bytez