EgoSchema: A Diagnostic Benchmark for Very Long-form Video Language Understanding