On the Consistency of Video Large Language Models in Temporal Comprehension | Read Paper on Bytez