What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models