Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning | Read Paper on Bytez