video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model | Read Paper on Bytez