AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration | Read Paper on Bytez