SonicVisionLM: Playing Sound with Vision Language Models