InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training