Getting the subtext without the text: Scalable multimodal sentiment classification from visual and acoustic modalities | Read Paper on Bytez