Multi-modal Egocentric Activity Recognition using Audio-Visual Features