Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding | Read Paper on Bytez