Improving Sound Source Localization with Joint Slot Attention on Image and Audio | Read Paper on Bytez