Learning Spatially-Aware Language and Audio Embeddings | Read Paper on Bytez