Interactive Cross-modal Learning for Text-3D Scene Retrieval