Context-Aware Integration of Language and Visual References for Natural Language Tracking | Read Paper on Bytez