A Real-Time Cross-modality Correlation Filtering Method for Referring Expression Comprehension
2019·Arxiv