b
Discover
Search
About
VITR: Augmenting Vision Transformers with Relation-Focused Learning for Cross-Modal Information Retrieval
2023
·
arXiv