A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
5 months ago · CVPR