Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
7 months ago · arXiv