Voila-A: Aligning Vision-Language Models with User's Gaze Attention | Read Paper on Bytez

Devs

Voila-A: Aligning Vision-Language Models with User's Gaze Attention | Read Paper on Bytez