VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval | Read Paper on Bytez