Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features | Read Paper on Bytez