Directed-Tokens: A Robust Multi-Modality Alignment Approach to Large Language-Vision Models