Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information | Read Paper on Bytez