Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding | Read Paper on Bytez