Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer