Choose What You Need: Disentangled Representation Learning for Scene Text Recognition Removal and Editing | Read Paper on Bytez