Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception | Read Paper on Bytez