More Grounded Image Captioning by Distilling Image-Text Matching Model | Read Paper on Bytez