Sieve: Multimodal Dataset Pruning using Image Captioning Models | Read Paper on Bytez