Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning | Read Paper on Bytez