What value do explicit high level concepts have in vision to language problems?
2015·Arxiv