Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning | Read Paper on Bytez