Can Multi-Modal LLMs Provide Live Step-by-Step Task Guidance? | Read Paper on Bytez