bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Thinking in Text and Images: Interleaved Vision--Language Reasoning Traces for Long-Horizon Robot Manipulation | Read Paper on Bytez