bytez
Search
Feed
Models
Agent
Devs
Plan
docs
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs | Read Paper on Bytez