A Simple Baseline for Unifying Understanding, Generation, and Editing via Vanilla Next-token Prediction | Read Paper on Bytez