On the Power of Decision Trees in Auto-Regressive Language Modeling