Transformer-Based Learned Optimization | Read Paper on Bytez