Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator