Guided Policy Search as Approximate Mirror Descent | Read Paper on Bytez