Convergence of Policy Mirror Descent Beyond Compatible Function Approximation | Read Paper on Bytez