Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms | Read Paper on Bytez