Compatible Value Gradients for Reinforcement Learning of Continuous Deep Policies