Loop Estimator for Discounted Values in Markov Reward Processes