Q-learning as a monotone scheme