Correct-by-synthesis reinforcement learning with temporal logic constraints