Exclusively Penalized Q-learning for Offline Reinforcement Learning | Read Paper on Bytez