When Maximum Entropy Misleads Policy Optimization | Read Paper on Bytez