An Alternative Softmax Operator for Reinforcement Learning | Read Paper on Bytez