Low-Precision Reinforcement Learning: Running Soft Actor-Critic in Half Precision | Read Paper on Bytez