Smooth Gate Functions for Soft Advantage Policy Optimization