Theoretical Investigation of Adafactor for Non-Convex Smooth Optimization | Read Paper on Bytez