Accelerated On-Device Forward Neural Network Training with Module-Wise Descending Asynchronism | Read Paper on Bytez