Staleness-aware Async-SGD for Distributed Deep Learning