Revisiting the Hierarchical Multiscale LSTM | Read Paper on Bytez