On Training Bi-directional Neural Network Language Model with Noise Contrastive Estimation | Read Paper on Bytez