On Sampling-Based Training Criteria for Neural Language Modeling