Confidence penalty, annealing Gaussian noise and zoneout for biLSTM-CRF networks for named entity recognition | Read Paper on Bytez