Quantifying the vanishing gradient and long distance dependency problem in recursive neural networks and recursive LSTMs