Theoretical Understanding of Batch-normalization: A Markov Chain Perspective
2020·Arxiv