Mutual Information Estimation via $f$-Divergence and Data Derangements | Read Paper on Bytez