Sharp Gap-Dependent Variance-Aware Regret Bounds for Tabular MDPs | Read Paper on Bytez