Minimax Optimal Regret Bound for Reinforcement Learning with Trajectory Feedback | Read Paper on Bytez