Optimal Regret Bounds via Low-Rank Structured Variation in Non-Stationary Reinforcement Learning | Read Paper on Bytez