STAR: Efficient Preference-based Reinforcement Learning via Dual Regularization | Read Paper on Bytez