Adaptive Preference Scaling for Reinforcement Learning with Human Feedback | Read Paper on Bytez