Rectifying Shortcut Behaviors in Preference-based Reward Learning | Read Paper on Bytez