RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals | Read Paper on Bytez