Automatic Reward Shaping from Confounded Offline Data | Read Paper on Bytez