Robust Reward Alignment via Hypothesis Space Batch Cutting