Unbiased Evaluation of Large Language Models from a Causal Perspective | Read Paper on Bytez