Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge | Read Paper on Bytez