PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models | Read Paper on Bytez