Benchmarking Large Language Models on CMExam - A comprehensive Chinese Medical Exam Dataset | Read Paper on Bytez