MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs