Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models | Read Paper on Bytez