Scaling Up Active Testing to Large Language Models | Read Paper on Bytez