Refusal in Language Models Is Mediated by a Single Direction | Read Paper on Bytez