Talking Heads: Understanding Inter-Layer Communication in Transformer Language Models | Read Paper on Bytez