Which Attention Heads Matter for In-Context Learning?