Benign Overfitting in Token Selection of Attention Mechanism | Read Paper on Bytez