Bridging the Divide: Reconsidering Softmax and Linear Attention | Read Paper on Bytez