Attention Mechanism, Max-Affine Partition, and Universal Approximation