Understanding the Differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks | Read Paper on Bytez