The Role of Sparsity for Length Generalization in LLMs | Read Paper on Bytez