CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Read Paper on Bytez