Transferring Linear Features Across Language Models With Model Stitching