Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent | Read Paper on Bytez