RLPO: Residual Listwise Preference Optimization for Long-Context Review Ranking | Read Paper on Bytez