Meta-Learning Objectives for Preference Optimization | Read Paper on Bytez