C-3DPO: Constrained Controlled Classification for Direct Preference Optimization | Read Paper on Bytez