RainbowPO: A unified framework for combining improvements in Preference Optimization

This new framework enhances preference optimization for better AI alignment with human values.

ICLR
April 24, 2025

Latest publications