Alignment-Weighted DPO: A novel way to improve alignment in LLMs via reasoning.

A DPO that targets the most problematic parts of an output by assigning different preference weights.


Latest publications