R3: robust rubric-agnostic reward models

A novel reward modeling framework that is rubric-agnostic, generalizable, and provides reasoned score assignments.


Latest publications