Improving consistency in retrieval-augmented systems with group similarity reward

An RL approach that leverages multiple rollouts across paraphrased set to assign group similarity rewards.


Latest publications