ViCrit: a verifiable reinforcement learning proxy task for visual perception in VLMs

An RL proxy task that trains VLMs to localize synthetic hallucinations injected into human-written captions.
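As a rough illustration of why such a task is verifiable, the sketch below injects one wrong word into a caption and scores a model's answer by exact match against that word. Everything here is an assumption made for illustration: the function names `inject_hallucination` and `exact_match_reward`, the hand-written substitution table, and the binary exact-match reward are hypothetical and are not taken from the ViCrit implementation.

```python
import random
import string


def inject_hallucination(caption: str, substitutes: dict[str, str],
                         rng: random.Random) -> tuple[str, str]:
    """Swap one word of the caption for a wrong one.

    Returns (corrupted_caption, injected_word). `substitutes` is a
    hypothetical lookup table mapping an original word to its replacement.
    """
    words = caption.split()
    candidates = [i for i, w in enumerate(words) if w in substitutes]
    if not candidates:
        raise ValueError("caption contains no replaceable word")
    i = rng.choice(candidates)
    injected = substitutes[words[i]]
    words[i] = injected
    return " ".join(words), injected


def _normalize(text: str) -> str:
    # Ignore surrounding whitespace, surrounding punctuation, and case.
    return text.strip().strip(string.punctuation).lower()


def exact_match_reward(prediction: str, injected: str) -> float:
    """Binary reward: 1.0 if the predicted span equals the injected word."""
    return 1.0 if _normalize(prediction) == _normalize(injected) else 0.0


if __name__ == "__main__":
    rng = random.Random(0)
    caption = "a brown dog runs across the grass"
    corrupted, injected = inject_hallucination(
        caption, {"brown": "white", "dog": "cat"}, rng
    )
    print(corrupted)
    # A model that names the injected word gets full reward; any other span gets none.
    print(exact_match_reward(injected, injected))
    print(exact_match_reward("grass", injected))
```

Because the injected word is known at construction time, the reward can be checked by string comparison alone, with no learned judge involved; a real setup would presumably generate the substitutions automatically rather than from a fixed table.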

