SoTA with less: MCTS-guided sample selection for data-efficient visual reasoning self-improvement

Visual reasoning models that achieve SoTA performance using an order of magnitude fewer training samples.

