Optimizing reasoning efficiency through prompt difficult prediction

A routing approach that assigns each problem to the smallest model likely to solve it, reducing compute.

NeurIPS
December 2, 2025

Latest publications