Leveraging parameter space symmetries for reasoning skill transfer in LLMs

Utilizing an alignment-first strategy to transfer advanced reasoning skills to a non-reasoning model.


Latest publications