Language model scaling laws and zero-sum learning

Investigating the relationship between language model size, training dynamics and the phenomenon of zero-sum learning.

NeurIPS
December 10, 2024

Latest publications