CommonLID: Re-evaluating state-of-the-art language identification performance on web data

A community-driven, human-annotated LID benchmark for the web domain, covering 109 languages.

ACL
July 2, 2026

Topics:


Latest publications