EuroBERT: A Next-Generation Multilingual Encoder Model Family for Language Technology
EuroBERT, a newly developed family of multilingual encoder models, marks a significant advance in language technology. It processes 15 European and global languages more efficiently and handles sequences of up to 8,192 tokens. Officially introduced on 10 March 2025, the EuroBERT family was trained on a dataset of