FLORES-extensions Collection Partial translations of the FLORES(+) dataset and translations into non-textual modalities (speech, ASL). • 5 items • Updated about 14 hours ago
jasonrichdarmawan/nllb-primary-datasets-public-data-embedding Viewer • Updated Sep 24, 2025 • 10.7M • 68 • 1
OLDI and friends Collection This collection groups the datasets that have been featured as part of WMT’s Open Language Data Initiative shared task. • 5 items • Updated about 1 month ago • 5