A multilingual dataset for NER covering 91 langauges and 25 scripts
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated a dataset about 2 months ago
whoisjones/sudoku authored a paper about 2 months ago
Hierarchical Text Classification with LLM-Refined Taxonomies updated a dataset about 2 months ago
whoisjones/mazeOrganizations
models 12
whoisjones/otter-bi-mmbert
Token Classification • 0.5B • Updated • 685
whoisjones/otter-bi-rembert
Updated • 1
whoisjones/otter-ce-rembert
Updated
whoisjones/otter-ce-mmbert
Updated
whoisjones/finerweb-multilabel-classifier-xlmr-4o
Text Classification • 0.3B • Updated • 6
whoisjones/finerweb-binary-classifier-xlmr-4o
Text Classification • 0.3B • Updated • 6
whoisjones/finerweb-binary-classifier-xlmr-gemma3
Text Classification • 0.3B • Updated • 2
whoisjones/finerweb-multilabel-classifier-xlmr-gemma3
Text Classification • 0.3B • Updated • 2
whoisjones/finerweb-binary-classifier-mdeberta-gemma3
Text Classification • 0.3B • Updated • 2
whoisjones/finerweb-binary-classifier-mdeberta-4o
Text Classification • 0.3B • Updated • 1
datasets 28
whoisjones/sudoku
Viewer • Updated • 1.42M • 18
whoisjones/maze
Viewer • Updated • 9k • 13
whoisjones/multinerd
Viewer • Updated • 1.67M • 31
whoisjones/masakhaner
Viewer • Updated • 153k • 24 • 1
whoisjones/uner
Viewer • Updated • 66.8k • 15
whoisjones/fiNERweb
Viewer • Updated • 3.98M • 956 • 7
whoisjones/fiNERweb-x
Updated • 78
whoisjones/fiNERweb-x-multi
Updated • 337
whoisjones/fiNERweb-gemma-x-multi
Updated • 36
whoisjones/fiNERweb-4o-x-multi
Updated • 55