Datasets
updated
Viewer
• Updated • 183k • 1.1k
• 295
Viewer
• Updated • 2.94M • 32.9k
• 1.52k
Viewer
• Updated • 1.33k • 28.8k
• 458
Viewer
• Updated • 1M • 23.4k
• 818
databricks/databricks-dolly-15k
Viewer
• Updated • 15k • 33.8k
• 953
togethercomputer/RedPajama-Data-1T
Viewer
• Updated • 1.73M • 2.08k
• 1.15k
Viewer
• Updated • 201k • 107
• 32
Viewer
• Updated • 6.29k • 2.18k
• 6
Viewer
• Updated • 64.3k • 1.71k
• 12
Viewer
• Updated • 9.35M • 3.16k
• 10
Viewer
• Updated • 2.68M • 1.8k
• 4
Viewer
• Updated • 6.87k • 22k
• 4
Viewer
• Updated • 4.64M • 2.15k
• 17
Viewer
• Updated • 5.54M • 356
• 3
Viewer
• Updated • 5.33M • 565
• 15
Viewer
• Updated • 538k • 279
• 3
mteb/arxiv-clustering-s2s
Viewer
• Updated • 31 • 3.29k
• 1
Viewer
• Updated • 68.1k • 76
• 10
Viewer
• Updated • 21.4k • 40
• 1
mteb/amazon_reviews_multi
Viewer
• Updated • 2.52M • 2.34k
• 27
Viewer
• Updated • 19.9k • 3.37k
• 17
Updated • 1.39k
• 2
mteb/toxic_conversations_50k
Viewer
• Updated • 100k • 3.46k
• 19
mteb/tweet_sentiment_extraction
Viewer
• Updated • 30.2k • 4.53k
• 38
Viewer
• Updated • 5.34k • 52.8k
• 8
mteb/sts22-crosslingual-sts
Viewer
• Updated • 17.2k • 12.7k
• 12
Viewer
• Updated • 7.96k • 7.47k
• 2
mteb/stackoverflowdupquestions-reranking
Viewer
• Updated • 22.8k • 2.28k
• 3
reach-vb/jenny_tts_dataset
Viewer
• Updated • 21k • 275
• 34
ai4privacy/pii-masking-200k
Viewer
• Updated • 209k • 2.89k
• 120
ai4privacy/pii-masking-300k
Viewer
• Updated • 225k • 3.94k
• 84
bigcode/bigcode-pii-dataset-training
Viewer
• Updated • 11.9k • 16
• 11
TypicaAI/pii-masking-60k_fr
Viewer
• Updated • 61.9k • 37
• 2
davanstrien/code-prompt-similarity-model
Sentence Similarity
• 0.1B • Updated • 10
• 6
Viewer
• Updated • 2.34M • 693
• 160
Preview
• Updated • 1.35k
• 50
Image-Text-to-Text
• 9B • Updated • 4.42k
• 189