Dataset-Tools's Collections

Models for dataset curation

Updated Dec 5, 2024

  • HuggingFaceFW/fineweb-edu-classifier

    Text Classification • 0.1B • Updated Nov 17, 2024 • 59.1k • 214

    Note: Classify texts based on their educational quality


  • minishlab/potion-base-8M

    Updated Mar 27 • 728k • 77

    Note: A blazing-fast embedding generator


  • nvidia/domain-classifier

    Updated Sep 22, 2025 • 9.82k • 97

    Note: A model to classify text according to different domains


  • nvidia/quality-classifier-deberta

    Updated Sep 22, 2025 • 3.18k • 75

    Note: Classify texts based on their general quality


  • urchade/gliner_multi_pii-v1

    Token Classification • Updated Apr 20, 2024 • 60.1k • 169

    Note: Identify and classify personally identifiable information (PII)


  • giacomoarienti/nsfw-classifier

    Image Classification • 85.8M • Updated Mar 26, 2025 • 48.1k • 51

  • Falconsai/nsfw_image_detection

    Image Classification • Updated Apr 6, 2025 • 8.81M • 1.07k

  • PleIAs/celadon

    Text Classification • 0.1B • Updated Jun 12, 2025 • 340 • 39