Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Reset Other
Autonomous
art
Synthetic
medical
code
biology
finance
legal
chemistry
agent
music
climate
Apply filters
Datasets
3,192
Full-text search
Edit filters
Sort: Trending
Active filters:
multilingual
Clear all
allenai/c4
Viewer
•
Updated
Jan 9, 2024
•
10.4B
•
795k
•
586
Cognitive-Lab/NayanaOCR_Corpus_2025
Viewer
•
Updated
9 days ago
•
1.01M
•
7.05k
•
13
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
4.17k
•
70
bigcode/the-stack-v2
Viewer
•
Updated
Apr 23, 2024
•
5.45B
•
17.3k
•
571
Helsinki-NLP/opus_books
Viewer
•
Updated
Mar 29, 2024
•
1.25M
•
15.2k
•
91
masakhane/mafand
Viewer
•
Updated
Sep 11, 2023
•
143k
•
4.76k
•
17
bigcode/starcoderdata
Viewer
•
Updated
May 16, 2023
•
207M
•
27.2k
•
516
openlanguagedata/flores_plus
Viewer
•
Updated
11 days ago
•
883k
•
17.1k
•
142
OpenLLM-France/Luciole-Training-Dataset
Updated
17 minutes ago
•
3.68k
•
2
ruggsea/infini-news-corpus
Viewer
•
Updated
20 days ago
•
776M
•
25.9k
•
5
tmquan/thuvienphapluat-vn-tnpl
Viewer
•
Updated
10 days ago
•
33.3k
•
936
•
4
bigcode/the-stack
Viewer
•
Updated
Apr 13, 2023
•
546M
•
18.5k
•
1.01k
code-search-net/code_search_net
Viewer
•
Updated
Feb 23
•
4.14M
•
22.9k
•
330
speechbrain/common_language
Updated
Jun 12, 2023
•
441
•
44
Helsinki-NLP/news_commentary
Viewer
•
Updated
Feb 29, 2024
•
4.23M
•
4.27k
•
39
google-research-datasets/tydiqa
Viewer
•
Updated
Aug 8, 2024
•
241k
•
4.11k
•
38
facebook/multilingual_librispeech
Viewer
•
Updated
Aug 12, 2024
•
1.49M
•
49.1k
•
180
codeparrot/github-code
Updated
Oct 20, 2022
•
24.3k
•
363
oscar-corpus/OSCAR-2201
Updated
Aug 6, 2025
•
54.3k
•
131
ontonotes/conll2012_ontonotesv5
Updated
Jan 18, 2024
•
926
•
45
google/fleurs
Viewer
•
Updated
19 days ago
•
768k
•
63.6k
•
406
google/wit
Viewer
•
Updated
Jul 4, 2022
•
2.66M
•
339
•
69
Muennighoff/flores200
Updated
Jan 7, 2024
•
2.76k
•
22
bigcode/the-stack-smol
Viewer
•
Updated
May 2, 2023
•
300k
•
19.6k
•
79
miracl/miracl
Updated
Dec 29, 2024
•
2.28k
•
72
masakhane/masakhaner2
Viewer
•
Updated
Sep 11, 2023
•
153k
•
3.56k
•
13
oscar-corpus/OSCAR-2301
Updated
Aug 6, 2025
•
2.07k
•
180
masakhane/afriqa
Viewer
•
Updated
Jul 7, 2023
•
12.2k
•
323
•
10
CohereLabs/xP3x
Viewer
•
Updated
May 23, 2025
•
434M
•
262k
•
95
masakhane/afriqa-gold-passages
Updated
Sep 27, 2024
•
167
•
6
Previous
1
2
3
...
100
Next