wxjiao's Collections
Pretrain-Dataset-Korean
Updated Jun 11
heegyu/namuwiki-extracted • Updated Jan 15, 2023 • 565k • 66 • 9
heegyu/kowikitext • Updated Oct 2, 2022 • 1.33M • 23 • 5
maywell/korean_textbooks • Updated Jan 10 • 4.42M • 996 • 96
heegyu/korean-petitions • Updated Jan 15, 2023 • 437k • 87 • 5
hac541309/basic_korean_dict • Updated Jul 26, 2023 • 74.9k • 22 • 4
lcw99/oscar-ko-only • Updated Oct 21, 2022 • 3.68M • 3 • 3
lbox/lbox_open • Updated Nov 9, 2022 • 301k • 224 • 12
lcw99/wikipedia-korean-20240501 • Updated May 5 • 515k • 104 • 12
lcw99/wikipedia-korean-20221001 • Updated May 5 • 607k • 51 • 5
uonlp/CulturaX • Updated Jul 23 • 7.18B • 11.1k • 459
wikimedia/wikipedia • Updated Jan 9 • 61.6M • 68k • 533