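A collection of multilingual translation datasets for Traditional Chinese language models, for example Chinese-to-English and Chinese-to-Vietnamese. Traditional Chinese is closely tied to East and Southeast Asia, so future extensibility needs to be considered.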
Heng-Shiou Sheu | 許恆修
Heng666
AI & ML interests
Graph Neural Learning
Recent Activity
Upvoted an article 20 days ago
Upvoted a collection about 1 month ago: Traditional Chinese LLM Corpus
Organizations
Collections: 6
Spaces: 16
Models: 31
Heng666/gemma-2b-GGUF • Updated • 2
Heng666/paligemma_construction_safety • Updated • 2
Heng666/my_awesome_billsum_model • Updated
Heng666/madlad400-10b-mt-ct2-int8 • Updated • 4
Heng666/madlad400-7b-bt-mt-ct2-int8 • Updated
Heng666/madlad400-7b-mt-ct2-int8 • Translation • Updated • 19 • 3
Heng666/madlad400-3b-mt-ct2 • Translation • Updated • 2
Heng666/madlad400-3b-mt-ct2-int8 • Translation • Updated • 30
Heng666/NeuralPipe-7B-slerp • Text Generation • Updated • 11
Heng666/phi-2-GGUF • Updated • 3
Datasets: 11
Heng666/dot_embedding • Viewer • Updated • 152 • 37
Heng666/Taiwan-patent-corpus • Viewer • Updated • 28 • 40 • 1
Heng666/Taiwan-patent-qa • Viewer • Updated • 1.22k • 222 • 3
Heng666/Taiwan-patent-qa-eval • Viewer • Updated • 192 • 68 • 2
Heng666/OpenSubtitles-TW-Corpus • Viewer • Updated • 7.22M • 69 • 2
Heng666/Traditional_Chinese-aya_evaluation_suite • Viewer • Updated • 650 • 59 • 3
Heng666/Traditional_Chinese-aya_dataset • Viewer • Updated • 4.91k • 143 • 3
Heng666/Traditional_Chinese-aya_collection • Viewer • Updated • 2.02M • 2.14k • 5
Heng666/MultiCCAligned-TW-Corpus • Viewer • Updated • 3.13M • 96 • 3
Heng666/Taoyuan-Airport-MRT-MT-Challenge • Viewer • Updated • 1.14k • 66