XLM-Roberta-base --> 8emotions!
Label Dictionry
- label_dictionary
- emo2int = { "κΈ°μ¨": 0, "λΉν©": 1, "λΆλ Έ": 2, "λΆμ": 3, "μμ²": 4, "μ¬ν": 5, "μ€λ¦½": 6 }
- kore2en = { "κΈ°μ¨": "joy", "λΉν©": "surprise", "λΆλ Έ": "anger", "λΆμ": "fear", "μμ²": "hurt", "μ¬ν": "sadness", "μ€λ¦½": "neutral" }
Dataset
κ°μ±λνλ§λμΉ(AI Hub)
- input format(recommendation) - this model is trained by ChatBOT dataset.
- ref: https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=86
νκ΅μ΄ κ°μ μ λ³΄κ° ν¬ν¨λ μ°μμ λν λ°μ΄ν°μ (AIHub)
And.. this dataset doesn't have neutral class..
So additional dataset is used.
ref: https://aihub.or.kr/aihubdata/data/view.do?dataSetSn=271
finally I Concatenate 2 Datasets.
Input Format(Please Use Special Tokens [USR], [BOT] to use model API!)
(example) [USR] μλ . [BOT] μλ νμΈμ! 무μμ λμλ릴κΉμ? [USR] λ³μΌ μμ΄.
μ΄ λκ°μ νΉμ ν ν°μ λ°λμ μ¬μ©ν΄μ£ΌμκΈΈ λΆνλ립λλ€.
Metrics(F1, Accuracy, and Confusion Matrix!)
- Downloads last month
- 5
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.