Nero10578 commited on
Commit
5054926
1 Parent(s): 9cb65ac

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md CHANGED
@@ -1,3 +1,13 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
3
  ---
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ language:
4
+ - su
5
+ - en
6
+ - id
7
  ---
8
+ This is a fine tune of Mistral-7B-v0.1 on a very limited range of Sundanese language datasets that are available.
9
+ This is a learning project for me where I just wanted to see if it's possible to teach a model a new language that it does not inherently support with just a QLora fine tune. It won't only speak sundanese but it just adds sundanese capability to the model that is to me impressive for the limited data and short amount of training time.
10
+
11
+ Datasets used:
12
+ Sundanese sources from this repo. Cleaned and deduped myself.
13
+ https://github.com/w11wo/nlp-datasets