Chinese-Vicuna
/

Chinese-Vicuna-lora-7b-chatv1

Model card Files Files and versions Community

lu-vae commited on May 10, 2023

Commit

3108df2

•

1 Parent(s): c0faf78

Update README.md

Files changed (1) hide show

README.md +29 -0

README.md CHANGED Viewed

@@ -1,3 +1,32 @@
 ---
 license: gpl-3.0
 ---

 ---
 license: gpl-3.0
+datasets:
+- philschmid/sharegpt-raw
+language:
+- zh
+- en
 ---
+This is a Chinese instruction-tuning lora checkpoint based on llama-7B from [this repo's work](https://github.com/Facico/Chinese-Vicuna)
+We finetune it on the combination of [alpaca_chinese_dataset](https://github.com/hikariming/alpaca_chinese_dataset.git) and sharegpt-90k data.
+We finetune it for 3 epochs use a single 4090 with ctxlen=2048.
+You can use it like this:
+```python
+from transformers import LlamaForCausalLM
+from peft import PeftModel
+model = LlamaForCausalLM.from_pretrained(
+    "decapoda-research/llama-7b-hf",
+    load_in_8bit=True,
+    torch_dtype=torch.float16,
+    device_map="auto",
+)
+model = PeftModel.from_pretrained(
+    model,
+    "Chinese-Vicuna/Chinese-Vicuna-lora-7b-chatv1"
+    torch_dtype=torch.float16,
+    device_map={'': 0}
+)
+```