nihao

Browse files

Files changed (5) hide show

README.assets/clip_image002.gif +0 -0
README.assets/clip_image004.gif +0 -0
README.assets/clip_image006.gif +0 -0
README.assets/clip_image008.gif +0 -0
README.md +8 -8

README.assets/clip_image002.gif ADDED Viewed

README.assets/clip_image004.gif ADDED Viewed

README.assets/clip_image006.gif ADDED Viewed

README.assets/clip_image008.gif ADDED Viewed

README.md CHANGED Viewed

@@ -22,11 +22,11 @@ DPO训练：采用动态提示优化技术，进一步优化模型在特定任
 ## 安装与加载
-克隆本项目到本地：
 git clone
-cd llama-3.1-8b-it-ch-dpo
@@ -38,16 +38,16 @@ C-Eval 是一个全面的中文基础模型评估套件。它包含了大量的
 | C-Eval | Average | Average(hard) | STEM | Social Sciences | Humanities | Other |
 | ------ | ------- | ------------- | ---- | --------------- | ---------- | ----- |
-| 原模型 | 25.2    | 23.6          | 25   | 26.5            | 25.1       | 24.3  |
-| 训练后 | 44.0    | 32.5          | 41.6 | 51.9            | 41.1       | 44.0  |
 #### Cmmlu
 CMMLU是一个综合性的中文评估基准，专门用于评估语言模型在中文语境下的知识和推理能力。CMMLU涵盖了从基础学科到高级专业水平的67个主题。它包括：需要计算和推理的自然科学，需要知识的人文科学和社会科学,以及需要生活常识的中国驾驶规则等。
 | CMMLU  | Average | STEM  | Social Sciences | Humanities | Other |
 | ------ | ------- | ----- | --------------- | ---------- | ----- |
-| 原模型 | 24.99   | 26.04 | 24.84           | 25.23      | 24.05 |
-| 训练后 | 44.63   | 37.5  | 45.21           | 45.76      | 49.14 |
@@ -55,7 +55,7 @@ CMMLU是一个综合性的中文评估基准，专门用于评估语言模型在
 微调数据集：
-|                       |                                                              |
 | --------------------- | ------------------------------------------------------------ |
 | 中文微调数据集        | https://modelscope.cn/datasets/zhuangxialie/Llama3-Chinese-Dataset/files |
 | train_1M_CN           | https://huggingface.co/datasets/BelleGroup/train_1M_CN       |
@@ -86,6 +86,6 @@ Training loss:
 ![img](README.assets/clip_image006.gif)
-Training rewards:
 ![img](README.assets/clip_image008.gif)

 ## 安装与加载
+克隆本项目到本地：https://huggingface.co/jiangfb/llama-3.1-chinese-8b-it-dpo
 git clone
+cd llama-3.1-chinese-8b-it-dpo
 | C-Eval | Average | Average(hard) | STEM | Social Sciences | Humanities | Other |
 | ------ | ------- | ------------- | ---- | --------------- | ---------- | ----- |
+| 原模型 | 24.1    | 23.5          | 23.9 | 25.3            | 24.6       | 22.7  |
+| 训练后 | 44.7    | 32.9          | 41.8 | 52.7            | 42.0       | 44.5  |
 #### Cmmlu
 CMMLU是一个综合性的中文评估基准，专门用于评估语言模型在中文语境下的知识和推理能力。CMMLU涵盖了从基础学科到高级专业水平的67个主题。它包括：需要计算和推理的自然科学，需要知识的人文科学和社会科学,以及需要生活常识的中国驾驶规则等。
 | CMMLU  | Average | STEM  | Social Sciences | Humanities | Other |
 | ------ | ------- | ----- | --------------- | ---------- | ----- |
+| 原模型 | 25.3    | 26.04 | 25.19           | 25.79      | 25.26 |
+| 训练后 | 46.54   | 39.31 | 47.21           | 47.41      | 51.34 |
 微调数据集：
+|                         |                                  |
 | --------------------- | ------------------------------------------------------------ |
 | 中文微调数据集        | https://modelscope.cn/datasets/zhuangxialie/Llama3-Chinese-Dataset/files |
 | train_1M_CN           | https://huggingface.co/datasets/BelleGroup/train_1M_CN       |
 ![img](README.assets/clip_image006.gif)
+Training rewards:
 ![img](README.assets/clip_image008.gif)