Add links for GGUF and ONNX versions
README.md
CHANGED
@@ -25,6 +25,16 @@ datasets:
 - Felladrin/ChatML-oasst2_curated
 - CohereForAI/aya_dataset
 - Felladrin/ChatML-aya_dataset
+- argilla/distilabel-capybara-dpo-7k-binarized
+- Felladrin/ChatML-distilabel-capybara-dpo-7k-binarized
+- argilla/distilabel-intel-orca-dpo-pairs
+- Felladrin/ChatML-distilabel-intel-orca-dpo-pairs
+- argilla/ultrafeedback-binarized-preferences
+- Felladrin/ChatML-ultrafeedback-binarized-preferences
+- sablo/oasst2_dpo_pairs_en
+- Felladrin/ChatML-oasst2_dpo_pairs_en
+- NeuralNovel/Neural-DPO
+- Felladrin/ChatML-Neural-DPO
 widget:
 - text: |-
 <|im_start|>system
@@ -70,10 +80,10 @@ inference:
 repetition_penalty: 1.176
 ---

-# Minueza-32M-Chat: A
+# Minueza-32M-Chat: A chat model with 32 million parameters

 - Base model: [Felladrin/Minueza-32M-Base](https://huggingface.co/Felladrin/Minueza-32M-Base)
-- Datasets:
+- Datasets used during SFT:
 - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-databricks-dolly-15k)] [databricks/databricks-dolly-15k](https://huggingface.co/datasets/databricks/databricks-dolly-15k)
 - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-reddit-instruct-curated)] [euclaise/reddit-instruct-curated](https://huggingface.co/datasets/euclaise/reddit-instruct-curated)
 - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-WebGLM-QA)] [THUDM/webglm-qa](https://huggingface.co/datasets/THUDM/webglm-qa)
@@ -84,6 +94,15 @@ inference:
 - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-HelpSteer)] [nvidia/HelpSteer](https://huggingface.co/datasets/nvidia/HelpSteer)
 - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-oasst2_curated)] [sablo/oasst2_curated](https://huggingface.co/datasets/sablo/oasst2_curated)
 - [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-aya_dataset)] [CohereForAI/aya_dataset](https://huggingface.co/datasets/CohereForAI/aya_dataset)
+- Datasets used during DPO:
+- [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-distilabel-capybara-dpo-7k-binarized)] [argilla/distilabel-capybara-dpo-7k-binarized](https://huggingface.co/datasets/argilla/distilabel-capybara-dpo-7k-binarized)
+- [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-distilabel-intel-orca-dpo-pairs)] [argilla/distilabel-intel-orca-dpo-pairs](https://huggingface.co/datasets/argilla/distilabel-intel-orca-dpo-pairs)
+- [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-ultrafeedback-binarized-preferences)] [argilla/ultrafeedback-binarized-preferences](https://huggingface.co/datasets/argilla/ultrafeedback-binarized-preferences)
+- [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-oasst2_dpo_pairs_en)] [sablo/oasst2_dpo_pairs_en](https://huggingface.co/datasets/sablo/oasst2_dpo_pairs_en)
+- [[ChatML](https://huggingface.co/datasets/Felladrin/ChatML-Neural-DPO)] [NeuralNovel/Neural-DPO](https://huggingface.co/datasets/NeuralNovel/Neural-DPO)
+- Availability in other ML formats:
+- GGUF: [Felladrin/gguf-Minueza-32M-Chat](https://huggingface.co/Felladrin/gguf-Minueza-32M-Chat)
+- ONNX: [Felladrin/onnx-Minueza-32M-Chat](https://huggingface.co/Felladrin/onnx-Minueza-32M-Chat)

 ## Recommended Prompt Format