Update README.md
README.md (CHANGED)
@@ -1,7 +1,8 @@
 ---
+base_model:
+- Undi95/Meta-Llama-3-8B-Instruct-hf
 language:
 - en
-- ko
 pipeline_tag: text-generation
 tags:
 - mergekit
@@ -188,10 +189,14 @@ extra_gated_fields:
 extra_gated_description: The information you provide will be collected, stored, processed and shared in accordance with the [Meta Privacy Policy](https://www.facebook.com/privacy/policy/).
 extra_gated_button_content: Submit
 ---
-#
+# Meta-Llama-3-11.5B-Instruct
 
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
+I had this idea at night that it would make sense to make a frankenmerge of Llama 3, since we didn't get 13B or 34B versions this time.
+
+Here's the same thing but for the base model: [mpasila/Meta-Llama-3-11.5B](https://huggingface.co/mpasila/Meta-Llama-3-11.5B/)
+
 ## Merge Details
 ### Merge Method
 
@@ -200,8 +205,7 @@ This model was merged using the passthrough merge method.
 ### Models Merged
 
 The following models were included in the merge:
-* [
-* [cognitivecomputations/dolphin-2.9-llama3-8b](https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-8b)
+* [Undi95/Meta-Llama-3-8B-Instruct-hf](https://huggingface.co/Undi95/Meta-Llama-3-8B-Instruct-hf)
 
 ### Configuration
 
@@ -210,14 +214,11 @@ The following YAML configuration was used to produce this model:
 ```yaml
 slices:
 - sources:
-
-
+  - model: Undi95/Meta-Llama-3-8B-Instruct-hf
+    layer_range: [0, 24]
 - sources:
-
-
-
-
-
-
-
-```
+  - model: Undi95/Meta-Llama-3-8B-Instruct-hf
+    layer_range: [8, 32]
+merge_method: passthrough
+dtype: bfloat16
+```
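
The arithmetic behind the name: Llama 3 8B has 32 decoder layers, and the passthrough merge stacks layers 0-23 (`layer_range: [0, 24]`, end-exclusive) on top of layers 8-31 (`layer_range: [8, 32]`), so layers 8-23 appear twice and the merged model ends up with 48 layers. A back-of-the-envelope sketch of why that lands near 11.5B parameters, using approximate Llama 3 8B figures of my own rather than anything from the card:

```python
# Rough sanity check (my own sketch, not from the card) of the layer math.
# Assumes approximate Llama 3 8B figures: ~218M parameters per decoder layer
# and ~1.05B for the embedding table plus the untied LM head (vocab 128256).

slice_a = range(0, 24)   # layer_range: [0, 24] -> layers 0..23
slice_b = range(8, 32)   # layer_range: [8, 32] -> layers 8..31

merged_layers = len(slice_a) + len(slice_b)
print(merged_layers)                      # 48 (layers 8..23 are duplicated)

per_layer = 0.218e9       # approx. params per Llama 3 8B decoder layer
embed_and_head = 1.05e9   # approx. embeddings + LM head
total = merged_layers * per_layer + embed_and_head
print(f"~{total / 1e9:.1f}B parameters")  # ~11.5B
```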
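
To reproduce the merge, something like the following should work. This is a sketch I'm adding, not part of the original card; it assumes mergekit is installed (`pip install mergekit`) and drives mergekit's `mergekit-yaml` entry point with the exact config from the diff above:

```python
# Minimal reproduction sketch (assumes `pip install mergekit` and enough
# RAM/disk to materialize the merged weights). mergekit-yaml is invoked as:
# mergekit-yaml <config.yml> <output-dir>
import pathlib
import subprocess

CONFIG = """\
slices:
- sources:
  - model: Undi95/Meta-Llama-3-8B-Instruct-hf
    layer_range: [0, 24]
- sources:
  - model: Undi95/Meta-Llama-3-8B-Instruct-hf
    layer_range: [8, 32]
merge_method: passthrough
dtype: bfloat16
"""

pathlib.Path("merge-config.yml").write_text(CONFIG)
subprocess.run(
    ["mergekit-yaml", "merge-config.yml", "./Meta-Llama-3-11.5B-Instruct"],
    check=True,
)
```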
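
And a minimal text-generation sketch with transformers, also my addition rather than part of the card. The repo id is an assumption based on the card's title, and the prompt is built through the tokenizer's Llama 3 chat template:

```python
# Minimal usage sketch (my addition, not from the original card). Assumes the
# merged model is published as "mpasila/Meta-Llama-3-11.5B-Instruct" and that
# accelerate is installed for device_map="auto".
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mpasila/Meta-Llama-3-11.5B-Instruct"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the merge's dtype
    device_map="auto",
)

# Llama 3 Instruct expects its chat template, so format the prompt through it.
messages = [{"role": "user", "content": "What is a frankenmerge?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(
    input_ids, max_new_tokens=256, do_sample=True, temperature=0.7
)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```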