Delta-Vector committed
Commit c1262f1
1 parent: a00daa6

Update README.md

Files changed (1):
  1. README.md +6 -4
README.md CHANGED
@@ -20,8 +20,10 @@ tags:
 - chat
 ---
 
-An earlier checkpoint of the [Magnum 9B V4], using the same configuration as [Tor-8B]() but on Gemma rather than Nemo-8B. This is a finetune made for creative writing and roleplay tasks, trained on top of the base Gemma2 9B model. I trained the model for 4 epochs; the 4-epoch checkpoint became the V4 Magnum 9B, and the 2-epoch checkpoint became my own personal release. This model aims to have good prose and writing while not being as `Suggestive` as Magnum models usually are, while keeping some of the intelligence that was nice to have with the Gemma2 family.
+![](https://huggingface.co/Delta-Vector/Odin-9B/resolve/main/FinalOdin9B.jpg)
+
+An earlier checkpoint of an unreleased (for now) model, using the same configuration as [Tor-8B]() but on Gemma rather than Nemo-8B. This is a finetune made for creative writing and roleplay tasks, trained on top of the base Gemma2 9B model. I trained the model for 4 epochs; the 4-epoch checkpoint became a future unreleased model, and the 2-epoch checkpoint became my own personal release. This model aims to have good prose and writing while not being as `Suggestive` as Magnum models usually are, while keeping some of the intelligence that was nice to have with the Gemma2 family.
 
 # Quants
 
@@ -101,7 +103,7 @@ load_in_4bit: false
 strict: false
 
 datasets:
-  - path: anthracite-core/c2_logs_16k_llama_v1.1
+  - path: [PRIVATE CLAUDE LOG FILTER]
     type: sharegpt
     conversation: chatml
   - path: NewEden/Claude-Instruct-5K
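
For readers unfamiliar with the keys in this hunk: `type: sharegpt` tells Axolotl to read ShareGPT-style records, and `conversation: chatml` renders each turn with the ChatML template (`<|im_start|>role ... <|im_end|>`). A minimal sketch of such an entry, assuming a hypothetical dataset id since the real path is redacted above:

```yaml
# Sketch of one Axolotl datasets entry — the dataset path is hypothetical.
datasets:
  - path: your-org/your-sharegpt-dataset  # hypothetical Hugging Face dataset id
    type: sharegpt                        # each record carries a "conversations" list
    conversation: chatml                  # format turns with the ChatML template
```

And the record shape the `sharegpt` loader expects, shown here with illustrative values:

```yaml
# One ShareGPT-style record (illustrative content).
conversations:
  - from: human
    value: "Write a short tavern scene."
  - from: gpt
    value: "The tavern door creaked open..."
```
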
@@ -203,11 +205,11 @@ special_tokens:
 - [Epiculous/Synthstruct-Gens-v1.1-Filtered-n-Cleaned](https://huggingface.co/datasets/Epiculous/Synthstruct-Gens-v1.1-Filtered-n-Cleaned)
 - [anthracite-org/kalo_opus_misc_240827](https://huggingface.co/datasets/anthracite-org/kalo_opus_misc_240827)
 - [anthracite-org/kalo_misc_part2](https://huggingface.co/datasets/anthracite-org/kalo_misc_part2)
-- [anthracite-core/c2_logs_16k_llama_v1.1](https://huggingface.co/datasets/anthracite-core/c2_logs_16k_llama_v1.1)
+- [Private re-Filter of Claude Logs](https://google.com)
 
 
 ## Training
-The training was done for 2 epochs. We used 8 x [H100](https://www.nvidia.com/en-us/data-center/h100/) GPUs graciously provided by [Lucy Knada](https://huggingface.co/lucyknada) for the full-parameter fine-tuning of the model.
+The training was done for 4 epochs. We used 8 x [H100](https://www.nvidia.com/en-us/data-center/h100/) GPUs graciously provided by [Lucy Knada](https://huggingface.co/lucyknada) for the full-parameter fine-tuning of the model.
 
 [<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
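
As a point of reference, the epoch change in this hunk maps to a single Axolotl config key. A minimal, hedged sketch follows; the values are inferred from the prose above rather than copied from the actual config, and the per-epoch checkpointing key is an assumption:

```yaml
# Sketch only — inferred from the model card, not the real training config.
num_epochs: 4        # full run; the 2-epoch checkpoint became this release
saves_per_epoch: 1   # assumed, so intermediate epoch checkpoints exist
```

A config like this is typically launched with `accelerate launch -m axolotl.cli.train config.yml` on the multi-GPU node.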