EpistemeAI
/

Fireball-Alpaca-Llama3.1.08-8B-Philos-C-R1-KTO-beta

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

legolasyiu commited on Sep 16

Commit

a6da892

•

1 Parent(s): 2257f37

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -103,8 +103,10 @@ The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a
 Where to send questions or comments about the model Instructions on how to provide feedback or comments on the model can be found in the model [README](https://github.com/meta-llama/llama3). For more technical information about generation parameters and recipes for how to use Llama 3.1 in applications, please go [here](https://github.com/meta-llama/llama-recipes).
 ## Training
-**SFT Supervised Fine tuning**:
-Experimental: Supervised fine tuning with chain of thought, philsophy and PHD level dataset.
 # Intended Use
 **Intended Use Cases** Llama 3.1 is intended for commercial and research use in multiple languages. Instruction tuned text only models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks. The Llama 3.1 model collection also supports the ability to leverage the outputs of its models to improve other models including synthetic data generation and distillation. The Llama 3.1 Community License allows for these use cases.

 Where to send questions or comments about the model Instructions on how to provide feedback or comments on the model can be found in the model [README](https://github.com/meta-llama/llama3). For more technical information about generation parameters and recipes for how to use Llama 3.1 in applications, please go [here](https://github.com/meta-llama/llama-recipes).
 ## Training
+**KTO Fine tuning**:
+Experimental: KTO fine tuning
+KTO - Kahneman-Tversky Optimization (KTO) that makes it easier and cheaper than ever before to align LLMs on your data without compromising performance
 # Intended Use
 **Intended Use Cases** Llama 3.1 is intended for commercial and research use in multiple languages. Instruction tuned text only models are intended for assistant-like chat, whereas pretrained models can be adapted for a variety of natural language generation tasks. The Llama 3.1 model collection also supports the ability to leverage the outputs of its models to improve other models including synthetic data generation and distillation. The Llama 3.1 Community License allows for these use cases.