anakin87
/

Llama-3-8b-ita-slerp

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

anakin87 commited on May 24

Commit

e5e5316

•

1 Parent(s): ef6ab66

Update README.md

Files changed (1) hide show

README.md +13 -1

README.md CHANGED Viewed

@@ -10,10 +10,22 @@ license: llama3
 language:
 - it
 ---
-# merge
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
 ### Merge Method

 language:
 - it
 ---
+# Llama-3-8b-ita-slerp
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+I tried to merge two of the best Italian LLMs using Mergekit. The results are acceptable, but I could not improve on the best existing model.
+## Evaluation
+For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
+Here's a breakdown of the performance metrics:
+| Metric                      | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
+|:----------------------------|:----------------------|:----------------|:---------------------|:--------|
+| **Accuracy Normalized**     | 0.6879               | 0.5714        | 0.5732              | 0.6109  |
 ## Merge Details
 ### Merge Method