---
base_model: []
library_name: transformers
license: other
license_name: llama-3
license_link: https://llama.meta.com/llama3/license/
thumbnail: "https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png"
---
This is the first version of the Llama-3 upscale. Version 2 is now available and does not have the issues present in this version. Please use version 2 instead, linked below:

- https://huggingface.co/Replete-AI/Llama-3-11.5B-v2
__________________________________________________________________
Llama-3-13B

Thank you to Meta for the Meta-Llama-3-8B weights.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png)

This is an upscaling of the Llama-3-8B model using techniques created for Mistral-Evolved-11b-v0.1. The model has been upscaled from 8B parameters to 13B parameters without any continued pretraining or fine-tuning.

In testing, the model appears to work correctly at fp16, but shows some issues when quantized to 4-bit with bitsandbytes.
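For reference, here is a minimal sketch of loading a checkpoint at fp16 (the precision reported to work) or, optionally, at 4-bit with bitsandbytes (the setting reported to have issues). It assumes `torch`, `transformers`, and `accelerate` are installed, plus `bitsandbytes` for the 4-bit path. The helper name `load_model` and its `repo_id` argument are illustrative and not part of this repository; pass this repository's id yourself.

```python
def load_model(repo_id: str, use_4bit: bool = False):
    """Load a causal LM and its tokenizer from the Hugging Face Hub.

    repo_id  : Hub repository id, supplied by the caller (assumption:
               the weights are in standard transformers format).
    use_4bit : if True, quantize to 4-bit with bitsandbytes instead of
               loading plain fp16 weights.
    """
    # Heavy imports are kept inside the function so that merely defining
    # it does not require a GPU stack to be installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)

    if use_4bit:
        from transformers import BitsAndBytesConfig

        # 4-bit path: the card reports issues at this setting.
        quant_config = BitsAndBytesConfig(
            load_in_4bit=True,
            bnb_4bit_compute_dtype=torch.float16,
        )
        model = AutoModelForCausalLM.from_pretrained(
            repo_id,
            quantization_config=quant_config,
            device_map="auto",
        )
    else:
        # fp16 path: the card reports this works as expected.
        model = AutoModelForCausalLM.from_pretrained(
            repo_id,
            torch_dtype=torch.float16,
            device_map="auto",
        )
    return model, tokenizer
```

Calling `load_model(repo_id)` with this repository's id gives the fp16 configuration; passing `use_4bit=True` reproduces the quantized setup if you want to compare outputs between the two.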

The model that was used to create this one is linked below:

https://huggingface.co/meta-llama/Meta-Llama-3-8B

- Llama-3-13B

|             Metric              |Value|
|---------------------------------|----:|
|Avg.                             |54.61|
|AI2 Reasoning Challenge (25-Shot)|52.99|
|HellaSwag (10-Shot)              |80.66|
|MMLU (5-Shot)                    |62.12|
|TruthfulQA (0-shot)              |39.28|
|Winogrande (5-shot)              |70.72|
|GSM8k (5-shot)                   |21.91|


- Original Meta-Llama-3-8B
  
|             Metric              |Value|
|---------------------------------|----:|
|Avg.                             |62.87|
|AI2 Reasoning Challenge (25-Shot)|59.47|
|HellaSwag (10-Shot)              |82.09|
|MMLU (5-Shot)                    |66.69|
|TruthfulQA (0-shot)              |43.90|
|Winogrande (5-shot)              |77.35|
|GSM8k (5-shot)                   |45.34|
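
To make the comparison between the two tables easier to read, the short sketch below computes the per-benchmark difference (upscaled minus original) from the values listed above. The dictionary and function names are illustrative only; the numbers are copied from the tables, with "AI2 Reasoning Challenge" shortened to "ARC".

```python
# Scores copied from the two tables above (percent).
upscaled = {
    "ARC (25-shot)": 52.99,
    "HellaSwag (10-shot)": 80.66,
    "MMLU (5-shot)": 62.12,
    "TruthfulQA (0-shot)": 39.28,
    "Winogrande (5-shot)": 70.72,
    "GSM8k (5-shot)": 21.91,
}
base = {
    "ARC (25-shot)": 59.47,
    "HellaSwag (10-shot)": 82.09,
    "MMLU (5-shot)": 66.69,
    "TruthfulQA (0-shot)": 43.90,
    "Winogrande (5-shot)": 77.35,
    "GSM8k (5-shot)": 45.34,
}


def score_deltas(new, ref):
    """Return new - ref for every benchmark, rounded to two decimals."""
    return {name: round(new[name] - ref[name], 2) for name in ref}


def mean_score(scores):
    """Unweighted mean of the listed benchmark scores, two decimals."""
    return round(sum(scores.values()) / len(scores), 2)


deltas = score_deltas(upscaled, base)

# Print benchmarks from largest drop to smallest.
for name, delta in sorted(deltas.items(), key=lambda kv: kv[1]):
    print(f"{name:<22}{delta:+.2f}")

print(f"mean of upscaled rows: {mean_score(upscaled):.2f}")
print(f"mean of original rows: {mean_score(base):.2f}")
```

Every benchmark is lower for the upscaled model: the largest drop is on GSM8k (about 23 points) and the smallest on HellaSwag (about 1.4 points). The mean of the six original-model rows as listed comes out to about 62.47 rather than the 62.87 shown in the Avg. row, so one of the listed values may contain a transcription slip.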