---
base_model: []
library_name: transformers
license: other
license_name: llama-3
license_link: https://llama.meta.com/llama3/license/
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png
---

This is the first version of the Llama-3 upscale. Version 2, which does not have any of the issues present in this version, is now out. Please use version 2 instead. Linked below:


# Llama-3-13B

Thank you to Meta for the Meta-Llama-3-8B weights.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/642cc1c253e76b4c2286c58e/aJJxKus1wP5N-euvHEUq7.png)

This is an upscaling of the Llama-3-8B model using techniques created for Mistral-Evolved-11b-v0.1. The model has been upscaled from 8B to 13B parameters without any continued pretraining or fine-tuning.
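The card does not document the exact layer layout used. As a rough illustration of this style of depth upscaling (passthrough-style layer duplication, with no retraining), here is a sketch on a tiny randomly initialized Llama config. The config sizes and slice indices are hypothetical stand-ins, not the ones used for this model:

```python
import copy

import torch
from transformers import LlamaConfig, LlamaForCausalLM

# Tiny stand-in config (hypothetical sizes; the real base is Meta-Llama-3-8B).
cfg = LlamaConfig(
    vocab_size=256,
    hidden_size=64,
    intermediate_size=128,
    num_hidden_layers=8,
    num_attention_heads=4,
    num_key_value_heads=2,
)
model = LlamaForCausalLM(cfg)

# Depth upscaling: keep an early slice of decoder layers, then append a
# later slice, so the middle layers appear twice. No weights are retrained.
old_layers = model.model.layers
new_layers = [copy.deepcopy(layer) for layer in list(old_layers[:6]) + list(old_layers[2:])]
model.model.layers = torch.nn.ModuleList(new_layers)
model.config.num_hidden_layers = len(new_layers)

print(len(model.model.layers))  # 8 layers -> 12 layers
```

Because the duplicated layers keep their original weights, the upscaled model stays coherent without fine-tuning, at the cost of some quality loss relative to a retrained model.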

From testing, the model seems to function perfectly at fp16, but has some issues at 4-bit quantization using bitsandbytes.

The model that was used to create this one is linked below:

https://huggingface.co/meta-llama/Meta-Llama-3-8B

### Llama-3-13B

| Metric                            | Value |
|-----------------------------------|-------|
| Avg.                              | 54.61 |
| AI2 Reasoning Challenge (25-Shot) | 52.99 |
| HellaSwag (10-Shot)               | 80.66 |
| MMLU (5-Shot)                     | 62.12 |
| TruthfulQA (0-shot)               | 39.28 |
| Winogrande (5-shot)               | 70.72 |
| GSM8k (5-shot)                    | 21.91 |
### Original Meta-Llama-3-8B

| Metric                            | Value |
|-----------------------------------|-------|
| Avg.                              | 62.87 |
| AI2 Reasoning Challenge (25-Shot) | 59.47 |
| HellaSwag (10-Shot)               | 82.09 |
| MMLU (5-Shot)                     | 66.69 |
| TruthfulQA (0-shot)               | 43.90 |
| Winogrande (5-shot)               | 77.35 |
| GSM8k (5-shot)                    | 45.34 |
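The Avg. value is the arithmetic mean of the six benchmark scores; a quick check for the Llama-3-13B row:

```python
# Open LLM Leaderboard scores for Llama-3-13B, from the table above.
scores = {
    "ARC (25-shot)": 52.99,
    "HellaSwag (10-shot)": 80.66,
    "MMLU (5-shot)": 62.12,
    "TruthfulQA (0-shot)": 39.28,
    "Winogrande (5-shot)": 70.72,
    "GSM8k (5-shot)": 21.91,
}
avg = round(sum(scores.values()) / len(scores), 2)
print(avg)  # 54.61
```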