File size: 1,397 Bytes
dcdcb91
 
 
 
 
 
 
 
2bec01c
 
 
 
dcdcb91
 
 
 
 
b939b84
dcdcb91
 
 
b939b84
 
 
 
7ffdc56
b939b84
 
 
7cd5a6a
c435a90
ab823a6
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
---
license: apache-2.0
tags:
- merge
- mergekit
- lazymergekit
- NousResearch/Meta-Llama-3.1-8B-Instruct
- EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
- nvidia/OpenMath2-Llama3.1-8B
base_model:
- NousResearch/Meta-Llama-3.1-8B-Instruct
- EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta
- nvidia/OpenMath2-Llama3.1-8B
---

# Llama-3.1-8B-Squareroot

This is a TIES merge that combines the performance of the following models:
* [NousResearch/Meta-Llama-3.1-8B-Instruct](https://huggingface.co/NousResearch/Meta-Llama-3.1-8B-Instruct)
* [EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta](https://huggingface.co/EpistemeAI/Fireball-Alpaca-Llama3.1.07-8B-Philos-Math-KTO-beta)
* [nvidia/OpenMath2-Llama3.1-8B](https://huggingface.co/nvidia/OpenMath2-Llama3.1-8B)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6479f6dbed75e95d3e97bb4d/LpWI-ug9WZdpcrjBy44iw.png)


*Disclaimer: This one's a failed attempt. Working on a better version, so check back soon!*

# Benchmarks

The model ranks in the top 5 for MATH benchmarks but performs severely badly on others (which isn't quite what I was expecting). I’m hoping to improve its general abilities without losing its math skills. Qwen still has the top spot :(


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6479f6dbed75e95d3e97bb4d/IPC7gTS4wJPOXVm1nCqLV.png)