---
language:
- ko
- en
license: mit
---
# Model Card for free-evo-qwen72b-v0.8
Developed by: Freewheelin AI Technical Team

1st place: 4 May 2024, avg. 81.28 on the Open LLM Leaderboard. The entry was later removed from the leaderboard, though; maybe our explanation of the method was not enough.
## Method
- We were inspired by the Sakana AI evolutionary model merge project.
## Process
- You need two models with the same architecture.
- Choose one model and fine-tune it to create a gap between the original and the fine-tuned one.
- Merge the two models.
- Evaluate the merged model.
- Fine-tune the model on the specific evaluation areas where it is weak.
- Evaluate again.
- Merge again.
- Evaluate again.
- Keep going until the evaluation average is higher than the original one.
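The merge step above can be sketched as a weighted average of two same-architecture checkpoints. This is a minimal illustration, not the card's actual implementation: `merge_state_dicts` and `alpha` are hypothetical names, and real merges would operate on torch tensors loaded via `transformers` rather than plain Python lists.

```python
def merge_state_dicts(sd_a, sd_b, alpha=0.5):
    """Linearly interpolate two state dicts with identical keys/shapes.

    Both models must share the same architecture, so every parameter
    name in one dict has a matching entry in the other.
    """
    assert sd_a.keys() == sd_b.keys(), "architectures must match"
    return {
        k: [alpha * a + (1 - alpha) * b for a, b in zip(sd_a[k], sd_b[k])]
        for k in sd_a
    }

# Toy example: two "models" with a single weight vector each.
base = {"w": [1.0, 2.0]}
tuned = {"w": [3.0, 6.0]}
merged = merge_state_dicts(base, tuned, alpha=0.5)
# merged["w"] → [2.0, 4.0]
```

With `alpha=0.5` this is a plain average; sweeping `alpha` per merge round is one simple way to search for a better combination.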
That's it. Simple.
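The whole loop can be summarized in a short sketch, assuming placeholder callables. `evaluate`, `fine_tune`, and `merge` here stand in for the real benchmark harness, training step, and merge routine; none of these names come from the card.

```python
def evolve(base, candidate, merge, evaluate, fine_tune, max_rounds=10):
    """Repeat merge -> evaluate -> fine-tune until the merged model
    scores higher than the original base model, or the budget runs out."""
    base_score = evaluate(base)
    for _ in range(max_rounds):
        merged = merge(base, candidate)
        if evaluate(merged) > base_score:
            return merged  # merged average beats the original: stop
        # otherwise, fine-tune the merged model (targeting its weak
        # evaluation areas) and try merging again next round
        candidate = fine_tune(merged)
    return None  # no improvement found within the round budget

# Toy run: models are plain scores, merge is an average,
# fine-tuning adds a fixed improvement of 1.0.
result = evolve(
    base=1.0,
    candidate=0.0,
    merge=lambda a, b: (a + b) / 2,
    evaluate=lambda m: m,
    fine_tune=lambda m: m + 1.0,
)
```

In the toy run the first merge scores 0.5 (worse than the base's 1.0), the fine-tuned candidate becomes 1.5, and the second merge scores 1.25, which beats the base and ends the loop.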
## Base Architecture
- QWEN2
## Base Models
- several QWEN2 based models