freewheelin's picture
Update README.md
140e766 verified
|
raw
history blame
No virus
1.09 kB
metadata
language:
  - ko
  - en
license: mit

Model Card for free-evo-qwen72b-v0.8

Developed by : Freewheelin AI Technical Team

1st place : 2024 4th May - avg. 81.28 Open Llm Leaderboard

but this kicked away. maybe the explanation was not enough.

Method

Process

  • you need two models with the same architecture
    1. choose one model and finetune the model to make a gap between the original one and fine-tuned one. it doesn't matter the evaluation score is higher or lower.
    1. merge two of them
    1. evaluate the merged model
    1. finetune a specific evaluation part if you need to increase score of the part of the model. (sure it's not gonna work like you think. but try it)
    1. merge again
    1. evaluate again
    1. keep going until evaluate avg is higher then original one

that's it. simple.

Base Architecture

  • QWEN2

Base Models

  • several QWEN2 based models