Why dataset tag?

by rombodawg - opened 9 days ago

9 days ago

Why does this model have the dataset tag:

datasets:
- arcee-ai/EvolKit-20k

Is this a finetuned model? If so why haven't you uploaded the base model? Nobody can finetune on top of this if its an instruct model.

AaronFeng753

9 days ago

https://github.com/arcee-ai/EvolKit

Daemontatox

9 days ago

So the 130 fine tunes of llama 3.1 8b instruct just dont exist ?

rombodawg

9 days ago

Im realizing the model card is very misleading, and this isnt a new distillation of llama-3.1-405b. Its just another finetune of llama-3.1-8b-instruct. Like every other model

Daemontatox

9 days ago

Is Distillation from 405b to 8b even possible?
(While keeping the model functioning and getting good results).
I think the dataset was created using a distalled 405b.

qnguyen3

Arcee AI org 9 days ago

•

edited 9 days ago

i believe it was first distilled from 405b using: https://github.com/arcee-ai/DistillKit
then EvolKit made the additional data to finetune after the distillation

Crystalcareai

Arcee AI org 8 days ago

There is nothing in the readme that in anyway gives the impression that this is somehow a new base model - I will, however, add a link to one of our announcement overviews.

Crystalcareai changed discussion status to closed 8 days ago

rombodawg

8 days ago

@Crystalcareai bruh "Llama-3.1-SuperNova-Lite is an 8B parameter model developed by Arcee.ai, based on the Llama-3.1-8B-Instruct architecture. It is a distilled version of the larger Llama-3.1-405B-Instruct model"

You literally say my model is distilled from llama-3.1-405b. Your wording is just very confusing. I see that you didnt meat to say it that way

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment