Still active?
Are you still working on this?
I have some plans to finetune Yi 1.5 34B 32K on Magpie Ultra or something similar, using either this model or just 01.AI's base model as the base for the finetune.
Nice, I look forward to seeing it.
Will have to give this a go tonight! Has it been tuned with your AEZAKMI dataset as well?
No, it's just Magpie Ultra. The AEZAKMI dataset is in a bit of an existential crisis; I like v2 more than the newer versions. I'll try to make a new version of AEZAKMI that I'm happy with and then train Yi 34B 200K / Yi 1.5 34B / Yi 1.5 9B on it.
I'm open to ideas when it comes to the direction of the AEZAKMI dataset. I'm thinking about adding some distilled multi-turn conversations from Hermes 3 70B to it, plus some non-synthetic Reddit and 4chan data.
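For the Hermes 3 part, this is roughly what I have in mind, just a sketch assuming an OpenAI-compatible endpoint (e.g. vLLM) serving the model locally; the base_url, seed topics, and sampling settings are all placeholders:

```python
import json
from openai import OpenAI

# Assumes a local OpenAI-compatible server (e.g. vLLM) hosting Hermes 3 70B.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")
MODEL = "NousResearch/Hermes-3-Llama-3.1-70B"

seed_prompts = [
    "Explain how LoRA differs from full finetuning.",
    "Write a short scene set in a rainy cyberpunk city.",
]

def distill_conversation(seed, turns=3):
    messages = [{"role": "user", "content": seed}]
    for i in range(turns):
        # Get the assistant's answer for the current conversation state.
        reply = client.chat.completions.create(
            model=MODEL, messages=messages, temperature=0.8, max_tokens=512,
        ).choices[0].message.content
        messages.append({"role": "assistant", "content": reply})
        if i < turns - 1:
            # Ask the model itself for a plausible follow-up user question.
            follow_up = client.chat.completions.create(
                model=MODEL,
                messages=messages + [{
                    "role": "user",
                    "content": "Suggest a natural follow-up question the user "
                               "might ask next. Reply with the question only.",
                }],
                temperature=0.8, max_tokens=128,
            ).choices[0].message.content
            messages.append({"role": "user", "content": follow_up})
    return messages

with open("hermes3_distilled.jsonl", "w") as f:
    for seed in seed_prompts:
        f.write(json.dumps({"conversations": distill_conversation(seed)}) + "\n")
```

Having the model write its own follow-up questions keeps the turns coherent without needing a separate "user simulator" model.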
I think that would be cool, and maybe add some Gutenberg and LimaRP stuff too?
https://huggingface.co/datasets/Dampfinchen/Creative_Writing_Multiturn
I'll look into it. I know LimaRP has some ERP samples involving small kids, so I'm hesitant to use it anywhere unless that content is filtered out properly.
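If I do end up using it, I'd probably start with a crude keyword filter along these lines (just a sketch; the file path and field names are assumptions about the export format, and the blocklist would need to be much more thorough):

```python
from datasets import load_dataset

# Over-broad on purpose: dropping some harmless samples is the lesser problem.
BLOCKLIST = ["child", "underage", "minor", "preteen"]

def is_clean(example):
    # "conversations" with "value" keys is an assumption about the export
    # format; adjust to however your copy of LimaRP is structured.
    text = " ".join(turn["value"] for turn in example["conversations"]).lower()
    return not any(term in text for term in BLOCKLIST)

# Assumes a local JSONL export of LimaRP; swap in the actual path or repo.
ds = load_dataset("json", data_files="limarp.jsonl", split="train")
clean = ds.filter(is_clean)
print(f"kept {len(clean)} / {len(ds)} samples")
clean.to_json("limarp_filtered.jsonl")
```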
I had no idea! Jesus!
I remember using a model for creative writing once and it suggested I include some paedophilia in it. Never deleted a model so fast in my life!