SHRDFU-7b Γ
- Developed by: maldv
- License: cc-by-nc-4.0
- Finetuned from model: ammarali32/multi_verse_model
- Methodology: Targeting attention layers with peft to condition; then small full layer tuning; extending intelligence and problem solving w/ crabcanon
As I work on understanding how to layer information in to the model, this dataset has some good parts and bad. I think one or two more experiments and I move on.
- Downloads last month
- 13
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.