## Model Details
- Supervised fine-tuning (SFT) of meta-llama/Llama-2-7b-hf on yahma/alpaca-cleaned
- Trained with DeepSpeed ZeRO-1 + TRL + QLoRA + Flash Attention 2 in about 1 hour on 4x RTX 3090 (a sketch of the setup follows this list)
- Only the LoRA adapter is uploaded; see the loading example under Framework versions
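
The following is a minimal sketch of a training setup matching the stack above (TRL's `SFTTrainer` + QLoRA + Flash Attention 2, with DeepSpeed ZeRO-1). The LoRA rank/targets, hyperparameters, prompt template, and the `ds_zero1.json` config path are assumptions for illustration, not the card author's exact script.

```python
import torch
from datasets import load_dataset
from peft import LoraConfig
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    TrainingArguments,
)
from trl import SFTTrainer

base_model = "meta-llama/Llama-2-7b-hf"

# QLoRA: 4-bit NF4 quantization of the frozen base model, bf16 compute
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    base_model,
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,
    attn_implementation="flash_attention_2",  # Flash Attention 2
)
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token

# LoRA adapter config (rank and target modules are assumptions)
peft_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)

def format_example(example):
    # Alpaca-style prompt template (assumed)
    return (
        f"### Instruction:\n{example['instruction']}\n\n"
        f"### Input:\n{example['input']}\n\n"
        f"### Response:\n{example['output']}"
    )

args = TrainingArguments(
    output_dir="llama-2-7b-sft-lora",
    per_device_train_batch_size=4,   # assumption
    learning_rate=2e-4,              # assumption
    num_train_epochs=1,              # assumption
    bf16=True,
    logging_steps=10,
    deepspeed="ds_zero1.json",       # hypothetical ZeRO-1 config file
)

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    args=args,
    train_dataset=load_dataset("yahma/alpaca-cleaned", split="train"),
    peft_config=peft_config,
    formatting_func=format_example,
    packing=True,       # packs examples into fixed-length sequences
    max_seq_length=1024,
)
trainer.train()
```

A multi-GPU run like this would typically be launched with `deepspeed --num_gpus 4 train.py` (hypothetical script name).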
## Model and Training Details
- Finetuned from model: meta-llama/Llama-2-7b-hf
- Dataset: yahma/alpaca-cleaned
### Preprocessing
- Preprocessed and packed the SFT dataset with `trl.trainer.ConstantLengthDataset`, as sketched below
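
Here is a minimal sketch of that packing step in isolation. `ConstantLengthDataset` tokenizes formatted examples and concatenates them into fixed-length sequences; the `seq_length` value and prompt template are assumptions.

```python
from datasets import load_dataset
from transformers import AutoTokenizer
from trl.trainer import ConstantLengthDataset

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
dataset = load_dataset("yahma/alpaca-cleaned", split="train")

def format_example(example):
    # Alpaca-style prompt template (assumed)
    return (
        f"### Instruction:\n{example['instruction']}\n\n"
        f"### Input:\n{example['input']}\n\n"
        f"### Response:\n{example['output']}"
    )

packed = ConstantLengthDataset(
    tokenizer,
    dataset,
    formatting_func=format_example,
    seq_length=1024,  # fixed length of each packed sequence (assumed)
    infinite=False,
)

# Each element is a dict of 1024-token tensors: {"input_ids": ..., "labels": ...}
batch = next(iter(packed))
```

Packing removes padding waste by splicing multiple short Alpaca examples into each training sequence, which is a large speedup on instruction data with short responses.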
## Results
## Compute Infrastructure
The model was trained on 4x NVIDIA RTX 3090 (24 GB) GPUs.
## Model Card Authors
Yiyu (Michael) Ren
## Model Card Contact
Email: [email protected]
### Framework versions
- PEFT 0.8.2
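
Since only the LoRA adapter is published, inference requires attaching it to the base model with PEFT. A minimal sketch, assuming the adapter repo id shown on this page (`renyiyu/llama-2-7b-sft-lora`) and an Alpaca-style prompt:

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_model = "meta-llama/Llama-2-7b-hf"

# Load the frozen base model, then attach the published LoRA adapter
base = AutoModelForCausalLM.from_pretrained(
    base_model, torch_dtype=torch.float16, device_map="auto"
)
model = PeftModel.from_pretrained(base, "renyiyu/llama-2-7b-sft-lora")
tokenizer = AutoTokenizer.from_pretrained(base_model)

prompt = "### Instruction:\nExplain LoRA in one sentence.\n\n### Response:\n"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```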