Apply SFT and DPO to TinyLlama 1.1B
- SebastianSchramm/tinyllama-1.1B-intermediate-step-715k-1.5T-sft-lora
  Text Generation • Updated • 10
- SebastianSchramm/tinyllama-1.1B-intermediate-step-715k-1.5T-sft-lora-merged
  Text Generation • Updated • 3
- SebastianSchramm/tinyllama-1.1B-intermediate-step-715k-1.5T-dpo-lora
  Text Generation • Updated • 3
- SebastianSchramm/tinyllama-1.1B-intermediate-step-715k-1.5T-dpo-lora-merged
  Text Generation • Updated • 1.32k • 1