Is vicuna1.5 tuned from Llama-2 with or without reinforcement learning?

by zhiyuanyou - opened Oct 28, 2023

Discussion

zhiyuanyou

Oct 28, 2023

Llama-2 provides two visions, with / without reinforcement learning, i.e., with / without "-chat".

I wonder vicuna1.5 is tuned from Llama-2 with or without reinforcement learning?

martincpt

Nov 15, 2023

This is something what I was also wondering. Lmsys does not explicitly specify this information in their documentation.

Vicuna uses a specific template to inference and Llama-2-chat's format is differs from that. So I suppose they trained Vicuna upon the Llama2 base model.

lmzheng

Large Model Systems Organization org Nov 27, 2023

We finetune from the base.

lmzheng changed discussion status to closed Nov 27, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment