Update README.md
README.md CHANGED
@@ -2,7 +2,8 @@ GPT-R [Ronin]
 
 This is an experimental model containing a parameter-wise 60/40 blend (weighted average) of the weights of ppo_hh_gpt-j and GPT-JT-6B-v1.
 
--Intended Merge Value
+-Intended Merge Value-
+
 As with fine-tuning, merging weights does not add information but transforms it; it is therefore important to consider trade-offs.
 GPT-Ronin combines ppo_hh_gpt-j and GPT-JT; both technical
 achievements are blended with the intent to elevate the strengths of
@@ -27,7 +28,7 @@ by instruct.
 Merge tested using KoboldAI with Nucleus Sampling Top-P set to 0.7, Temperature at 0.5, and Repetition Penalty at 1.14; extra samplers
 disabled.
 
--Credits
+-Credits To-
 
 Core Model:
 https://huggingface.co/EleutherAI/gpt-j-6B
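The "parameter-wise 60/40 blend (weighted average)" described in the README can be sketched as follows. This is an illustration, not the actual merge script: real merges operate on PyTorch state dicts of tensors, but plain Python floats stand in for tensors here so the sketch stays dependency-free, and the name `merge_state_dicts` is hypothetical.

```python
def merge_state_dicts(sd_a, sd_b, weight_a=0.6):
    """Return a new state dict: weight_a * a + (1 - weight_a) * b, per parameter.

    Assumes both models share an architecture (same parameter names and
    shapes), as two GPT-J-6B fine-tunes would.
    """
    if sd_a.keys() != sd_b.keys():
        raise ValueError("models must share the same parameter names")
    return {
        name: weight_a * sd_a[name] + (1.0 - weight_a) * sd_b[name]
        for name in sd_a
    }

# Toy example with scalar "parameters" (hypothetical values):
ppo_hh = {"wte.weight": 1.0, "lm_head.bias": 2.0}
gpt_jt = {"wte.weight": 3.0, "lm_head.bias": 4.0}
merged = merge_state_dicts(ppo_hh, gpt_jt, weight_a=0.6)
# merged["wte.weight"] == 0.6 * 1.0 + 0.4 * 3.0 == 1.8
```

Note that, as the README says, the blend transforms rather than adds information: every merged parameter lies on the line segment between its two parents, so capabilities unique to either parent can be diluted.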
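The test settings mentioned above (Top-P 0.7, Temperature 0.5, Repetition Penalty 1.14, other samplers disabled) can be sketched as a single logits-filtering step. This mirrors the sampler behaviour commonly implemented in KoboldAI and transformers, not KoboldAI's exact code; `filter_logits` and the toy logit values are illustrative.

```python
import math

def filter_logits(logits, recent_ids, top_p=0.7, temperature=0.5, rep_penalty=1.14):
    logits = list(logits)
    # Repetition penalty: dampen tokens that already appeared in the context.
    for i in set(recent_ids):
        logits[i] = logits[i] / rep_penalty if logits[i] > 0 else logits[i] * rep_penalty
    # Temperature: values < 1 sharpen the distribution (0.5 is fairly greedy).
    logits = [l / temperature for l in logits]
    # Softmax (shifted by the max for numerical stability).
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Nucleus (top-p): keep the smallest top set whose mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, mass = [], 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    # Renormalise over the kept tokens; sample from this reduced distribution.
    norm = sum(probs[i] for i in kept)
    return {i: probs[i] / norm for i in kept}

dist = filter_logits([2.0, 1.0, 0.5, -1.0], recent_ids=[1])
```

With a low temperature and a 0.7 nucleus, the reduced distribution often collapses to one or two candidates, which matches the fairly conservative decoding the merge was reportedly tested with.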