Update README.md
### The Good
We found that increasing the LoRA rank from 64 to 256 reduced repetition, but also led to language that resembles Claude more than the rank-64 version's did. No worries, it's still far enough from Claude.

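The rank bump above can be expressed as a LoRA config; here is a minimal sketch using the Hugging Face `peft` library, where `lora_alpha`, `lora_dropout`, and `target_modules` are illustrative placeholders, not our actual run settings:

```python
from peft import LoraConfig

# Sketch only: r=256 matches the run described above; every other
# hyperparameter here is an assumed placeholder, not the real recipe.
lora_config = LoraConfig(
    r=256,                 # LoRA rank, raised from 64 in the earlier run
    lora_alpha=32,         # assumed scaling factor
    lora_dropout=0.05,     # assumed dropout
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # assumed
    task_type="CAUSAL_LM",
)
```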
**Model follows "OOC:" prompts religiously. Exceptional!**

It also led to **increased coherency but reduced system prompt following (when not OOC)**, likely because the model diverged further from L3 8B Instruct.

We found that increasing the amount of data from 1K to 6.5K reduced repetition as well.

The model is uncensored for RP. For Instruct, it needs 2-3 words of prefill for the first message.

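A prefill can be wired in when rendering the chat; below is a minimal pure-Python sketch, where `build_prompt` is a hypothetical helper and the template string is illustrative rather than the model's exact chat template:

```python
def build_prompt(messages, prefill=""):
    """Render messages into an L3-style chat string (illustrative
    template) and leave the assistant turn open, seeded with a prefill."""
    parts = []
    for m in messages:
        parts.append(
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n"
            f"{m['content']}<|eot_id|>"
        )
    # The assistant turn is left unterminated and seeded with 2-3 words,
    # so the model continues our sentence instead of starting fresh.
    parts.append(f"<|start_header_id|>assistant<|end_header_id|>\n\n{prefill}")
    return "".join(parts)

prompt = build_prompt(
    [{"role": "user", "content": "Write a short heist scene."}],
    prefill="Sure, here's",
)
```

The generated text then picks up directly after the prefill words.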
The **prose is much better** and **the style range is far wider** than in other synthetic data generations. The model also demonstrates increased **style copying abilities** (from fewshot), likely a result of the human longform data and varying writing styles found in WritingPrompts.

The model is **exceptional at being creative in roleplaying** and knows different personas; even a single character will change persona in different contexts, since persona is tied to the last few messages rather than the system message or character card. **This is great as it often means the model can do impressive things without you needing to explicitly specify them.**

### Improvements for Next Run
Formatting can break sometimes.
Repetition can become an issue with certain types of prompts. Removing the system prompt helps.

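Dropping the system turn before templating is a one-liner; here is a minimal sketch, where `drop_system` is a hypothetical helper name:

```python
def drop_system(messages):
    # Keep every turn except the system one; order is preserved.
    return [m for m in messages if m["role"] != "system"]

chat = [
    {"role": "system", "content": "You are a narrator."},
    {"role": "user", "content": "Continue the story."},
]
trimmed = drop_system(chat)  # only the user turn remains
```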
In some contexts the model is "all over the place" and doesn't stick to a coherent narrative. I need to study this further, as it's a complex trait which manifests in different amounts and can be good or bad depending on what the user wants to get out of the model.

### Comments about training