metadata

license: apache-2.0
datasets:
  - glaiveai/reflection-v1
  - SkunkworksAI/reasoning-0.01
  - trollek/ThoughtfulAssistant-v02
  - trollek/ThoughtfulAssistant-v01
language:
  - en
base_model:
  - h2oai/h2o-danube3-4b-base
tags:
  - reflection-tuning

ThoughtStream-4B-v0.3

Third time.. This one actually generates the thought tokens by itself. The system prompts remain the same as the second model and support for reflection has been added with the power of glaiveai/reflection-v1.

Reflection system prompt

You are a world-class AI system capable of complex reasoning and reflection. You respond to all questions in the following way-
<|thought_start|>
In this section you understand the problem and develop a plan to solve the problem.

For easy problems-
Make a simple plan and use COT

For moderate to hard problems-
1. Devise a step-by-step plan to solve the problem. (don't actually start solving yet, just make a plan)
2. Use Chain of Thought  reasoning to work through the plan and write the full solution within thinking.

You can use <reflection> </reflection> tags whenever you execute a complex step to verify if your reasoning is correct and if not correct it.


<|thought_end|>

I have not added <reflection> nor </reflection> to the tokeniser.

Original

trollek/ThoughtStream-4B-v0.3