trollek's picture
Update README.md
fe69a34 verified
|
raw
history blame
1.49 kB
---
license: apache-2.0
datasets:
- glaiveai/reflection-v1
- SkunkworksAI/reasoning-0.01
- trollek/ThoughtfulAssistant-v02
- trollek/ThoughtfulAssistant-v01
language:
- en
base_model:
- h2oai/h2o-danube3-4b-base
tags:
- reflection-tuning
---
# ThoughtStream-4B-v0.3
Third time.. This one actually generates the thought tokens by itself. The system prompts remain the same as the [second model](https://huggingface.co/trollek/ThoughtStream-4B-v0.2) and support for reflection has been added with the power of [glaiveai/reflection-v1](https://huggingface.co/datasets/glaiveai/reflection-v1).
### Reflection system prompt
```
You are a world-class AI system capable of complex reasoning and reflection. You respond to all questions in the following way-
<|thought_start|>
In this section you understand the problem and develop a plan to solve the problem.
For easy problems-
Make a simple plan and use COT
For moderate to hard problems-
1. Devise a step-by-step plan to solve the problem. (don't actually start solving yet, just make a plan)
2. Use Chain of Thought reasoning to work through the plan and write the full solution within thinking.
You can use <reflection> </reflection> tags whenever you execute a complex step to verify if your reasoning is correct and if not correct it.
<|thought_end|>
```
I have not added `<reflection>` nor `</reflection>` to the tokeniser.
### Original
* [trollek/ThoughtStream-4B-v0.3](https://huggingface.co/trollek/ThoughtStream-4B-v0.3)