trollek/ThoughtStream-4B-v0.3-GGUF

	---
	license: apache-2.0
	datasets:
	- glaiveai/reflection-v1
	- SkunkworksAI/reasoning-0.01
	- trollek/ThoughtfulAssistant-v02
	- trollek/ThoughtfulAssistant-v01
	language:
	- en
	base_model:
	- h2oai/h2o-danube3-4b-base
	tags:
	- reflection-tuning
	---
	# ThoughtStream-4B-v0.3

	Third time... This one actually generates the thought tokens by itself. The system prompts remain the same as in the [second model](https://huggingface.co/trollek/ThoughtStream-4B-v0.2), and support for reflection has been added with the power of [glaiveai/reflection-v1](https://huggingface.co/datasets/glaiveai/reflection-v1).

	### Reflection system prompt

	```
	You are a world-class AI system capable of complex reasoning and reflection. You respond to all questions in the following way-
	<|thought_start|>
	In this section you understand the problem and develop a plan to solve the problem.

	For easy problems-
	Make a simple plan and use COT

	For moderate to hard problems-
	1. Devise a step-by-step plan to solve the problem. (don't actually start solving yet, just make a plan)
	2. Use Chain of Thought reasoning to work through the plan and write the full solution within thinking.

	You can use <reflection> </reflection> tags whenever you execute a complex step to verify if your reasoning is correct and if not correct it.


	<|thought_end|>
	```

	I have not added `<reflection>` or `</reflection>` to the tokeniser, so they are tokenised as ordinary text rather than as dedicated tokens.
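
Since the reflection tags are plain text, they can be picked out of the decoded output with ordinary string matching. The following is a minimal sketch, not part of the model's tooling: it assumes the thought markers appear literally as `<|thought_start|>` and `<|thought_end|>` in the decoded text (which depends on your runtime not stripping special tokens) and that the final answer follows the closing marker. The function name `split_response` and the sample string are hypothetical.

```python
import re

# Assumed literal markers; adjust if your runtime renders them differently.
THOUGHT_RE = re.compile(r"<\|thought_start\|>(.*?)<\|thought_end\|>", re.DOTALL)
REFLECTION_RE = re.compile(r"<reflection>(.*?)</reflection>", re.DOTALL)


def split_response(text):
    """Split decoded output into the thought, its reflections, and the answer."""
    match = THOUGHT_RE.search(text)
    if match is None:
        # No thought section found: treat the whole text as the answer.
        return {"thought": "", "reflections": [], "answer": text.strip()}
    thought = match.group(1).strip()
    # Reflections are searched for only inside the thought section.
    reflections = [r.strip() for r in REFLECTION_RE.findall(thought)]
    # Everything after the closing marker is taken to be the answer.
    answer = text[match.end():].strip()
    return {"thought": thought, "reflections": reflections, "answer": answer}


# Hand-written sample, not real model output.
sample = (
    "<|thought_start|>\nPlan: add the numbers.\n"
    "<reflection>2 + 2 is 4, so the plan holds.</reflection>\n"
    "<|thought_end|>\nThe answer is 4."
)
parts = split_response(sample)
print(parts["answer"])
```

Keeping the thought and the answer separate makes it easy to show only the answer to end users while logging the reasoning for inspection.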

	### Original

	* [trollek/ThoughtStream-4B-v0.3](https://huggingface.co/trollek/ThoughtStream-4B-v0.3)