liminerity
/

Mistral-quiet-star-demo

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

gate369 commited on Mar 23

Commit

4756708

•

1 Parent(s): 33e0fc2

Update README.md

Files changed (1) hide show

README.md +9 -8

README.md CHANGED Viewed

@@ -15,14 +15,15 @@ datasets:
 - QSTAR
 -  I think this model is proof of my theory that you dont need a special architecture to train a llm to reason. The techniques i imployed to make this could
    be greatly expanded on and coupled with a agent system to achieve a functioning low cost agi system.
--Steps:   -Loss:
-- 356	     - 0.014300
-- 357	     - 0.012400
-- 358	     - 0.016800
-- 359	     - 0.022200
-- 360	     - 0.015000
-- 361	     - 0.018300
-- 362	     - 0.016000
 - 363	     - 0.019000
 - 364	     - 0.017600
 - 365	     - 0.015600

 - QSTAR
 -  I think this model is proof of my theory that you dont need a special architecture to train a llm to reason. The techniques i imployed to make this could
    be greatly expanded on and coupled with a agent system to achieve a functioning low cost agi system.
+- Steps:        -Loss:
+- 1	          - 1.373000
+- 2	          - 1.551400
+- 3	          - 1.083100
+- 4	          - 1.164900
+- 5 	      - 1.196500
+- 6	          - 1.015400
+-       ....
 - 363	     - 0.019000
 - 364	     - 0.017600
 - 365	     - 0.015600