Update README.md
Browse files
README.md
CHANGED
@@ -15,14 +15,15 @@ datasets:
|
|
15 |
- QSTAR
|
16 |
- I think this model is proof of my theory that you dont need a special architecture to train a llm to reason. The techniques i imployed to make this could
|
17 |
be greatly expanded on and coupled with a agent system to achieve a functioning low cost agi system.
|
18 |
-
|
19 |
-
-
|
20 |
-
-
|
21 |
-
-
|
22 |
-
-
|
23 |
-
-
|
24 |
-
-
|
25 |
-
-
|
|
|
26 |
- 363 - 0.019000
|
27 |
- 364 - 0.017600
|
28 |
- 365 - 0.015600
|
|
|
15 |
- QSTAR
|
16 |
- I think this model is proof of my theory that you dont need a special architecture to train a llm to reason. The techniques i imployed to make this could
|
17 |
be greatly expanded on and coupled with a agent system to achieve a functioning low cost agi system.
|
18 |
+
|
19 |
+
- Steps: -Loss:
|
20 |
+
- 1 - 1.373000
|
21 |
+
- 2 - 1.551400
|
22 |
+
- 3 - 1.083100
|
23 |
+
- 4 - 1.164900
|
24 |
+
- 5 - 1.196500
|
25 |
+
- 6 - 1.015400
|
26 |
+
- ....
|
27 |
- 363 - 0.019000
|
28 |
- 364 - 0.017600
|
29 |
- 365 - 0.015600
|