ezelikman
/

quietstar-8-ahead

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ezelikman commited on Mar 23

Commit

f86e33e

•

1 Parent(s): fdd9b3e

Create README.md

Files changed (1) hide show

README.md +6 -0

README.md ADDED Viewed

	@@ -0,0 +1,6 @@

+---
+datasets:
+- open-web-math/open-web-math
+---
+Mistral-7b with continued pretraining using Quiet-STaR (https://arxiv.org/abs/2403.09629) for generating 8 thought tokens before each output token.