Thank you for this interesting initiative.
The dataset is a combination of wiki, stories, arXiv, math, and code.
Detailed documentation of the dataset would be very helpful.