by @RachidAR
Note Trained from scratch with TinyStories dataset. Report: https://api.wandb.ai/links/rachidar05/gl1grgwq -Oct 15 2023-