---
license: mit
datasets:
  - qwedsacf/story-generation
language:
  - en
---

LLamaStory-70M is a LLaMA model pre-trained on a story-generation dataset.
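
A minimal inference sketch with Hugging Face `transformers` is shown below. The repository id `erfanzar/LLamaStory-70M` is an assumption based on this card's title and may differ from the actual hosted name.

```python
# Minimal text-generation sketch; the repo id below is an assumption.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "erfanzar/LLamaStory-70M"  # assumed Hugging Face repository id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

prompt = "Once upon a time"
inputs = tokenizer(prompt, return_tensors="pt")

# Keep generation within the model's 512-token position-embedding limit.
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```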

About Training:

- Trained with the EasyDel platform
- TPU-v4
- Batch size: 2048
- Maximum position embeddings: 512
- 12 epochs (so far)

This model will be used to debug 4-bit and 8-bit training and inference in JAX and Rust with EasyDel.
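
As a rough illustration of the kind of quantized inference this model is meant to exercise, the sketch below loads it in 8-bit with the `transformers` + `bitsandbytes` PyTorch stack. This is not the EasyDel/JAX or Rust workflow mentioned above, and the repository id is again an assumption.

```python
# Illustrative 8-bit loading sketch (PyTorch + bitsandbytes), not the
# EasyDel/JAX path; requires accelerate and a CUDA-capable GPU.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

repo_id = "erfanzar/LLamaStory-70M"  # assumed repository id

quant_config = BitsAndBytesConfig(load_in_8bit=True)

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    quantization_config=quant_config,
    device_map="auto",
)

inputs = tokenizer("Once upon a time", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```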