Update README.md

README.md CHANGED

```diff
@@ -28,7 +28,7 @@ The name `nekomata` comes from the Japanese word [`猫又/ねこまた/Nekomata`
 
 * **Library**
 
-The model was trained using code based on [
+The model was trained using code based on [aws-neuron/neuronx-nemo-megatron](https://github.com/aws-neuron/neuronx-nemo-megatron/).
 
 * **Model architecture**
 
@@ -126,19 +126,5 @@ We compared the `Qwen` tokenizer (as used in `nekomata`) and the `llama-2` token
 ~~~
 ---
 
-# Citations
-~~~
-@software{gpt-neox-library,
-  title = {{GPT-NeoX: Large Scale Autoregressive Language Modeling in PyTorch}},
-  author = {Andonian, Alex and Anthony, Quentin and Biderman, Stella and Black, Sid and Gali, Preetham and Gao, Leo and Hallahan, Eric and Levy-Kramer, Josh and Leahy, Connor and Nestler, Lucas and Parker, Kip and Pieler, Michael and Purohit, Shivanshu and Songz, Tri and Phil, Wang and Weinbach, Samuel},
-  url = {https://www.github.com/eleutherai/gpt-neox},
-  doi = {10.5281/zenodo.5879544},
-  month = {8},
-  year = {2021},
-  version = {0.0.1},
-}
-~~~
----
-
 # License
 [Tongyi Qianwen LICENSE AGREEMENT](https://github.com/QwenLM/Qwen/blob/main/Tongyi%20Qianwen%20LICENSE%20AGREEMENT)
```