Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
OpenNLPLab
/
TransNormerLLM-385M
like
8
Text Generation
Transformers
PyTorch
English
Chinese
TransNormerLLM
custom_code
arxiv:
2307.14995
arxiv:
2009.03300
License:
other
Model card
Files
Files and versions
Community
Train
Use this model
dbdaec7
TransNormerLLM-385M
1 contributor
History:
14 commits
OpenNLPLab
Update README.md
dbdaec7
11 months ago
.gitattributes
1.52 kB
initial commit
12 months ago
Community License for TransNormerLLM Model.pdf
263 kB
Upload Community License for TransNormerLLM Model.pdf
12 months ago
README.md
13.6 kB
Update README.md
11 months ago
TransNormerLLM模型社区许可协议.pdf
294 kB
Upload TransNormerLLM模型社区许可协议.pdf
12 months ago
config.json
1.03 kB
Publish 385M Model
12 months ago
configuration_transnormer.py
2.27 kB
Publish 385M Model
12 months ago
generation_config.json
110 Bytes
Publish 385M Model
12 months ago
lightning_attention.py
15.3 kB
Publish 385M Model
12 months ago
modeling_transnormer.py
40.3 kB
Publish 385M Model
12 months ago
norm.py
1.25 kB
Publish 385M Model
12 months ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
,
"torch.BFloat16Storage"
What is a pickle import?
798 MB
LFS
Publish 385M Model
12 months ago
special_tokens_map.json
410 Bytes
Publish 385M Model
12 months ago
srmsnorm_triton.py
5.75 kB
Publish 385M Model
12 months ago
tokenization_baichuan.py
9.82 kB
Publish 385M Model
12 months ago
tokenizer.model
1.14 MB
LFS
Publish 385M Model
12 months ago
tokenizer_config.json
819 Bytes
Publish 385M Model
12 months ago
utils.py
3.77 kB
Publish 385M Model
12 months ago