OpenNLPLab/TransNormerLLM2-3B-300B
Text Generation
Transformers
PyTorch
English
Chinese
TransNormerLLM
custom_code
arxiv:2307.14995
arxiv:2210.10340
License: apache-2.0
main
TransNormerLLM2-3B-300B
1 contributor
History: 10 commits
OpenNLPLab: Upgrade to lightning att2 (5ba41e2, verified), 9 months ago
Name | Size | Last commit | Updated
images | - | Upload lightning-leopard.jpg | 10 months ago
.gitattributes | 1.52 kB | initial commit | 11 months ago
Community License for TransNormerLLM Model.pdf | 263 kB | Add license | 11 months ago
README.md | 9.89 kB | Update README.md | 10 months ago
TransNormerLLM模型社区许可协议.pdf | 294 kB | Add license | 11 months ago
config.json | 926 Bytes | Fix 3B config error | 10 months ago
configuration_transnormer.py | 2.27 kB | Publish 3B2-300B | 11 months ago
generation_config.json | 164 Bytes | Publish 3B2-300B | 11 months ago
lightning_attention.py | 15.3 kB | Publish 3B2-300B | 11 months ago
lightning_attention2.py | 15.3 kB | Upgrade to lightning att2 | 9 months ago
modeling_transnormer.py | 34.6 kB | Upgrade to lightning att2 | 9 months ago
norm.py | 1.27 kB | Publish 3B2-300B | 11 months ago
pytorch_model-00001-of-00003.bin | 1.97 GB (LFS, pickle) | Publish 3B2-300B | 11 months ago
  Detected Pickle imports (3): "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.BFloat16Storage"
pytorch_model-00002-of-00003.bin | 1.97 GB (LFS, pickle) | Publish 3B2-300B | 11 months ago
  Detected Pickle imports (3): "torch.BFloat16Storage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
pytorch_model-00003-of-00003.bin | 1.88 GB (LFS, pickle) | Publish 3B2-300B | 11 months ago
  Detected Pickle imports (3): "torch.BFloat16Storage", "collections.OrderedDict", "torch._utils._rebuild_tensor_v2"
pytorch_model.bin.index.json | 13.8 kB | Publish 3B2-300B | 11 months ago
special_tokens_map.json | 410 Bytes | Publish 3B2-300B | 11 months ago
srmsnorm_triton.py | 5.76 kB | Publish 3B2-300B | 11 months ago
tokenization_baichuan.py | 9.57 kB | Publish 3B2-300B | 11 months ago
tokenizer.model | 1.14 MB (LFS) | Publish 3B2-300B | 11 months ago
tokenizer_config.json | 819 Bytes | Publish 3B2-300B | 11 months ago
utils.py | 4.39 kB | Publish 3B2-300B | 11 months ago