Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
deepseek-ai
/
deepseek-moe-16b-base
like
83
Follow
DeepSeek
793
Text Generation
Transformers
Safetensors
deepseek
custom_code
arxiv:
2401.06066
License:
deepseek
Model card
Files
Files and versions
Community
7
Train
Use this model
main
deepseek-moe-16b-base
3 contributors
History:
9 commits
DeepSeekDDM
Update README.md
521d2bc
verified
10 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
README.md
Safe
2.19 kB
Update README.md
10 months ago
config.json
Safe
1.07 kB
initial commit
10 months ago
configuration_deepseek.py
Safe
10.2 kB
initial commit
10 months ago
generation_config.json
Safe
121 Bytes
initial commit
10 months ago
model-00001-of-00007.safetensors
Safe
5 GB
LFS
initial commit
10 months ago
model-00002-of-00007.safetensors
Safe
5 GB
LFS
initial commit
10 months ago
model-00003-of-00007.safetensors
Safe
5 GB
LFS
initial commit
10 months ago
model-00004-of-00007.safetensors
Safe
5 GB
LFS
initial commit
10 months ago
model-00005-of-00007.safetensors
Safe
5 GB
LFS
initial commit
10 months ago
model-00006-of-00007.safetensors
Safe
5 GB
LFS
initial commit
10 months ago
model-00007-of-00007.safetensors
Safe
2.77 GB
LFS
initial commit
10 months ago
model.safetensors.index.json
Safe
490 kB
initial commit
10 months ago
modeling_deepseek.py
Safe
72.7 kB
initial commit
10 months ago
tokenizer.json
Safe
4.61 MB
initial commit
10 months ago
tokenizer_config.json
Safe
793 Bytes
initial commit
10 months ago