Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
aisingapore
/
sea-lion-3b
like
14
Text Generation
Transformers
Safetensors
11 languages
mpt
custom_code
text-generation-inference
Inference Endpoints
arxiv:
2101.09635
License:
mit
Model card
Files
Files and versions
Community
3
Train
Deploy
Use this model
refs/pr/2
sea-lion-3b
4 contributors
History:
41 commits
RaymondAISG
Add citation for Thai dataset
18e8e3b
9 months ago
.gitattributes
1.52 kB
initial commit
11 months ago
LICENSE
1.06 kB
Update LICENSE
11 months ago
README.md
5.53 kB
Add citation for Thai dataset
9 months ago
adapt_tokenizer.py
1.72 kB
Add 3B model files
11 months ago
attention.py
21.6 kB
Add 3B model files
11 months ago
blocks.py
2.84 kB
Add 3B model files
11 months ago
config.json
1.27 kB
Add 3B model files
11 months ago
configuration_mpt.py
11 kB
Add 3B model files
11 months ago
custom_embedding.py
292 Bytes
Add 3B model files
11 months ago
fc.py
167 Bytes
Add 3B model files
11 months ago
ffn.py
1.75 kB
Add 3B model files
11 months ago
flash_attn_triton.py
28.2 kB
Add 3B model files
11 months ago
generation_config.json
91 Bytes
Add 3B model files
11 months ago
hf_prefixlm_converter.py
11.4 kB
Update codes to be in line with LLM-foundry update on October 30, 2023
10 months ago
meta_init_context.py
3.96 kB
Add 3B model files
11 months ago
model.safetensors
6.36 GB
LFS
Add 3B model files
11 months ago
modeling_mpt.py
24.2 kB
Add 3B model files
11 months ago
norm.py
3.12 kB
Add 3B model files
11 months ago
param_init_fns.py
11.9 kB
Add 3B model files
11 months ago
special_tokens_map.json
59 Bytes
Add 3B model files
11 months ago
tokenization_SEA_BPE.py
7.8 kB
Add 3B model files
11 months ago
tokenizer.model
4.57 MB
LFS
Add 3B model files
11 months ago
tokenizer_config.json
795 Bytes
Add 3B model files
11 months ago