|
--- |
|
license: mit |
|
language: en |
|
tags: |
|
- LLM |
|
- XVERSE-13B-Chat |
|
--- |
|
## Model Card for lyraXVERSE |
|
|
|
## Speed |
|
|
|
* Evaluated at tokens/s |
|
* test on A100 40G |
|
* MEMOPT mode |
|
|
|
### XVERSE-13B-Chat |
|
|
|
## Docker Environment Recommendation |
|
|
|
- For Cuda 11.X: we recommend ```nvcr.io/nvidia/pytorch:22.12-py3``` |
|
- For Cuda 12.0: we recommend ```nvcr.io/nvidia/pytorch:23.02-py3``` |
|
|
|
```bash |
|
docker pull nvcr.io/nvidia/pytorch:23.02-py3 |
|
docker run --rm -it --gpus all -v ./:/lyraXVERSE nvcr.io/nvidia/pytorch:23.02-py3 |
|
|
|
pip install -r requirements.txt |
|
python demo.py |
|
``` |
|
|
|
## Uses |
|
|
|
```python |
|
from lyra_xverse import lyraXVERSE |
|
|
|
model_path = "./models/" |
|
tokenizer_path = "./models/" |
|
inference_dtype = 'fp16' |
|
prompt = "讲个故事:" |
|
memopt_mode = 1 |
|
max_output_length = 512 |
|
arch = "Ampere" # Ampere or Volta |
|
cuda_version = 12 # cuda version, we currently support 11 and 12 |
|
|
|
model = lyraXVERSE(model_path, |
|
tokenizer_path = tokenizer_path, |
|
dtype = inference_dtype, |
|
memopt_mode = memopt_mode, |
|
arch = arch, |
|
cuda_version = cuda_version) |
|
|
|
``` |
|
|
|
## Demo Outputs |
|
|
|
### XVERSE-13B-Chat |
|
#### input |
|
|
|
讲个故事: |
|
|
|
#### output |
|
|
|
有一天,一位年轻的画家来到了一个偏远的村庄。他以其超凡的绘画技巧,为村民画了一幅美丽的图画。图画里,村庄的周围是翠绿的森林,清澈的溪流在其中流淌,村民们正在劳作,孩子们在田野里嬉戏。村民们看着这幅画,都对这位画家赞不绝口。\n\n村庄的领袖看到了这幅画,他想:“这幅画将会让我们的村庄更加美丽,我们应该让村民们知道这幅画。”于是,他带着画家去村庄的各个角落,让每一个村民都看到了这幅画。\n\n画家看着村民们看画的眼神,他意识到了自己的价值。他意识到,他不仅仅是一个画家,他也是一个能让人们看见希望的人。他的画不仅仅是艺术品,它是连接人们与希望的一座桥梁。\n\n这个故事告诉我们,画家的价值不只是他们的绘画技巧,而是他们的画作带给人们的感动和希望。画家的价值并不在于他们的画有多么昂贵,有多么独特,而在于他们能用画作打开人们的心扉,让人们看见希望,看见生活的美好。 |
|
|
|
## TODO |
|
|
|
## Citation |
|
``` bibtex |
|
@Misc{lyraXVERSE2023, |
|
author = {Kangjian Wu, Zhengtao Wang, Yibo Lu, Haoxiong Su, Bin Wu}, |
|
title = {lyraXVERSE: Accelerating XVERSE-13B-Chat(fp16) to 3000+ tokens/s}, |
|
howpublished = {\url{https://huggingface.co/TMElyralab/lyraXVERSE}}, |
|
year = {2023} |
|
} |
|
``` |
|
|
|
## Report bug |
|
- start a discussion to report any bugs!--> https://huggingface.co/TMElyralab/lyraXVERSE |
|
- report bug with a `[bug]` mark in the title. |
|
|