Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DiscoResearch
/
Llama3-German-8B-32k
like
11
Text Generation
Transformers
Safetensors
German
llama
text-generation-inference
Inference Endpoints
arxiv:
2404.10830
License:
llama3
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
Experimental 128k context with rope scaling
#2
by
jphme
- opened
about 1 month ago
base:
refs/heads/main
←
from:
refs/pr/2
Discussion
Files changed
+8
-2
Experimental 128k context with rope scaling
74f021d4
jphme
Disco Research org
about 1 month ago
Changed Rope scaling config
See translation
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Ready to merge
This branch is ready to get merged automatically.
Comment
·
Sign up
or
log in
to comment