GPT2 French base model (Uncased)
Prerequisites
transformers==4.19.2
Model architecture
This model uses the GPT2 base settings, except for the vocabulary size.
Tokenizer
The model uses a BPE tokenizer with a vocabulary size of 50,000.
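To illustrate what changing only the vocabulary size does to the model, the sketch below counts parameters. It assumes that "GPT2 base settings" means the standard GPT-2 small configuration (12 layers, hidden size 768, 1,024 positions) with the output head tied to the token embeddings. The function name `gpt2_param_count` and its defaults are chosen for illustration and are not read from this model's config file.

```python
def gpt2_param_count(vocab_size, n_embd=768, n_layer=12, n_positions=1024):
    """Count parameters of a GPT-2 style decoder with tied embeddings.

    The defaults are the standard GPT-2 small values (an assumption here).
    """
    # Token and position embedding tables.
    embeddings = vocab_size * n_embd + n_positions * n_embd
    # One transformer block: two LayerNorms, attention, and a 4x-wide MLP.
    layer_norms = 2 * (2 * n_embd)
    attention = (n_embd * 3 * n_embd + 3 * n_embd) + (n_embd * n_embd + n_embd)
    mlp = (n_embd * 4 * n_embd + 4 * n_embd) + (4 * n_embd * n_embd + n_embd)
    block = layer_norms + attention + mlp
    # Final LayerNorm; the output head reuses the token embedding weights.
    final_norm = 2 * n_embd
    return embeddings + n_layer * block + final_norm


french = gpt2_param_count(50_000)    # vocabulary size stated in this card
original = gpt2_param_count(50_257)  # vocabulary size of the original GPT-2
print(f"{french:,} vs {original:,} parameters")
```

Under these assumptions the two counts differ by only 257 × 768 = 197,376 parameters, all of them in the token embedding table.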
Training Data
- wiki40b/fr (French Wikipedia)
- Subset of CC-100/fr (Monolingual Datasets from Web Crawl Data)
Usage
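from transformers import pipeline

# Load the model and its tokenizer from the Hugging Face Hub.
generator = pipeline('text-generation', model='ClassCat/gpt2-base-french')
# Request five continuations, each capped at 50 tokens including the prompt.
generator("Je vais à la", max_length=50, num_return_sequences=5)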