tmp_trainer
This model is a fine-tuned version of facebook/opt-350m on the addressWithContext dataset.
Model description
Set max_new_tokens=20 when creating the pipeline; otherwise, the model will generate only one token at a time.
from transformers import pipeline

# Load the fine-tuned model as a text-generation pipeline
nlp = pipeline("text-generation",
               model="piazzola/tmp_trainer",
               max_new_tokens=20)
nlp("I live at 15 Firstfield Road.")
Note: if you try longer sentences with the Hosted inference API widget on the right-hand side of this page, you may need to click "Compute" more than once to get the full address.
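By default, a text-generation pipeline returns a list of dictionaries whose generated_text field contains the prompt followed by the continuation. The sketch below shows one way to separate the two. The helper name split_continuation and the sample output are hypothetical and only illustrate the shape of the result; they are not actual output from this model.

```python
def split_continuation(prompt, outputs):
    """Return the text generated after the prompt for each pipeline output."""
    continuations = []
    for item in outputs:
        text = item["generated_text"]
        # By default the pipeline echoes the prompt at the start of the text.
        if text.startswith(prompt):
            text = text[len(prompt):]
        continuations.append(text.strip())
    return continuations


# Hypothetical output shaped like a text-generation pipeline result;
# the continuation is made up for illustration, not real model output.
prompt = "I live at 15 Firstfield Road."
fake_outputs = [{"generated_text": prompt + " 15 Firstfield Road"}]
print(split_continuation(prompt, fake_outputs))
```

Alternatively, passing return_full_text=False to the pipeline call asks it to return only the newly generated text, which makes this helper unnecessary.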
Intended uses & limitations
The model is intended to detect addresses that occur in a sentence.
Training and evaluation data
This model is trained on piazzola/addressWithContext.
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3.0
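To illustrate what the linear lr_scheduler_type means for the learning rate listed above, here is a minimal sketch of a linear decay from 5e-05 to zero. It assumes no warmup steps, since none are listed in this card. The function name linear_lr and the total step count in the example are hypothetical; the actual number of training steps depends on the dataset size, which is not stated here.

```python
BASE_LR = 5e-05  # learning_rate from the hyperparameter list


def linear_lr(step, total_steps, base_lr=BASE_LR):
    """Learning rate at a given step under linear decay with no warmup."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Fraction of training remaining, clamped at zero past the final step.
    remaining = max(0.0, (total_steps - step) / total_steps)
    return base_lr * remaining


# Hypothetical run of 1000 optimizer steps (the real count is unknown).
for step in (0, 250, 500, 1000):
    print(step, linear_lr(step, total_steps=1000))
```

The rate starts at the configured value and falls in a straight line to zero by the last step of the third epoch.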
Framework versions
- Transformers 4.34.0
- Pytorch 2.0.1+cu118
- Datasets 2.14.5
- Tokenizers 0.14.1