speech to text
Collection
Speech to text models
•
8 items
•
Updated
This model is a fine-tuned version of openai/whisper-base on the mozilla-foundation/common_voice_16_0 hi dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss | Wer |
---|---|---|---|---|
0.553 | 0.1 | 100 | 0.6445 | 39.4988 |
0.3683 | 1.08 | 200 | 0.5342 | 33.0660 |
0.2855 | 2.07 | 300 | 0.4983 | 31.4251 |
0.2233 | 3.06 | 400 | 0.4868 | 30.1547 |
0.1832 | 4.04 | 500 | 0.4783 | 28.9540 |
0.1431 | 5.03 | 600 | 0.4902 | 29.1828 |
0.0972 | 6.01 | 700 | 0.5049 | 28.6380 |
0.0715 | 6.11 | 800 | 0.5205 | 28.5029 |
0.0579 | 7.09 | 900 | 0.5366 | 28.9475 |
0.0519 | 8.08 | 1000 | 0.5381 | 28.7949 |
Base model
openai/whisper-base