chrisjay/afrospeech-wav2vec-run

Audio Classification

afro-digits-speech


chrisjay committed on Oct 10, 2022

Commit 4d194d2 (1 parent: a14186b)

added updates

Files changed (1)

README.md +14 -11


@@ -25,15 +25,7 @@ model-index:
 # afrospeech-wav2vec-run
-This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the [crowd-speech-africa](https://huggingface.co/datasets/chrisjay/crowd-speech-africa), which was a crowd-sourced dataset collected using the [afro-speech Space](https://huggingface.co/spaces/chrisjay/afro-speech). It achieves the following results on the [validation set](VALID_rundi_run_audio_data.csv):
-- F1: 0.8
-- Accuracy: 0.8
-The confusion matrix below helps to give a better look at the model's performance across the digits. Through it, we can see the precision and recall of the model as well as other important insights.
-![confusion matrix](afrospeech-wav2vec-run_confusion_matrix_VALID.png)
 ## Training and evaluation data
@@ -46,8 +38,19 @@ Below is a distribution of the dataset (training and validation)
 ![digits-bar-plot-for-afrospeech](digits-bar-plot-for-afrospeech-wav2vec-run.png)
-### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
@@ -56,7 +59,7 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - num_epochs: 150
-### Training results
 | Training Loss | Epoch |  Validation Accuracy |
 |:-------------:|:-----:|:--------:|

 # afrospeech-wav2vec-run
+This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the [crowd-speech-africa](https://huggingface.co/datasets/chrisjay/crowd-speech-africa), which was a crowd-sourced dataset collected using the [afro-speech Space](https://huggingface.co/spaces/chrisjay/afro-speech).
 ## Training and evaluation data
 ![digits-bar-plot-for-afrospeech](digits-bar-plot-for-afrospeech-wav2vec-run.png)
+## Evaluation performance
+It achieves the following results on the [validation set](VALID_rundi_run_audio_data.csv):
+- F1: 0.8
+- Accuracy: 0.8
+The confusion matrix below helps to give a better look at the model's performance across the digits. Through it, we can see the precision and recall of the model as well as other important insights.
+![confusion matrix](afrospeech-wav2vec-run_confusion_matrix_VALID.png)
+## Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - num_epochs: 150
+## Training results
 | Training Loss | Epoch |  Validation Accuracy |
 |:-------------:|:-----:|:--------:|
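
The "Evaluation performance" section added in this commit notes that precision and recall can be read off the confusion matrix. A minimal sketch of that calculation follows. It assumes rows are true digits and columns are predicted digits (the orientation of the plotted matrix is not stated in the card), and the 3-class `toy_cm` is hypothetical illustration data, not the model's actual results.

```python
def per_class_precision_recall(cm):
    """Return a list of (precision, recall) pairs, one per class.

    Assumption: cm[i][j] counts samples whose true class is i
    and whose predicted class is j.
    """
    n = len(cm)
    results = []
    for k in range(n):
        true_positives = cm[k][k]
        predicted_as_k = sum(cm[i][k] for i in range(n))  # column sum
        actually_k = sum(cm[k])                           # row sum
        precision = true_positives / predicted_as_k if predicted_as_k else 0.0
        recall = true_positives / actually_k if actually_k else 0.0
        results.append((precision, recall))
    return results


def accuracy(cm):
    """Fraction of samples on the diagonal (correct predictions)."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[k][k] for k in range(len(cm)))
    return correct / total if total else 0.0


# Hypothetical 3-class confusion matrix, for illustration only.
toy_cm = [
    [8, 1, 1],
    [2, 7, 1],
    [0, 2, 8],
]

if __name__ == "__main__":
    for label, (p, r) in enumerate(per_class_precision_recall(toy_cm)):
        print(f"class {label}: precision={p:.2f} recall={r:.2f}")
    print(f"accuracy={accuracy(toy_cm):.2f}")
```

Per-class values like these show which digits drive the aggregate score, which a single F1 or accuracy figure hides.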