EzraWilliam
/

wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: facebook/wav2vec2-large-xlsr-53
 datasets:
 - common_voice_13_0
 metrics:
@@ -11,8 +11,8 @@ model-index:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
@@ -20,9 +20,9 @@ model-index:
       split: test
       args: id
     metrics:
-    - type: wer
-      value: 0.4416482300884956
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4428
-- Wer: 0.4416
 ## Model description
@@ -65,16 +65,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 2.9087        | 0.9   | 500  | 2.8298          | 1.0    |
-| 2.2394        | 1.8   | 1000 | 1.0606          | 0.8388 |
-| 1.1265        | 2.7   | 1500 | 0.6463          | 0.6179 |
-| 0.8905        | 3.6   | 2000 | 0.5702          | 0.5400 |
-| 0.7668        | 4.5   | 2500 | 0.5134          | 0.4991 |
-| 0.7048        | 5.4   | 3000 | 0.4763          | 0.4715 |
-| 0.667         | 6.29  | 3500 | 0.4657          | 0.4618 |
-| 0.6309        | 7.19  | 4000 | 0.4515          | 0.4506 |
-| 0.6002        | 8.09  | 4500 | 0.4407          | 0.4417 |
-| 0.6036        | 8.99  | 5000 | 0.4428          | 0.4416 |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: facebook/wav2vec2-large-xlsr-53
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_13_0
 metrics:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
       split: test
       args: id
     metrics:
+    - name: Wer
+      type: wer
+      value: 0.5390394542772862
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5309
+- Wer: 0.5390
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 4.6611        | 0.9   | 500  | 2.9516          | 1.0    |
+| 2.9146        | 1.8   | 1000 | 2.8772          | 1.0    |
+| 2.816         | 2.7   | 1500 | 2.4276          | 1.0    |
+| 1.9159        | 3.6   | 2000 | 1.0100          | 0.9116 |
+| 1.1756        | 4.5   | 2500 | 0.7206          | 0.7062 |
+| 0.9638        | 5.4   | 3000 | 0.6271          | 0.6327 |
+| 0.8657        | 6.29  | 3500 | 0.5767          | 0.5855 |
+| 0.7978        | 7.19  | 4000 | 0.5478          | 0.5578 |
+| 0.7513        | 8.09  | 4500 | 0.5329          | 0.5421 |
+| 0.7503        | 8.99  | 5000 | 0.5309          | 0.5390 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:63713071ab0dfe1c7568747db09ba5301fffdb597ac149f7c8ee9d2a4ad574cc
 size 1261991980

 version https://git-lfs.github.com/spec/v1
+oid sha256:087069421d11b0b24341bf7d8abd8b85d4b6713e2b61b9923613d172538156ac
 size 1261991980