EzraWilliam
/

wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: facebook/wav2vec2-large-xlsr-53
 datasets:
 - common_voice_13_0
 metrics:
@@ -11,8 +11,8 @@ model-index:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
@@ -20,9 +20,9 @@ model-index:
       split: test
       args: id
     metrics:
-    - type: wer
-      value: 0.5518989675516224
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5399
-- Wer: 0.5519
 ## Model description
@@ -65,16 +65,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 4.7217        | 0.9   | 500  | 2.9517          | 1.0    |
-| 2.9149        | 1.8   | 1000 | 2.8778          | 1.0    |
-| 2.851         | 2.7   | 1500 | 2.6437          | 1.0    |
-| 2.0653        | 3.6   | 2000 | 1.0367          | 0.8727 |
-| 1.1893        | 4.5   | 2500 | 0.7226          | 0.7006 |
-| 0.9685        | 5.4   | 3000 | 0.6301          | 0.6358 |
-| 0.8742        | 6.29  | 3500 | 0.5778          | 0.5890 |
-| 0.8076        | 7.19  | 4000 | 0.5576          | 0.5696 |
-| 0.7624        | 8.09  | 4500 | 0.5412          | 0.5525 |
-| 0.7604        | 8.99  | 5000 | 0.5399          | 0.5519 |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: facebook/wav2vec2-large-xlsr-53
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_13_0
 metrics:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod13
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
       split: test
       args: id
     metrics:
+    - name: Wer
+      type: wer
+      value: 0.4928097345132743
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4972
+- Wer: 0.4928
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 4.7378        | 0.9   | 500  | 2.9498          | 1.0    |
+| 2.91          | 1.8   | 1000 | 2.8716          | 1.0    |
+| 2.683         | 2.7   | 1500 | 1.9348          | 1.0    |
+| 1.5179        | 3.6   | 2000 | 0.8042          | 0.6992 |
+| 1.014         | 4.5   | 2500 | 0.6370          | 0.5932 |
+| 0.87          | 5.4   | 3000 | 0.5648          | 0.5443 |
+| 0.795         | 6.29  | 3500 | 0.5328          | 0.5177 |
+| 0.742         | 7.19  | 4000 | 0.5148          | 0.5016 |
+| 0.701         | 8.09  | 4500 | 0.4969          | 0.4943 |
+| 0.7002        | 8.99  | 5000 | 0.4972          | 0.4928 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e262a9d044347bd9218c883b7e8e57ebce504b7c87ac902849b45a5a2c147a60
 size 1261991980

 version https://git-lfs.github.com/spec/v1
+oid sha256:286120cf983b26af6d7afa25b73a519cdbf77884a9e4ba09f263f8bb67afb0cb
 size 1261991980