bookbot
/

distil-wav2vec2-xls-r-adult-child-cls-64m

@@ -1,68 +1,75 @@
 ---
 license: apache-2.0
 tags:
-- generated_from_trainer
 metrics:
-- accuracy
-- f1
 model-index:
-- name: distil-wav2vec2-xls-r-adult-child-cls
-  results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# distil-wav2vec2-xls-r-adult-child-cls
-This model is a fine-tuned version of [w11wo/wav2vec2-xls-r-adult-child-cls](https://huggingface.co/w11wo/wav2vec2-xls-r-adult-child-cls) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.2571
-- Accuracy: 0.9386
-- F1: 0.9425
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3e-05
-- train_batch_size: 16
-- eval_batch_size: 16
-- seed: 42
-- gradient_accumulation_steps: 4
-- total_train_batch_size: 64
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 5
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
-| 0.5509        | 1.0   | 191  | 0.3685          | 0.9086   | 0.9131 |
-| 0.4543        | 2.0   | 382  | 0.3113          | 0.9247   | 0.9285 |
-| 0.409         | 3.0   | 573  | 0.2723          | 0.9372   | 0.9418 |
-| 0.3024        | 4.0   | 764  | 0.2786          | 0.9381   | 0.9417 |
-| 0.3103        | 5.0   | 955  | 0.2571          | 0.9386   | 0.9425 |
-### Framework versions
 - Transformers 4.17.0.dev0
 - Pytorch 1.10.2+cu102
 - Datasets 1.18.3
-- Tokenizers 0.11.0

 ---
+language: en
 license: apache-2.0
 tags:
+  - audio-classification
+  - generated_from_trainer
 metrics:
+  - accuracy
+  - f1
 model-index:
+  - name: distil-wav2vec2-xls-r-adult-child-cls-64m
+    results: []
 ---
+# DistilWav2Vec2 XLS-R Adult/Child Speech Classifier 64M
+DistilWav2Vec2 XLS-R Adult/Child Speech Classifier is an audio classification model based on the [XLS-R](https://arxiv.org/abs/2111.09296) architecture. This model is a distilled version of [wav2vec2-xls-r-adult-child-cls](https://huggingface.co/bookbot/wav2vec2-xls-r-adult-child-cls) on a private adult/child speech classification dataset.
+This model was trained using HuggingFace's PyTorch framework. All training was done on a Tesla P100, provided by Kaggle. Training metrics were logged via Tensorboard.
+## Model
+| Model                                       | #params | Arch. | Training/Validation data (text)           |
+| ------------------------------------------- | ------- | ----- | ----------------------------------------- |
+| `distil-wav2vec2-xls-r-adult-child-cls-64m` | 64M     | XLS-R | Adult/Child Speech Classification Dataset |
+## Evaluation Results
+The model achieves the following results on evaluation:
+| Dataset                           | Loss   | Accuracy | F1     |
+| --------------------------------- | ------ | -------- | ------ |
+| Adult/Child Speech Classification | 0.2571 | 93.86%   | 0.9425 |
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- `learning_rate`: 3e-05
+- `train_batch_size`: 16
+- `eval_batch_size`: 16
+- `seed`: 42
+- `gradient_accumulation_steps`: 4
+- `total_train_batch_size`: 64
+- `optimizer`: Adam with `betas=(0.9,0.999)` and `epsilon=1e-08`
+- `lr_scheduler_type`: linear
+- `lr_scheduler_warmup_ratio`: 0.1
+- `num_epochs`: 5
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |   F1   |
+| :-----------: | :---: | :--: | :-------------: | :------: | :----: |
+|    0.5509     |  1.0  | 191  |     0.3685      |  0.9086  | 0.9131 |
+|    0.4543     |  2.0  | 382  |     0.3113      |  0.9247  | 0.9285 |
+|     0.409     |  3.0  | 573  |     0.2723      |  0.9372  | 0.9418 |
+|    0.3024     |  4.0  | 764  |     0.2786      |  0.9381  | 0.9417 |
+|    0.3103     |  5.0  | 955  |     0.2571      |  0.9386  | 0.9425 |
+## Disclaimer
+Do consider the biases which came from pre-training datasets that may be carried over into the results of this model.
+## Authors
+DistilWav2Vec2 XLS-R Adult/Child Speech Classifier was trained and evaluated by [Ananto Joyoadikusumo](https://anantoj.github.io/). All computation and development are done on Kaggle.
+## Framework versions
 - Transformers 4.17.0.dev0
 - Pytorch 1.10.2+cu102
 - Datasets 1.18.3
+- Tokenizers 0.11.0