Update about.md
about.md
CHANGED
@@ -225,8 +225,8 @@ This version is tailored for the Catalan language, as it was trained only on Cat
 ## Adaptation to Catalan
 
 The original Matcha-TTS model excels in English, but to adapt it to Catalan, we have carried out a multi-stage process.
-First, we fine-tuned the English to
-The selection of this small set of samples has been performed automatically using the UTMOS system, a predictor of values of the metric Mean Opinion Score (MOS) a score usually set by human evaluators according to their subjective perception of speech quality.
+First, we fine-tuned the English to Catalan model by creating a Matxa-base, using a 100h subset of the [CommonVoice](https://commonvoice.mozilla.org/es/datasets) v.16 Catalan database.
+The selection of this small set of samples has been performed automatically using the [UTMOS](https://arxiv.org/abs/2204.02152) system, a predictor of values of the metric Mean Opinion Score (MOS) a score usually set by human evaluators according to their subjective perception of speech quality.
 
 Then we further fine-tuned the single accent Catalan Matxa-based model with the soon to be published LaFrescat dataset that has 3.5 hours of recordings for four dialectal variants:
 