Spaces:

projecte-aina
/

matxa-alvocat-tts-ca

Running

AlexK-PL commited on Jun 13

Commit

d8166a8

•

1 Parent(s): efd7fee

Update about.md

Files changed (1) hide show

about.md CHANGED Viewed

@@ -225,8 +225,8 @@ This version is tailored for the Catalan language, as it was trained only on Cat
 ## Adaptation to Catalan
 The original Matcha-TTS model excels in English, but to adapt it to Catalan, we have carried out a multi-stage process.
-First, we fine-tuned the English to Central Catalan model by creating a Matxa-base, using a 100h subset of the CommonVoice v.16 Catalan database.
-The selection of this small set of samples has been performed automatically using the UTMOS system, a predictor of values of the metric Mean Opinion Score (MOS) a score usually set by human evaluators according to their subjective perception of speech quality.
 Then we further fine-tuned the single accent Catalan Matxa-based model with the soon to be published LaFrescat dataset that has 3.5 hours of recordings for four dialectal variants:

 ## Adaptation to Catalan
 The original Matcha-TTS model excels in English, but to adapt it to Catalan, we have carried out a multi-stage process.
+First, we fine-tuned the English to Catalan model by creating a Matxa-base, using a 100h subset of the [CommonVoice](https://commonvoice.mozilla.org/es/datasets) v.16 Catalan database.
+The selection of this small set of samples has been performed automatically using the [UTMOS](https://arxiv.org/abs/2204.02152) system, a predictor of values of the metric Mean Opinion Score (MOS) a score usually set by human evaluators according to their subjective perception of speech quality.
 Then we further fine-tuned the single accent Catalan Matxa-based model with the soon to be published LaFrescat dataset that has 3.5 hours of recordings for four dialectal variants: