Pendrokar/xvapitch

speech-to-speech

voice conversion


Pendrokar committed on Aug 31

Commit 20c3e6b • 1 Parent(s): 460e46a

repo link; papers

Files changed (1)

README.md +8 -0

@@ -37,6 +37,8 @@ tags:
 pipeline_tag: text-to-speech
 ---
+GitHub project: https://github.com/DanRuta/xVA-Synth
 The base model for training other xVASynth's "xVAPitch" type models (v3). Model itself used by the xVATrainer TTS model training app. All created by Dan Ruta.
 When used in xVASynth editor, it is an American Adult Male voice. Default pacing is too fast and has to be adjusted.
@@ -46,4 +48,10 @@ xVAPitch_5820651 model sample: <audio controls>
   Your browser does not support the audio element.
 </audio>
+xVAPitch model referenced Papers:
+- Multi-head attention with Relative Positional embedding - https://arxiv.org/pdf/1809.04281.pdf
+- Transformer with Relative Positional Encoding - https://arxiv.org/abs/1803.02155
+- SDP - https://arxiv.org/pdf/2106.06103.pdf
+- Spline Flow - https://arxiv.org/abs/1906.04032
 Used datasets: Unknown/Non-permissible data