WaveGRU-Text-To-Speech / wavegru.yaml
NTT123
new 1024 gru unit model
012ab0b
raw
history blame
256 Bytes
## dsp
sample_rate : 24000
window_length: 50.0 # ms
hop_length: 12.5 # ms
mel_min: 1.0e-5 ## need .0 to make it a float
mel_dim: 80
n_fft: 2048
## wavegru
embed_dim: 32
rnn_dim: 1024
frames_per_sequence: 67
num_pad_frames: 62
upsample_factors: [5, 3, 20]