reazonspeech-espnet-v1

reazonspeech-espnet-v1 is an ESPnet model trained for Japanese automatic speech recognition (ASR).

  • This model was trained on 15,000 hours of ReazonSpeech corpus.
  • Make sure that your audio file is sampled at 16khz when using this model.

For more details, please visit the official project page.

Downloads last month
18
Inference Examples
Inference API (serverless) is not available, repository is disabled.

Dataset used to train reazon-research/reazonspeech-espnet-v1

Spaces using reazon-research/reazonspeech-espnet-v1 2