ydshieh
/

wav2vec2-large-xlsr-53-chinese-zh-cn-gpt

Automatic Speech Recognition

xlsr-fine-tuning-week

Inference Endpoints

Model card Files Files and versions Community

ydshieh HF staff commited on Mar 29, 2021

Commit

ef8e371

•

1 Parent(s): f4a9497

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -74,7 +74,7 @@ The model can be evaluated as follows on the zh-CN test data of Common Voice.
 Original CER calculation refer to https://huggingface.co/ctl/wav2vec2-large-xlsr-cantonese
 ```python
-!pip install jiwer
 import torch
 import torchaudio
@@ -114,7 +114,7 @@ processor = Wav2Vec2Processor.from_pretrained("ydshieh/wav2vec2-large-xlsr-53-ch
 model = Wav2Vec2ForCTC.from_pretrained("ydshieh/wav2vec2-large-xlsr-53-chinese-zh-cn-gpt")
 model.to("cuda")
-chars_to_ignore_regex = '[\\\\,\\\\?\\\\.\\\\!\\\\-\\\\;\\\\:"\\\\“\\\\%\\\\‘\\\\”\\\\�\\\\．\\\\⋯\\\\！\\\\－\\\\：\\\\–\\\\。\\\\》\\\\,\\\\）\\\\,\\\\？\\\\；\\\\～\\\\~\\\\…\\\\︰\\\\，\\\\（\\\\」\\\\‧\\\\《\\\\﹔\\\\、\\\\—\\\\／\\\\,\\\\「\\\\﹖\\\\·\\\\×\\\\̃\\\\̌\\\\ε\\\\λ\\\\μ\\\\и\\\\т\\\\─\\\\□\\\\〈\\\\〉\\\\『\\\\』\\\\ア\\\\オ\\\\カ\\\\チ\\\\ド\\\\ベ\\\\ャ\\\\ヤ\\\\ン\\\\・\\\\丶\\\\ａ\\\\ｂ\\\\ｆ\\\\ｇ\\\\ｉ\\\\ｎ\\\\ｐ\\\\ｔ' + "\\\\']"
 resampler = torchaudio.transforms.Resample(48_000, 16_000)

 Original CER calculation refer to https://huggingface.co/ctl/wav2vec2-large-xlsr-cantonese
 ```python
+# pip install jiwer
 import torch
 import torchaudio
 model = Wav2Vec2ForCTC.from_pretrained("ydshieh/wav2vec2-large-xlsr-53-chinese-zh-cn-gpt")
 model.to("cuda")
+chars_to_ignore_regex = '[\\\\\\\\,\\\\\\\\?\\\\\\\\.\\\\\\\\!\\\\\\\\-\\\\\\\\;\\\\\\\\:"\\\\\\\\“\\\\\\\\%\\\\\\\\‘\\\\\\\\”\\\\\\\\�\\\\\\\\．\\\\\\\\⋯\\\\\\\\！\\\\\\\\－\\\\\\\\：\\\\\\\\–\\\\\\\\。\\\\\\\\》\\\\\\\\,\\\\\\\\）\\\\\\\\,\\\\\\\\？\\\\\\\\；\\\\\\\\～\\\\\\\\~\\\\\\\\…\\\\\\\\︰\\\\\\\\，\\\\\\\\（\\\\\\\\」\\\\\\\\‧\\\\\\\\《\\\\\\\\﹔\\\\\\\\、\\\\\\\\—\\\\\\\\／\\\\\\\\,\\\\\\\\「\\\\\\\\﹖\\\\\\\\·\\\\\\\\×\\\\\\\\̃\\\\\\\\̌\\\\\\\\ε\\\\\\\\λ\\\\\\\\μ\\\\\\\\и\\\\\\\\т\\\\\\\\─\\\\\\\\□\\\\\\\\〈\\\\\\\\〉\\\\\\\\『\\\\\\\\』\\\\\\\\ア\\\\\\\\オ\\\\\\\\カ\\\\\\\\チ\\\\\\\\ド\\\\\\\\ベ\\\\\\\\ャ\\\\\\\\ヤ\\\\\\\\ン\\\\\\\\・\\\\\\\\丶\\\\\\\\ａ\\\\\\\\ｂ\\\\\\\\ｆ\\\\\\\\ｇ\\\\\\\\ｉ\\\\\\\\ｎ\\\\\\\\ｐ\\\\\\\\ｔ' + "\\\\\\\\']"
 resampler = torchaudio.transforms.Resample(48_000, 16_000)