waveletdeboshir committed a33e071 (parent: 8c17c7e): Add usage example

README.md CHANGED

@@ -45,5 +45,30 @@ Model size is 15% less than original whisper-small:

You can fine-tune this model on your data to achieve better performance.

## Usage

The model can be used like the original Whisper:

```python
>>> from transformers import WhisperProcessor, WhisperForConditionalGeneration
>>> import torchaudio

>>> # load audio (Whisper expects 16 kHz input)
>>> wav, sr = torchaudio.load("audio.wav")

>>> # load model and processor
>>> processor = WhisperProcessor.from_pretrained("waveletdeboshir/whisper-small-ru-pruned")
>>> model = WhisperForConditionalGeneration.from_pretrained("waveletdeboshir/whisper-small-ru-pruned")

>>> input_features = processor(wav[0], sampling_rate=sr, return_tensors="pt").input_features

>>> # generate token ids
>>> predicted_ids = model.generate(input_features)
>>> # decode token ids to text
>>> transcription = processor.batch_decode(predicted_ids, skip_special_tokens=False)
['<|startoftranscript|><|ru|><|transcribe|><|notimestamps|> Начинаем работу.<|endoftext|>']
```
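If the audio file is not sampled at 16 kHz, it should be resampled before feature extraction (in practice with `torchaudio.functional.resample`). A dependency-free sketch of the underlying idea, using linear interpolation (illustrative only, not the library's implementation):

```python
# Naive linear-interpolation resampling, for illustration only;
# prefer torchaudio.functional.resample(wav, sr, 16000) in practice.
def resample_linear(samples, orig_sr, target_sr=16000):
    n_out = int(round(len(samples) * target_sr / orig_sr))
    out = []
    for i in range(n_out):
        # position of output sample i on the input index axis
        pos = i * (len(samples) - 1) / max(n_out - 1, 1)
        lo = int(pos)
        hi = min(lo + 1, len(samples) - 1)
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out

# a 1-second signal at 8 kHz becomes 16000 samples at 16 kHz
print(len(resample_linear([0.0] * 8000, orig_sr=8000)))  # 16000
```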

The context tokens can be removed from the start of the transcription by setting `skip_special_tokens=True`.
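For illustration, what `skip_special_tokens=True` effectively does to this output can be sketched by stripping the `<|...|>` markers with a regular expression (a simplification of the tokenizer's actual decoding):

```python
import re

raw = "<|startoftranscript|><|ru|><|transcribe|><|notimestamps|> Начинаем работу.<|endoftext|>"

# Drop every <|...|> special token, then trim the leading space,
# leaving only the transcribed text.
text = re.sub(r"<\|[^|]*\|>", "", raw).strip()
print(text)  # Начинаем работу.
```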

## Colab for pruning
TODO