Meaningless or repetitive sentences continue to be generated.
#105 opened 7 months ago
by
kbuwel
Audio input consists of only 3000. Short-form transcription is activated. no_speech_threshold is set to 0.5, but will be ignored.
#104 opened 7 months ago
by
rizwanishaq
transcribing Chinese audio with this model, the output text lacks punctuation
2
#103 opened 7 months ago
by
TaoGoblin
Issues with transcription of Burmese audio
#102 opened 7 months ago
by
hanshupe
How can I return both word-level and segment-level output together when using Hugging Face Transformers?
#101 opened 7 months ago
by
lilijy
Inconsistencies in timestamps between chunked and sequential forms
4
#100 opened 7 months ago
by
vprosadm
How can we use this model to achieve real-time trans?
4
#99 opened 7 months ago
by
Von-violet
How to get accuracy of transcription from the model?
5
#98 opened 8 months ago
by
Atulad
Module not found error. No module named GTTS.
#97 opened 8 months ago
by
BeyondStreamlit
Enhancing Pipeline with Speech Probability for Reduced Hallucination
#96 opened 8 months ago
by
rizwanishaq
Rename README.md to Persian
#95 opened 8 months ago
by
Zabs
Does OpenAI Whisper support the Urdu (Pakistan) language?
2
#94 opened 8 months ago
by
AamirFarooq
How to specify the language parameter on the Inference API
1
#93 opened 8 months ago
by
aitransync
Download and Load model on local system.
1
#92 opened 8 months ago
by
RebelloAlbina
How to save the loss value for each step during the training process?
2
#91 opened 8 months ago
by
zhouwen999
Auto speech/languages detection in Real Time streaming?
1
#89 opened 8 months ago
by
sabys
not transcribing to English
#88 opened 8 months ago
by
aitransync
Does it support Korean translation and speech? :)
#87 opened 8 months ago
by
JinPark
Transcribing Spanish audio
4
#86 opened 8 months ago
by
Andrews99
is it possible to access the transcript result in batches after each chunk has finished?
3
#85 opened 8 months ago
by
hanifanggawi
how to download, load, and use the model
1
#84 opened 8 months ago
by
r5avindra
[AUTOMATED] Model Memory Requirements
#83 opened 9 months ago
by
model-sizer-bot
Output Discrepancy
1
#82 opened 9 months ago
by
eBisw
Transcription and Translation In the same call
1
#81 opened 9 months ago
by
saalnlp
model in closed network
3
#78 opened 9 months ago
by
iamwhoiamm
Troubleshooting local inference on Whisper
2
#77 opened 9 months ago
by
RESOLVER101757
The new Longform transcription method
5
#76 opened 9 months ago
by
deep-intel
Upload vocab.json
#74 opened 9 months ago
by
smerchi
How to adapt for low resource language?
9
#73 opened 10 months ago
by
Imran1
Problems with AutoProcessor
#72 opened 10 months ago
by
Kwang442
Suddenly all my transcriptions are in English
10
#71 opened 10 months ago
by
WWCF
Transcription in different languages for Punjabi audio
#67 opened 10 months ago
by
jssaluja
I want to use Whisper on a piece of hardware
#66 opened 10 months ago
by
Avinier
How to fix "TypeError: expected str, bytes or os.PathLike object, not NoneType" when specifying the local whisper model
2
#65 opened 10 months ago
by
BenjaminChu
Speaker Embedding
2
#64 opened 10 months ago
by
bertrand-fournel
whisper jax diarization Icelandic
#62 opened 10 months ago
by
Dondada79
Translating English Audio Into Spanish Text
4
#61 opened 11 months ago
by
stvnchnsn
Error with word level timestamps - ValueError: set return_segments=True
5
#60 opened 11 months ago
by
dkincaid
Passing parameters to the model deployed on HF Inference Endpoints
3
#59 opened 11 months ago
by
dkincaid
Does whisper-large-v3 work on Sagemaker?
3
#58 opened 11 months ago
by
dkincaid
Which File Shall I Download from The Files and Versions
2
#57 opened 11 months ago
by
BenjaminChu
Transcribing multiple languages in single audio file
3
#56 opened 11 months ago
by
supercharge19
Is there any way we can get no_speech_probability from the pipeline?
1
#55 opened 11 months ago
by
rizwanishaq
how to handle input audio files with either white noise or general noise and no speech
2
#54 opened 11 months ago
by
unk1911
changed use_flash_attention_2=True to attn_implementation="flash_attention_2"
1
#53 opened 11 months ago
by
macadeliccc
Whisper parameters?
1
#52 opened 11 months ago
by
Megatron17
Whisper large v3 cannot recognize speech after fine-tuning
5
#51 opened 11 months ago
by
bardenthenry
how to download the large version?
1
#50 opened 11 months ago
by
Timnorth
I've noticed that Uyghur language is not available. Is it possible to add the Uyghur (ug) dataset from the Mozilla Foundation's Common Voice 13.0 for training?
2
#48 opened 11 months ago
by
almjanx
is parallel processing possible with DLC deployment?
#47 opened 12 months ago
by
SharatChandra