openai/whisper-large-v3

#103 opened 7 months ago by

TaoGoblin

Issues with transcription of Burmese audio

#102 opened 7 months ago by

hanshupe

How can i return both word level and segment together when using hugging face transformer?

#101 opened 7 months ago by

lilijy

Incoherences in timestamps between chunked and sequential form

#100 opened 7 months ago by

vprosadm

How we can use this model to achieve a real-time trans？

#99 opened 7 months ago by

Von-violet

How to get accuracy of transcription from the model?

#98 opened 8 months ago by

Atulad

Module not found error. No module named GTTS.

#97 opened 8 months ago by

BeyondStreamlit

Enhancing Pipeline with Speech Probability for Reduced Hallucination

#96 opened 8 months ago by

rizwanishaq

Rename README.md to persian

#95 opened 8 months ago by

Zabs

OpenAI Whisper Support Urdu (Pakistan) language?

#94 opened 8 months ago by

AamirFarooq

How to mention lanuage parameter on inference api

#93 opened 8 months ago by

aitransync

Download and Load model on local system.

#92 opened 8 months ago by

RebelloAlbina

How to save the loss value for each step during the training process?

#91 opened 8 months ago by

zhouwen999

Auto speech/languages detection in Real Time streaming?

#89 opened 8 months ago by

sabys

not transcribing to english

#88 opened 8 months ago by

aitransync

Does it support Korean translation and speech? :)

#87 opened 8 months ago by

JinPark

Transcript an Spanish audio

#86 opened 8 months ago by

Andrews99

is it possible to access the transcript result in batches after each chunk has finished?

#85 opened 8 months ago by

hanifanggawi

how to download model and load model and use it

#84 opened 8 months ago by

r5avindra

[AUTOMATED] Model Memory Requirements

#83 opened 9 months ago by

model-sizer-bot

Output Discrepancy

#82 opened 9 months ago by

eBisw

Transcription and Translation In the same call

#81 opened 9 months ago by

saalnlp

model in closed network

#78 opened 9 months ago by

iamwhoiamm

Trouble shooting local inference on whisper

#77 opened 9 months ago by

RESOLVER101757

The new Longform transcription method

#76 opened 9 months ago by

deep-intel

Upload vocab.json

#74 opened 9 months ago by

smerchi

How to adapt for low resource language?

9

#73 opened 10 months ago by

Imran1

Problems with AutoProcessor

#72 opened 10 months ago by

Kwang442

Suddenly all my transcriptions are in English

10

#71 opened 10 months ago by

WWCF

Transcription in different languages for Punjabi audio

#67 opened 10 months ago by

jssaluja

I want to use Whisper on a piece of hardware

#66 opened 10 months ago by

Avinier

How to fix "TypeError: expected str, bytes or os.PathLike object, not NoneType" when specifying the local whisper model

#65 opened 10 months ago by

BenjaminChu

Speaker Embedding

#64 opened 10 months ago by

bertrand-fournel

whisper jax diarization Icelandic

#62 opened 10 months ago by

Dondada79

Translating English Audio Into Spanish Text

#61 opened 11 months ago by

stvnchnsn

Error with word level timestamps - ValueError: set return_segments=True

#60 opened 11 months ago by

dkincaid

Passing parameters to the model deployed on HF Inference Endpoints

#59 opened 11 months ago by

dkincaid

Does whisper-large-v3 work on Sagemaker?

#58 opened 11 months ago by

dkincaid

Which File Shall I Download from The Files and Versions

#57 opened 11 months ago by

BenjaminChu

Transcribing multiple languages in single audio file

#56 opened 11 months ago by

supercharge19

Is there any way we can get no_speech_probability from the pipeline?

#55 opened 11 months ago by

rizwanishaq

how to handle input audio files with either white noise or general noise and no speech

#54 opened 11 months ago by

unk1911

changed use_flash_attention_2=True to attn_implementation="flash_attention_2"

#53 opened 11 months ago by

macadeliccc

Whisper parameters?

#52 opened 11 months ago by

Megatron17

Whisper large v3 can not recognize speech after Fine-Tune

#51 opened 11 months ago by

bardenthenry

how download large version?

#50 opened 11 months ago by

Timnorth

I've noticed that Uyghur language is not available. Is it possible to add the Uyghur (ug) dataset from the Mozilla Foundation's Common Voice 13.0 for training?