Inference on fine-tuned whisper-large-v3 is not working, but works on the pre-trained model and whisper-medium
#169 opened 2 days ago
by
ivabojic
🚩 Report: Not working
#168 opened 7 days ago
by
ednsinf
Speech recognition broken down by speakers
3
#167 opened 24 days ago
by
tur0kmag
Fine-tuning Whisper large on Google Colab: Pro or Pro+, which one is best?
1
#166 opened about 1 month ago
by
andromeda01111
The Chinese training data of the model is contaminated
1
#165 opened about 1 month ago
by
bookwoods123
Specify language for transcribing with HuggingFace API
7
#164 opened about 1 month ago
by
mikealexx
Isolate search for a single language
#163 opened about 1 month ago
by
edyrkaj
Whisper-GUI
#162 opened about 1 month ago
by
PrensCin
How to get SRT file as output
#161 opened about 1 month ago
by
dinesh-001
whisper large v3 turbo
2
#160 opened about 2 months ago
by
deepdml
Missing spaces between chunks in longform fine tune outputs & importance of tokenizer.json
#159 opened about 2 months ago
by
saj-bot
Update README.md
1
#158 opened about 2 months ago
by
Ironajijiul11
What is the precision of the model? Why can't I fine-tune when I set the precision to FP16?
1
#157 opened 2 months ago
by
chengligen
Help for absolute AI beginners
1
#156 opened 2 months ago
by
TobiasKuch
Single-word transcription for an audio file with ~1.5M frames
#155 opened 2 months ago
by
KevalRx
How to get the n-best list generated by Whisper
#153 opened 3 months ago
by
louisguo
Hugging Face Model Deployment and Library Dependency Issues
#152 opened 3 months ago
by
NeuraFusionAI
Issue - Internal Server Error (Serverless API)
#151 opened 3 months ago
by
tushar310
How much GPU memory do I need to fine-tune large-v3?
4
#150 opened 3 months ago
by
lanejohn
Better output in INT8
1
#149 opened 3 months ago
by
aney
How to convert the model (whisper-small) to a .pt file (small.pt)?
4
#146 opened 3 months ago
by
lihenan1996
How to get the same output format from the pipeline as we get from OpenAI Whisper?
#145 opened 4 months ago
by
aheed911
Friends
1
#142 opened 4 months ago
by
monir2006
Add Urdu (Pakistan) language speech to text detection
1
#141 opened 4 months ago
by
AamirFarooq
whisper segments
#138 opened 4 months ago
by
world-of-ai
Git repository or how-to instructions for downloading and using the model
#137 opened 5 months ago
by
J-PROGRAMMER
Rename README.md to wangdaoqi
#136 opened 5 months ago
by
dqoqi
Hyperparams optimization with LoRA on Whisper
#135 opened 5 months ago
by
luigimontaleone
Rename README.md to Quaranthuit
1
#134 opened 5 months ago
by
Etaduri
Update README.md
#133 opened 5 months ago
by
MMXXMM
Is it possible to set/output segments as in OpenAI's API? For example avg_threshold, temperature, compression_ratio.
#132 opened 5 months ago
by
rodosabbath
list index out of range with word level timestamps
3
#131 opened 5 months ago
by
mkvbn
YOU ARE HYPOCRITES!
3
#129 opened 5 months ago
by
jawad1347
Set temperature and prompt possible?
1
#128 opened 5 months ago
by
jeffuli755
Issue when trying to run Whisper offline from locally saved pretrained model
3
#125 opened 6 months ago
by
georgis-agent
Best strategy for inference on multiple GPUs
#124 opened 6 months ago
by
symdec
"Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained."
#123 opened 6 months ago
by
dwiraamadhan
Issue with inference endpoints
1
#122 opened 6 months ago
by
simon5454
Update README.md
1
#120 opened 6 months ago
by
KashaSasha
Upload 28 May, 15.58_.aac
#119 opened 6 months ago
by
lenaelena
Browse to upload file results in same default file, cannot upload audio file on demo page
#118 opened 6 months ago
by
BladedSupernova
Update README.md
#117 opened 6 months ago
by
ochoseistres
KeyError: 'whisper'
1
#116 opened 6 months ago
by
aiyaqingzheng
The meaning of some special token fields
#115 opened 6 months ago
by
tlain
How to transcribe hundreds of local audio files at once?
1
#114 opened 6 months ago
by
myspace-ai
Error: ffmpeg was not found but is required to load audio files from filename
3
#113 opened 6 months ago
by
Vladmir1235432
Improving reliability, using prompt parameter
#111 opened 7 months ago
by
dcd12345678