Yanis L PRO

Pendrokar

AI & ML interests

STT/STS/TTS you know, something that is solveable

Recent Activity

updated a dataset 3 minutes ago
Pendrokar/TTS_Arena
updated a dataset 13 minutes ago
Pendrokar/TTS_Arena
updated a dataset about 1 hour ago
Pendrokar/TTS_Arena

Organizations

Pendrokar's activity

reacted to hexgrad's post with ๐Ÿ”ฅ 5 days ago
posted an update 6 days ago
view post
Post
778
Added @amphion MaskGCT & @hexgrad StyleTTS fine tuned model by the name of kokoro to the forked TTS Arena Space. If things keep up from what is seen in the preliminary results, then these two may end up in the TOP 5 of all TTS models. ๐Ÿคž๏ธ๐Ÿ€๏ธ

Pendrokar/TTS-Spaces-Arena
Svngoku/maskgct-audio-lab
hexgrad/Kokoro-TTS

I chose @Svngoku 's forked HF space over amphion's due to the overly high ZeroGPU duration demand on the latter. 300s!

amphion/maskgct

Had to remove @mrfakename 's MetaVoice-1B Space from the available models as that space has been down for quite some time. ๐Ÿค•๏ธ

mrfakename/MetaVoice-1B-v0.1

I'm close to syncing the code to the original Arena's code structure. Then I'd like to use ASR in order to validate and create synthetic public datasets from the generated samples. And then make the Arena multilingual, which will surely attract quite the crowd!
  • 1 reply
ยท
reacted to mrfakename's post with ๐Ÿ‘ 30 days ago
view post
Post
4433
I just released an unofficial demo for Moonshine ASR!

Moonshine is a fast, efficient, & accurate ASR model released by Useful Sensors. It's designed for on-device inference and licensed under the MIT license!

HF Space (unofficial demo): mrfakename/Moonshine
GitHub repo for Moonshine: https://github.com/usefulsensors/moonshine
replied to their post about 1 month ago
replied to their post about 1 month ago
posted an update about 1 month ago
view post
Post
628
How the ๐Ÿ—ฃ๐Ÿ† leaderboard of a merged TTS Arena with the ๐Ÿค— Spaces fork would look like. These results are somewhat unreliable as some models have not challenged the other in the list. And the original TTS Arena used only narration type sentences.
  • 2 replies
ยท
posted an update about 1 month ago
view post
Post
1362
Made a notable change to the TTS Arena fork. I do not think anyone is interested in which bottomfeeder TTS is better than another beside it. So one of the top 5 TTS is always chosen in a challenge for more scrutiny. Also these top 5 are taken from preliminary results.
Pendrokar/TTS-Spaces-Arena
reacted to mrfakename's post with ๐Ÿ‘ 6 months ago