this post was submitted on 25 Feb 2025
38 points (100.0% liked)

Open Source

832 readers
39 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 5 years ago
MODERATORS
 

I would be plus if it has a simple CLI or GUI.

top 13 comments
sorted by: hot top controversial new old
[–] Guenther_Amanita@slrpnk.net 9 points 1 week ago (1 children)
[–] Trent@lemmy.ml 6 points 1 week ago (1 children)

I use piper TTS. Probably not as good as the fancy AI APIs, but it's all local and runs from command line and is good enough for my purposes. YMMV.

[–] Neptr@lemmy.blahaj.zone 2 points 6 days ago (1 children)
[–] Tundra@lemmy.ml 1 points 6 days ago (1 children)

I was disappointed with this at first, until I loaded the "Cori" voiceset. It outshines the others

[–] Neptr@lemmy.blahaj.zone 1 points 6 days ago

The ones I liked the most was Kusal and Lessac.

[–] Xanza@lemm.ee 4 points 6 days ago (1 children)

Depends on your setup, but generally I recommend: https://github.com/SYSTRAN/faster-whisper

If you have an available GPU for processing it's insanely quick and better than OpenAI's whisper.

[–] octochamp@lemmy.ml 7 points 6 days ago

this is speech-to-text! OP is looking for text-to-speech.

[–] sp3ctre@feddit.org 4 points 6 days ago

F5-TTS. Only needs 15 seconds of reference audio and you're good to go.

[–] sonalder@lemmy.ml 3 points 6 days ago

I think Bark from Suno is quite good : https://github.com/suno-ai/bark

[–] BuboScandiacus@mander.xyz 2 points 1 week ago
[–] 4oreman@lemy.lol 2 points 6 days ago

speechnote by kde

[–] ililiililiililiilili@lemm.ee 1 points 6 days ago