@RobertJene

⌚ Timestamps
0:00 - introduction
0:14 - Suno Bark
1:22 - Valle-X
3:00 - StyleTTS2
4:07 - CoquiTTS - XTTS
5:40 - Tortoise TTS

@SawunirMasunta

AI-generated writing looks way more natural after running it through Walter Writes AI Humanizer

@dontrez8412

Pretty good results. I didn't think that first Bark voice bad, though. Thanks for the comparisons.

@chaks2432

I built a GUI for XTTS using flask and svelte and finally got rvc running yesterday. Got inspired by your audiobok_maker, but it was missing some features I figured could be pretty useful (Like allowing users to edit text inside the GUI, add/delete/reorder lines), I'm pretty happy with the result, even if the UI looks like crap and it's still a little buggy. I also got everything to run together, so I don't need the ai-voice-cloning webUI running for it to work

@vikramr60

Coqui TTS is not open source, means it can't be used for commercial purposes,only for research and educational

@spiritual_audiobooks

A Open Source  local, fast neural text to speech system that sounds great is Piper TTS.

@Edward_ZS

What option runs the fastest 

And do any of these work without a GPU

@StudioPersimmon

Tortoise wins out, but I'm very interested in seeing more from Eleven's. From these examples, Coqui definitely seems to get the closest to your voice out the box, but the actual quality of the audio sounds very low. Is there a way to set the bitrate?

@damon_81

many of these voices are fantastic!

@sergialbert97

Jarods just for my benefit, when you apply RVC to some of these. First you do a voice cloning with for example XTTS, and then apply RVC or directly u use one default voice that has a similar tone and apply the RVC. Or maybe ir better apply finetuning and then RVC. Thanks for your videos mate!

@Raghav_Kapoor_

3:21 , how to use this voice ?

@ozerune

Could you do one of these for 2025? Curious to see how much the landscape has changed in the last year.

@dohyunio

Could you include your mic in your hardware list?
Great vid!

@poco7193

With tortoise TTS I have been issues with training it. I will upload my audio for training and go through the first two steps smoothly, when I actually try to run the training it freezes with some text then just never unfreezes no matter how long I wait. Also I was wanting to know what the 2nd software you were using in this video to make the tortoise tts sound smoother. I am trying to make a podcast for a school project and desperately need a smooth tts for some of my characters

@Jonathan_Dawson

Hi can you please make a tutorial of the Audiobook Maker, or how to create such pipelines? In particular you mentioned something along the lines of "RVCS" which seemed to make a dramatic difference in the last voice that you demonstrated! How is it done?

@Lenox-bp3lu

Which one of these did you use for the accent conversion at the start of this video? Please and thank you

@zonas7915

A video on how to train a model would be great, like the best settings etc

@bestcureremedies

Hi, could you make a video showing step by step how to install one of these open-source apps on a windows 11 computer and a demo how to use it once it is installed. Good work!

@TerrennonPriv

By the way, thanks you Jarod, update on my side, for my lore my language project. xtts was the way to go and I'm happy with the results.

@nielsieboy19

From what I've seen StyleTTS does a much better job of cloning a voice, it's also an order of magnitude faster than Tortoise. Only thing holding it back are the absolutely mental VRAM requirements for training and multilingual models (which are being worked on by the community).