@RobertJene

⌚ Timestamps
0:00 - introduction
0:14 - Suno Bark
1:22 - Valle-X
3:00 - StyleTTS2
4:07 - CoquiTTS - XTTS
5:40 - Tortoise TTS

@SawunirMasunta

AI-generated writing looks way more natural after running it through Walter Writes AI Humanizer

@dontrez8412

Pretty good results. I didn't think that first Bark voice bad, though. Thanks for the comparisons.

@chaks2432

I built a GUI for XTTS using flask and svelte and finally got rvc running yesterday. Got inspired by your audiobok_maker, but it was missing some features I figured could be pretty useful (Like allowing users to edit text inside the GUI, add/delete/reorder lines), I'm pretty happy with the result, even if the UI looks like crap and it's still a little buggy. I also got everything to run together, so I don't need the ai-voice-cloning webUI running for it to work

@vikramr60

Coqui TTS is not open source, means it can't be used for commercial purposes,only for research and educational

@spiritual_audiobooks

A Open Source  local, fast neural text to speech system that sounds great is Piper TTS.

@bestcureremedies

Hi, could you make a video showing step by step how to install one of these open-source apps on a windows 11 computer and a demo how to use it once it is installed. Good work!

@nielsieboy19

From what I've seen StyleTTS does a much better job of cloning a voice, it's also an order of magnitude faster than Tortoise. Only thing holding it back are the absolutely mental VRAM requirements for training and multilingual models (which are being worked on by the community).

@poco7193

With tortoise TTS I have been issues with training it. I will upload my audio for training and go through the first two steps smoothly, when I actually try to run the training it freezes with some text then just never unfreezes no matter how long I wait. Also I was wanting to know what the 2nd software you were using in this video to make the tortoise tts sound smoother. I am trying to make a podcast for a school project and desperately need a smooth tts for some of my characters

@StudioPersimmon

Tortoise wins out, but I'm very interested in seeing more from Eleven's. From these examples, Coqui definitely seems to get the closest to your voice out the box, but the actual quality of the audio sounds very low. Is there a way to set the bitrate?

@ozerune

Could you do one of these for 2025? Curious to see how much the landscape has changed in the last year.

@Lenox-bp3lu

Which one of these did you use for the accent conversion at the start of this video? Please and thank you

@Jonathan_Dawson

Hi can you please make a tutorial of the Audiobook Maker, or how to create such pipelines? In particular you mentioned something along the lines of "RVCS" which seemed to make a dramatic difference in the last voice that you demonstrated! How is it done?

@RobertJene

6:15 - what do you mean, pipeline from Tortoise TTS to RVC? Like you train a model in Tortoise and then use it in RVC or something?

@stefanomonziocompagnoni8302

hi Jarods!
Nice video!
I'm looking for software (better if open source) that changes a recorded audio voice.
I mean, If I record my voice, I would like to use a different voice, keeping my prosody, tone, speed, etc....just changing the timbre.
Any advice?

@Edward_ZS

What option runs the fastest 

And do any of these work without a GPU

@moqingyu730

Hi, excellent work, have you found out which one is the fastest one?

@Raghav_Kapoor_

3:21 , how to use this voice ?

@Morpheus.999

i can't find the  audio sample at 3:21 , how could download this?

@trush1090

Hi Jarod can you make a video on how to resume training? Say I finished training at 50 epochs. How would I add 50 more without resetting.

Also how to eliminate static sounds from generated sounds. I trained 2 hours on 60 epochs just for it to have a static sound.