AI-generated writing looks way more natural after running it through Walter Writes AI Humanizer
Pretty good results. I didn't think that first Bark voice bad, though. Thanks for the comparisons.
I built a GUI for XTTS using flask and svelte and finally got rvc running yesterday. Got inspired by your audiobok_maker, but it was missing some features I figured could be pretty useful (Like allowing users to edit text inside the GUI, add/delete/reorder lines), I'm pretty happy with the result, even if the UI looks like crap and it's still a little buggy. I also got everything to run together, so I don't need the ai-voice-cloning webUI running for it to work
Coqui TTS is not open source, means it can't be used for commercial purposes,only for research and educational
A Open Source local, fast neural text to speech system that sounds great is Piper TTS.
Hi, could you make a video showing step by step how to install one of these open-source apps on a windows 11 computer and a demo how to use it once it is installed. Good work!
From what I've seen StyleTTS does a much better job of cloning a voice, it's also an order of magnitude faster than Tortoise. Only thing holding it back are the absolutely mental VRAM requirements for training and multilingual models (which are being worked on by the community).
With tortoise TTS I have been issues with training it. I will upload my audio for training and go through the first two steps smoothly, when I actually try to run the training it freezes with some text then just never unfreezes no matter how long I wait. Also I was wanting to know what the 2nd software you were using in this video to make the tortoise tts sound smoother. I am trying to make a podcast for a school project and desperately need a smooth tts for some of my characters
Tortoise wins out, but I'm very interested in seeing more from Eleven's. From these examples, Coqui definitely seems to get the closest to your voice out the box, but the actual quality of the audio sounds very low. Is there a way to set the bitrate?
Could you do one of these for 2025? Curious to see how much the landscape has changed in the last year.
Which one of these did you use for the accent conversion at the start of this video? Please and thank you
Hi can you please make a tutorial of the Audiobook Maker, or how to create such pipelines? In particular you mentioned something along the lines of "RVCS" which seemed to make a dramatic difference in the last voice that you demonstrated! How is it done?
6:15 - what do you mean, pipeline from Tortoise TTS to RVC? Like you train a model in Tortoise and then use it in RVC or something?
hi Jarods! Nice video! I'm looking for software (better if open source) that changes a recorded audio voice. I mean, If I record my voice, I would like to use a different voice, keeping my prosody, tone, speed, etc....just changing the timbre. Any advice?
What option runs the fastest And do any of these work without a GPU
Hi, excellent work, have you found out which one is the fastest one?
3:21 , how to use this voice ?
i can't find the audio sample at 3:21 , how could download this?
Hi Jarod can you make a video on how to resume training? Say I finished training at 50 epochs. How would I add 50 more without resetting. Also how to eliminate static sounds from generated sounds. I trained 2 hours on 60 epochs just for it to have a static sound.
@RobertJene