Free neural text to speech with emotion!


Brilliant piece of info.
I still have some questions: did you just save the custom audio clips as .ogg files and play them on an AudioSource, or did you use something else? I think we need another guide on how to implement this in the game with model emotions.
Bob Nothing
Typically, I write up an entire script with various emotions used. (You can even use different emotions within the same line.) Then I export .wav files and select the 'each paragraph is its own file' option. Then, in animation triggers, I use HeadAudio on the Person atom to play each audio file with the RT Lip Sync plugin. It works really well. For a back-and-forth conversation, I'll set my first audio file to play 5 seconds in, for example, hit play, then hit stop once the audio file is done playing. Then I hit play again for a second or so, then stop. Then on the next audio trigger I click the 'current time' button, and so on, so the audio spacing is correct.
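The play/stop/'current time' routine above works out to simple arithmetic: each clip starts where the previous one ended, plus a short pause. If you would rather type the times in than find them by ear, here is a rough sketch that reads the exported .wav files and prints a start time for each. It is not part of the workflow described above; the function names, the 5 second lead-in, and the 1 second gap are my own assumptions, and it only reads plain PCM .wav files.

```python
import wave
from pathlib import Path


def clip_seconds(path):
    """Return the duration of a PCM .wav file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def trigger_times(durations, first_start=5.0, gap=1.0):
    """Start time of each clip: previous start + previous length + gap.

    first_start and gap are illustrative defaults, not values from any tool.
    """
    times = []
    current = first_start
    for length in durations:
        times.append(round(current, 3))
        current += length + gap
    return times


def schedule(folder, first_start=5.0, gap=1.0):
    """Pair each exported .wav (sorted by file name) with its start time."""
    files = sorted(Path(folder).glob("*.wav"))
    starts = trigger_times([clip_seconds(f) for f in files], first_start, gap)
    return list(zip((f.name for f in files), starts))


if __name__ == "__main__":
    # "exports" is a hypothetical folder holding one .wav per paragraph.
    for name, start in schedule("exports"):
        print(f"{start:8.3f}s  {name}")
```

The printed times can then be entered on the audio triggers by hand. This assumes the export names the files so that they sort in script order.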
Excellent!
Thanks for the tip!
Great guide. I'm going to try it right now! I've been using ttsfree.com. They source from various places (IBM, Microsoft, etc., I think). They have a pretty good number of voices but no emotion/inflection options. I agree 100%: voiced scenes are much better than speech bubbles, and it's incredible how good the voices are these days. Death to speech bubbles!!!
After going through all those sign-ups, I realized I could just record the voices from the demo page into my DAW and make my own audio files for the same result. Thanks for sharing this, it's really gonna make my scenes more interesting!
Thanks so much for pointing this out!
Now my girls never stop talking ;)

I hope they add emotions to languages other than US English soon, since my girls are from Germany, lol.

Do you know if there’s some kind of community sharing custom voices?

Anyway, you are my hero!
Bob Nothing
Well, hopefully I'm about to make your day. Some of the US English voices are multilingual. There is a "Jenny multilingual" voice, for example, that speaks German. Also, if it helps, you can give the English-speaking voices that aren't multilingual a German accent.
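If you script your lines as SSML instead of clicking through the editor, the language switch and the emotion styles look roughly like this. Treat it as a sketch under assumptions: the helper `build_ssml` is made up for illustration, the voice names (`en-US-JennyNeural`, `en-US-JennyMultilingualNeural`) and the styles (`cheerful`, `sad`) are ones I believe Azure offers, and not every voice accepts every style or language, so check the voice list before relying on it.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(voice, segments, base_lang="en-US"):
    """Build an SSML string for one voice.

    segments is a list of (text, style, lang) tuples; style and lang may be
    None. Whether a given voice supports a style or language is NOT checked.
    """
    parts = []
    for text, style, lang in segments:
        body = escape(text)
        if lang:
            # Switch the spoken language for this piece of text.
            body = f"<lang xml:lang={quoteattr(lang)}>{body}</lang>"
        if style:
            # Apply an emotion/speaking style to this piece of text.
            body = (
                f"<mstts:express-as style={quoteattr(style)}>"
                f"{body}</mstts:express-as>"
            )
        parts.append(body)
    inner = "".join(parts)
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        'xmlns:mstts="https://www.w3.org/2001/mstts" '
        f"xml:lang={quoteattr(base_lang)}>"
        f"<voice name={quoteattr(voice)}>{inner}</voice>"
        "</speak>"
    )


if __name__ == "__main__":
    # Two emotions within the same line (assumed style names).
    print(build_ssml("en-US-JennyNeural", [
        ("I can't believe you came! ", "cheerful", None),
        ("I thought you had forgotten.", "sad", None),
    ]))
    # A multilingual voice speaking German.
    print(build_ssml("en-US-JennyMultilingualNeural", [
        ("Schön, dich zu sehen.", None, "de-DE"),
    ]))
```

The text is escaped, so characters like & or < in a line won't break the markup.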
Thank you for taking the time to make a detailed tutorial. I've been looking for a solution like this for a long time. Google offers a similar function, but Microsoft wasn't on my radar. Thanks for that!
Bob Nothing
I haven't tested Google's yet, but Azure is light years ahead of AWS Polly, it's free, and there are a lot more options in voices plus emotions. Honestly it crushes AWS. I'm really looking forward to seeing some content from you that uses it! :-)
Thanks for the guide! Microsoft Azure's text to speech is very realistic. Not robot-like at all.
Bob Nothing
I agree, it's the best I've tested. I was using AWS Polly for a bit since I had a free year to try it out. It was only costing me like $0.35 a month with my usage, but I thought, hey, why not see if there's anything else out there that's free, and the Azure stuff is SOOOO much better!
God, I wish this could be standalone/portable.
Bob Nothing
Well, once you generate your audio files, they are standalone/portable. It creates audio files that you download and then use in VAM. Via the API it could be done in real time/live, but I doubt anyone would want to front that, and I don't really see the advantage.
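For anyone curious what the API route would involve, here is a rough sketch of a request to Azure's text to speech REST endpoint, as far as I understand it. I have not wired this into VAM. The endpoint pattern, header names, and output format are taken from my reading of the Azure docs and should be double-checked there; the function names, the `User-Agent` value, the region, and the key are placeholders.

```python
import urllib.request


def build_tts_request(ssml, region, key,
                      output_format="riff-24khz-16bit-mono-pcm"):
    """Prepare (but do not send) a text to speech request.

    region and key come from your own Azure Speech resource.
    """
    url = f"https://{region}.tts.speech.microsoft.com/cognitiveservices/v1"
    headers = {
        "Ocp-Apim-Subscription-Key": key,
        "Content-Type": "application/ssml+xml",
        "X-Microsoft-OutputFormat": output_format,
        "User-Agent": "vam-tts-sketch",  # placeholder application name
    }
    return urllib.request.Request(
        url, data=ssml.encode("utf-8"), headers=headers, method="POST"
    )


def synthesize_to_file(request, out_path):
    """Send the request and write the returned audio bytes to out_path."""
    with urllib.request.urlopen(request, timeout=30) as response:
        audio = response.read()
    with open(out_path, "wb") as out_file:
        out_file.write(audio)


if __name__ == "__main__":
    ssml = (
        '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        'xml:lang="en-US"><voice name="en-US-JennyNeural">Hello there.'
        "</voice></speak>"
    )
    # "YOUR_REGION" and "YOUR_KEY" are placeholders; nothing is sent here.
    req = build_tts_request(ssml, "YOUR_REGION", "YOUR_KEY")
    print(req.full_url)
```

Building the request is kept separate from sending it, so you can inspect it without spending any quota. Every call to `synthesize_to_file` counts against your usage.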
Definitely interesting, I hope creators use this kind of tech.
Bob Nothing
Agreed, that's my entire purpose in posting this :-)
Very useful guide. Do you have a video of any sample scenes where you've used this?
Bob Nothing
I do actually, I have a sample video I made using the above script, but it's close to 700MB (4k) and I'm not sure how to upload it to share on the hub... So, if you want to point me in the right direction, I'll get it up.
Heheheheehee
I've tried some options before, and it's always difficult to get the right tones for, hmmm... VAM stuff... but this looks really interesting. I'll need to test it more deeply.
Thanks for the tip.
Bob Nothing
My pleasure!