I am a Text to Speech VTuber and this is how I do it! [UPDATED VIDEO] Speak like Zentreya!
Round 2 of explaining how I do my TTS Setup! I ... I at least try my best at giving you an insight because oh boy ... I am not good at explaining stuff, sorry!! 💦💦💦
This is the NEW version of my little guide! I completely switched to VRC STT now and I also dig a bit deeper into my setup. There ya go!
Apps I use:
vrcstt.com/ VRC STT
obsproject.com/ OBSdenchisoft.com/ VTube Studio
vb-audio.com/Voicemeeter/bana... Voicemeeter Banana#vtuber #vtubers #envtuber #ttsvtuber #texttospeech #vtubestudio #obs #voicemeeter #tutorial #guide #howto
++++++++++++++++++++++++++++Lyro is an android VTuber and the host of the jollyrose channel. He does cozy art and gaming streams!Fantag: #AntennArmy++++++++++++++++++++++++++++Character Design, Model Art & Rig by jollyroseTwitch ( / jollyrose)twitter ( / jollyrose_art)lyro Twitter ( / lyro_vt)++++++++++++++... BGM by @kuwanano_vt
This was a great explanation! Even if I don't plan on doing this myself, it's really interesting to see how the VTubers I watch do it.
For three days I've been looking for a way to use A.I Voices in OBS. I can't believe this is where I found the solution. Thank you.
Wow, your content is absolutely amazing! I love how you engage your audience with informative and entertaining videos. Your passion and enthusiasm shine through every video you create. Keep up the fantastic work and continue inspiring us with your valuable content. You've definitely earned a loyal subscriber here. Looking forward to more incredible videos from you. Keep rocking! 👍😊
I began using this to talk to some of my friends even though I have Selective Mutism, so its really nice
that's so cool to hear that this helps you to communicate ❤️ the voice I use is AWS Justin, but pitched down ✌️
this was a nice cute little informative video, your little quips were plenty fun to listen to also^^
Thank you for making this Lyro, I wanna be vtuber but am not confident in my voice right now tts will really help me i think
I was really worried because i totally dislike my voice but this video helped me a lot to understand and how to set up for TTS It also helps that i had a similar robot concept for my Vtuber so a TTS voice would fit perfectly, thank you for the tutorial and i believe the tutorial is very well explained
I think you did a great job at explaining it! To be fair I already have a deep understanding of voicemeeter and OBS, but I would hope that many people would be able to follow along. Looking forward to seeing more jollyrose
thank you so much for making this, i actually tried to use rvc for this and generated an ai voice based on samples of the microsoft voice but i would have had to buy a whole new graphics card just to talk in real time. this is a lot cheaper and will get me a way better result!
Super helpful thank you!!
Thats very interessting, thank you so much
In a ttrpg game I'm starting soon i'm playing a rabbit that suddenly got incredibly smart and began creating machines. thinking about using this method to voice him, because rabbits dont have the correct pipes for human speech but this one is slowly becoming a machine.thanks for the informative and interesting tutorial!
This is very cool
I’ll be back to rewatch this video in the future 😉😉😉😊😊😊
This is helpful for a small chimera bunny Vtuber like me thanks so much Lyro-san
I've been thinking about doing this as there are some privacy implications that I'm worried about in regards to using my real voice. This video helps a lot and I'm very thankful for it! ^-^ Also, your model is so hecc'n cute, ahhhh!!! >w< ❤
All the artificial voices give me a headache. As annoying as it is, I'm thankful, no fooling me. Neurosama probably the only one that doesn't give me a headache straightaway.
I have a question with the Microphone setup. So how do I get the Cable Output ??? I only have my Yeti Stereo microphone as an option.
Finnaly i need it
Thanks for the video! I may start actually streaming thanks to you! I'm autistic and have selective mutism, talking always makes me scared of streaming...
I also made(assembled) my own using VOSK for STT and google translate for TTS and VB Cable for the virtual microphone thing, Just for fun(I don't stream,maybe troll my friends with it). so I'm going completely free route. Well, downside of it is multi language flexibility is bad. Also thx for sharing your setups. It is nice to know another alternative plan like vrc stt. My original plan was using openai's whisper model for stt. Either way it would save a lot of ram from vosk
Thank You.
gracias por el video
Heyo, thanks for the video but i had a little issue, how do u settup with live2d models? I'm having a lot of troubles with it and don´t know how to do it properly :c also ty for the guide x2 xd
What app you do for the vr tuber
when i commented on the other video, i was having an issue with attempting to use the voicemeeter virtual audio cable but it refused to work properly as a virtual microphone. i got lucky in that my elgato capture card must have simply come along with its own virtual audio cable because i can have the audio split properly by having tts on the "line" virtual audio cable that's associated with my capture card software. i have yet to figure out how to properly use this in discord. the audio is doubled or something
when i get a job ill be able to buy afford the patreon xd, i cant wait !
How did you have the text pop out in that mannor?
Thank you so much for the tip of using Voicemeeter's virtual audio cables to connect things properly! 🥰👍
⭐O⭐ wow you voice is so cute ⭐O⭐
great video, however i am only looking for voice to text, so that in VR it is much easier to type stuff, preferably with more than just English support, looking for Asian language supports as well. what is the best free option we have now?
can't really help with that 💦 VRCSTT can also only do STT, but idk what your intended purpose is! you can always check out their website or discord if you have questions. For free options, I sadly I have no idea if there are any. Transcribing unfortunately is very costly with most good services.
So I'm personally not interested in TTS because I'm fine with using my real voice, but I was really curious to see how it worked, so that's why I'm here. That being said, I AM very interested in accessibility tools and features, and would be interested in knowing how to have live captions on streams/videos like you showed near the end of the video. Is that possible to do with any of the programs listed here while still using your real voice? Preferably a program that doesn't cost money or is very cheap, I'm broke lol
hey there! there's a few options for having live captions during stream, the one I know and probably is the most used is the closed caption OBS plugin! just google it and you will find plenty of tutorials about that!
If you have a beefy enough GPU, you can also use RVC to convert your speech into an AI voice, instead of going from speech to text to speech 😇
for those that are interested about the delay using RVC: With a Ryzen 5 5600 and GTX 1070, it's a bit under 1 second with stable settings, so still very suitable for real time communication. If you have a faster (newer) GPU and/or CPU, you can lower it further.
hi Lyro, i have a question (since i'm alil confused and smol brain) my question is: did "Vrc stt" have the ability to just be Tts? i've gone back n forth in your vid and tried looking on there website, but i'm havin issues in understaning clearly if it has a simple "yes text to speech is a feature" i'm trying to help a friend in finding an tts and most results end up in being stts witch ain't really an possibility for them :c
yes, it has a textbox where you can type into!
@@superjollyrose tysm for the fast reply! this is great news :D
I didn't know that OBS now allowed you to set up audio from apps/windows separately. Do you have a tutorial for that? I currently use VM banana, but even with the TTS, Discord, Mic and such set up, I only have 1 input left so my music and game sound runs through desktop. Which isn't ideal. I use OBS 28
if you add a new source, on the top is the option to add just a audio from an application. alternatively you can get the win audio capture plugin for obs, which does the same!
@@superjollyrose Thank you very much!
is it possible to customize the voice and how it sounds?
Would there be an alternate free version of an app like the vrchat thingy
You know. It must be possible to do STT>TTS combined right? With all the new AI tech right?
Sorry editing because I am dumb. Will this work for 2d vtubers as well? Can I just use this as a stand-alone software? Not looking to use this for VRChat. Thank you for your time and help!
yes you can use this for vtubing, that is what I explain in this video 😊
@@superjollyrose Ah okay! Sorry I misunderstood, I thought it was only for 3D Models lol
hii ouo thanks for the great vid! I do have one question though is do you pick the voice or theyre presets in vrcstt?
vrc stt offers a variety of voices from differerent services like amazon, azure and even tiktok! they are already part of the program!
@@superjollyrose Thank you!!!
May I ask how you made this actual video? Do you just screen record and stream, or is there something else you use? You deserve so many more subscribers! Cant wait to see your channel take off, and if I ever create one, I'll definitely promote you :)
I use Davinci Resolve to edit my videos, if that's what you wanna know 😂
Do you or other streamers with STTS listen the generated speech to know how it is spoken or that it is generated correctly?
yes, I monitor the voice and even the text, just to laugh about whenever it fails to transcribe properly and I speak garbage 🤣💦
Hey there! Can I just say that stumbling across this could very well prove to be a godsend? I'm going to be launching as a VTuber under a different name at the end of this month, but I've been having so many issues trying to find my voice -- literally. I don't like my own voice, but it's very easy for me to communicate as my persona using text-to-speech and speech-to-speech. The problem is that the voice I'm set on is tied exclusively to ElevenLabs. I have a Creator's License, and able to make as many recordings as I want, except that's all they are right now, recordings. Would it be possible to work with you to try and port my voice from ElevenLabs to whatever program it is I need, or at the very least try to recreate it with your help?
Hey there! VRChat STT supports Eleven Labs, though idk how to use it, you can find more info on the VRC STT website or Discord 👍
Thank you so much again,@@superjollyrose! Once I get my model up, I'll do just that! Definitely looking forward to showing you the final results; couldn't have done this without you!
can you please do a step by step guide please? This guide is helping to point you to the general direction but a step by step guide would be a great help.
hey there! sadly I don't have the time to do this 💦 making videos is not my main job, so this little video is all I can do 🥲
@@superjollyrose understandable. thank you for replying! I finally got mine to work so I might make a step by step myself
@@ZellisArt Still up to date?
do I have to stay subscribed to the patron to use it? or can I just subscribe once, grab the app and not have to resubscribe? sorry if it’s a dumb question I don’t know much about it
yes, you have to stay subscribed.
I said to someone that I know that they sound just like this, but they claimed they sound nothing like this text to speech
hey! I know its been like 5 years this video was posted but does anyone know why I cant see the setting for my speech chat?
Hey can you help me out, i think you forgot a very important part, do i set up my model to move her mouth with the TTS intead of my mic voice?
yes, just see 4:33. that's why you use the virtual audio cable!
does the lipsync matching only work with PC in VTube Studio? I can't get it to work. I use Ios linked with the desktop version
yes, you need a PC setup. because you need Voicemeeter Banana to reroute the audio of the TTS software. dunno if there is anyway to do this on mobile 💦
@@superjollyrose Thanks! I ended up figuring it out - I thought the warning on vtubestudio meant i couldn't use my iphone to face track while also being to use volume for my mouth open parameter, but that turned out to not be the case.
woooah thankies! by the way quick question. could the stts program also be broadcasted through discord and xbox party? being able to communicate while also moving and shootin while also keeping 100% anonymous would be like THE best thing ever! also love your channel. keep up the top tier work.
hey there! yes, everything that requires a microphone input works! in Discord you just set your microphone to the virtual output! be aware that you might play around with the settings, especially disabling noise cancelling or the voice might get cut off or sound crunchy!
@@superjollyrose oh thats just perfect! thank you so much!
Was wondering if there was a way I could use this for discord as well? Paid for the highest tier but cannot seem to figure it out lmao
yes, use voicemeeter banana to set up a virtual audio cable, tutorial for that is on the vrc STT website! I'm discord you then use this audio cable as microphone ✌️
Hello fellow robot, what about free alternatives for VRCHAT stt? Sadly i'm a robot a bit broke, that surpassed it's budget cap
Can't help you with that, sorry! Most good TTS services like Azure, AWS & co have costs, especially the transcribing if you want to go for Speech to Text to Speech. There are probably some free alternatives that are very limited, but I don't know of any right now.
ok, I cant find this out so some help would be greatly appreciated, but where do you get the voices, and where do you put them. I dont think I've ever been so confused. might just be me tho
If you get VRCSTT, the voices are included. There's Azure, Amazon, TikTok and even ElevenLabs voices to choose from. These are all hosted on the service's servers, you don't need to download a voice.
AI cant steal my voice if I steal its first
I wish this explained how to set up obs with it, other then saying what needs to be changed, idk how to get to areas in obs that your talking about.
I recommend first learning OBS basics, there is plenty of tutorials on YT bc it will be helpful anyways to understand everything about an app before you use it 🥰
@@superjollyrose o.o wow a reply right away, this I am not used to. Thank you!
Where did you get your background music?
it's literally in the description of the video
@@superjollyrose Found it. TY!
Maybe this will help me with my bad speech problem
Do you know anyone who does this on fivver
For some reason vtubestudio isn’t making my mouth move when I use stts. Do you happen to know why?
Did you use Voicemeeter virtual input/output and activated the microphone in VTubeStudio and set the Mouth Open and Mouth Smile parameter input to Volume / Frequency?
I'm very confused on the subtitles part I spent an hour trying to figure it got noting if you can point me to the right direction. Be great
In VRC STT you can go to the Logging tab and enable logging, then there is a button that takes you to the folder direction of the .txt file. In OBS you make a new text source and in the settings for the text source is an option to grab text from a local file, this is were you put the .txt file in.
@@superjollyrose so I have an issue where when I say a couple things that is longer it just cut out the word and half and doesn't show the full sentence im been testing for a bit now to fix it so the words are not so big and not small but no luck it keeps cuting out the words
Can you use it on discord? I'm a mute and having the ability to use tts in vc would be amazing
Yes you can with Voicemeeter banana!
Can this be done with a custom trained TTS voice?
It uses plenty of services, but idk exactly about custom voices, you can always ask the VRC developers! They also have a discord!
@@superjollyrose Might do! Ty for the answer ❤
How to make OBS grab the text form VRC STT?
In VRC STT go to the "Logging" tab and enable it. Click on "Open Log Folder" to navigate to the .txt file that is needed in OBS. Then, in OBS, simply add a new text source and enable "read from file" and browse to the .txt file location.
@@superjollyrose Thank you~~~~~~
YOO I ALSO SPEAK GERMAN
is there a free way of doing this? i'm shy and don't want to use my real voice, but i also want a better way to interact with friends then using a text chat. i whould spend money but can't, i hope i'm not wasting your time. whatever the answer is thanks.
hey there! sadly I don't know any free method, since most TTS services have a fee, especially when you want to use STTS, because transcribing costs even more.
plenty of free that do the exact same thing on github
how do i actually make it hear my voice you never explained this??
I recommend you looking into VRC STT to get your answers! this is not a proper tutorial, but to show off what I use. if you have any more questions, please take a look at the apps website or discord for support.
do i only have to pay for vrc stt once?
either pay monthly via patreon or per use with tokens, for more Infos please go to their website!
thank you! @@superjollyrose
Heya Lyro! Great video, first of all! I do have a quick question, however. I've watched a few of your twitch vods (Dropped a follow too
hey there! thanks a lot! and yes, it's basically just acting haha you gotta practice the timing of the TTS and then just move around! Though the mouth automatically lipsyncs to the voice, which can easily be set up in VTube Studio! So all I do is move my head/body to the words that I spoke.
i can use it with loquendo?
please ask the developers, I don't really know what that even is!
@@superjollyrose a text to speech software
is there a way to accidentally reveal my voice using the voice to speech thing i really don't like my voice and i will die if anyone heard it i just wanna make sure
no, just make sure the microphone you speak into is not active in OBS or wherever you use it!
do you need a mic for thus
Not really, you may just type stuff
you can either just type, or how I do it, use speech to text to speech, which requires a microphone!
😶🌫️🎉
you are so cute AAAAAAAAAAAAAAAAAAAAA
I'll stick to Ryan Reynolds voice for now.. but good luck with everything you do.
14.50 euros par month tho:( Cant affort = cant stream yay
how to use vtuber
you're awesome, My question is, I'm Mute, is there a way to use somehtng lie tis set up to bradcast the voice I choose over Discro text? insted of the basic robo voice they use?
yeah, if you use Voicemeeter you can use the output everywhere like a microphone input!
...that moment when you're poor, thinking maybe tts is a way to talk...then you find out costs as much as a microphone... those things costs money my dude!! ...