General statistics
List of Youtube channels
Youtube commenter search
Distinguished comments
About

List of all parsed channels
AI Search
Hearted comments

Hearted Youtube comments on AI Search (@theAIsearch) channel.

Previous
1
Next
...
All

I'm not gonna lie, that Gura voice was so unsettlingly spot on... I'm feeling all sorts of mixed emotions.
5100
That Markiplier voice is absolutely spot on wtf
2500
14:37 Jesus Christ! It sounds exactly like him
2400
Markiplier's voice sounds way too real
1400
Hey everyone! This is the developer here. Let me know if you have any questions!
1400
The time is coming very soon where people will treat human made music like handmade furniture
1300
Bro calm. I was already blown away by the first example you gave at the beginning of the video and then you kept making it even crazier. This video truly deserves the SHOCKING or STUNNING title
1300
I dont even need an Ai to sound like an anime girl. Its a unique skill that i unlocked at the bathroom.
1200
Now I can be a skinwalker like I've always wanted to. Thank you so much
1200
Wow. So a young girl can look like a 30 year old programmer with help from ai. That's scary
1100
Cute girl with an Adams Apple! 😳
948
so basically you don't need anymore to be female in order to become a moe vtuber
906
So I made some discoveries which could help some people. -Chunk: You should increase this if you experience any distortions or voice lag when gaming with this. This adds more graphics processing time, and if you set it longer it wont rush out bad audio. -Extra: This setting gives bonus CPU usage to help iron out the audio. I found that sometimes the changer would translate an F sound to an S sound, but adding a bit "extra" CPU to it (like the 8k setting or higher) fixes the problem. You don't want to max this out unless the only thing you're doing is using your voice, as maxing it out will use all or nearly all of your CPU. -Noise: I recommend using Sup2 option if you have an Air-Conditioner or other background noise. Sup1 didn't work as well and is probably for a different frequency range, so your millage may vary. When you start real time voice changing, you'll see some info in a box with millisecond timers. The thing to watch is the "res" time. If this time starts going up, from around 300ms it starts rising to a thousand then two thousand etc, this means the computer is unable to get the voice processing out in time, its being pushed back in priority you could say. The fix is to increase the Chunk, this will give it more time to process with the remainder of your resources, and switching it you should see the number start decreasing rapidly. If it doesn't just raise it even higher, and also again keep in mind that if you're doing something that is CPU intensive, you need to keep the Extra setting fairly low (like around 8k). I have a powerful computer (10 core i9 10900kf, with a reference 3080ti), and I found that if im going to play a "serious" game like GTAV or StarCitizen etc, its best to have the Chunk as high as 192, or 256, with the Extra set to 8192. If you're just on discord, or playing some very light game, you can crank the Extra up, and reduce the Chunk to maintain high quality audio but process it considerably faster. Hope this helps someone!! Good luck o/
695
Imagine a bug, glitched or an error happened in the middle of the stream and you don't know that your audience is hearing your original voice. :)
617
Damn this is kinda scary but also fascinating. The fact this is real-time and it sounds so good just blows my mind.
505
Well, we've had a good run, folks.
499
MEN baiting MEN with pretty girl AI filters. What a lovely world we live in - enjoy the fun.
478
Can we appreciate the amount of research he does for his videos?
457
Now i can troll in over watch people be like " your voice is so cute" enables markiplier "you wanna say that again?"
440
What a time to be alive
420
The next era of trolling will be wild 🐴
404
I swear new AI is coming out so fast. It's hard to keep up with it.
387
Dude!!! Thank you. I normally don't subscribe and when I do it normally takes months of viewing to convince me. You got me with one video. Subscribed! It's rare to see such clear straight-to-the-point honesty from a YouTube video. There's normally a pile of BS to stretch out the video and a bunch of other junk I'm not interested in. Your video was spot on, to the point, no BS, and honest. 10 out of 10 buddy. Will be checking out all your other videos. Very impressed. I just recently started adding AI content to my channel and finding the right AI tools and best websites is a minefield of credit traps and nasty worthless AI that should not see the light of day. Well done. Keep up the good work! People who are new to AI need more creators like you.
344
can't imagine what will happen in the next 10 years
331
I've been experimenting with this for a bit, and I'm disappointed by how vague and incomplete the English documentation on these settings is. In an effort to remedy this, here's my breakdown of each setting: Response threshold: Controls the noise gate. Any sound below the threshold is suppressed. This is used to prevent background noise and hiss from being turned into strange mumbling. Equivalent to "S. Threshold" in w-okada. Not applicable in RVC WebUI. Pitch settings: Applies a pitch offset to your input voice. Every multiple of 12 setting increases or decreases the voice by an octave. Adjustments by 1 increase or decrease by a semitone. Using whole octaves is primarily used to ensure you can sing in the same key. Equivalent to "TUNE" in w-okada. Equivalent to "Transpose" in RVC WebUI. Index rate: When an index file is provided, this slider augments the target voice by preserving more of its accent and less of the input voice (to reduce tone leakage). This is particularly useful for voices trained with a low epoch count (around 200-ish or less). If set too high, it can cause strange pronunciation artifacts. I usually find something around 0.30 to sound good, but it varies by voice model. Equivalent to "INDEX" in w-okada. Equivalent to "Search feature ratio" in RVC WebUI. Loudness factor: How little to preserve the loudness of the input performance. At 0, the loudness of the cloned voice should match the loudness of the input voice. At 1, the cloned voice will always be at full loudness. 0 is useful if you want to distinguish between whispers, talking, screaming, etc. 1 is useful to have the cloned voice always speak loudly and clearly, as loud as the loudest things it was trained on (which can have artifacts such as mic clipping depending on the training set). Values in-between provide partial volume control biased toward being louder, the closer you get to 1. There is no equivalent in w-okada. Equivalent to "volume envelope scaling" in RVC WebUI. Pitch detection algorithm: Different algorithms are better at different things. rmvpe is the current state-of-the-art and works fastest and usually with the highest quality. Equivalent to "F0 Det." in w-okada. Equivalent to "pitch extraction algorithm" in RVC WebUI. Sample length: The realtime voice changer works by sending small chunks of audio for quick conversion, then stitching them together. Longer sample lengths feed in longer chunks, making the stitches less obvious and reducing GPU requirements but increasing output latency. On a low end GPU, setting this too low will make the GPU unable to keep up and produces stutters. On a high end GPU, setting this too low will cause warbling as an artifact of stitching many overly-short chunks together. Equivalent to "CHUNK" in w-okada. Not applicable in RVC WebUI. Number of CPUs: Self explanatory. Note, however, that rmvpe is a GPU-based pitch extractor and should be relatively unaffected by this setting. There is no equivalent in w-okada. Not applicable in RVC WebUI. Fade length: The length between chunks to crossfade together. Longer may reduce warbling. Equivalent to "overlap" in w-okada advanced settings. Not applicable in RVC WebUI. Extra inference time: How much old audio to load into each chunk. The extra context usually improves voice quality for the generated chunk but is more demanding for the GPU. Equivalent to "EXTRA" in w-okada. Not applicable in RVC WebUI. Input noise reduction: Attempts to remove non-speech background noise from the input to prevent sounds from being turned into strange mumbling. Equivalent to "NOISE" in w-okada. Not applicable in RVC WebUI. Output noise reduction: Applies the same noise reduction to the output voice. Possibly good for poorly trained voices with lots of background noise. There is no equivalent in w-okada, but the usefulness of this setting is dubious. Not applicable in RVC WebUI. Input voice monitor: Lets you hear the voice audio being passed in to the voice changer, sent to the target output device. Useful to ensure you are passing in the audio you actually want or to passthrough your audio without voice changing. Comparable to "monitor" settings in w-okada. Not applicable in RVC WebUI. Output converted voice: Outputs the voice conversion to the target output device. Main features RVC realtime has that w-okoda doesn't: Loudness factor controls. W-okoda seems to always use a value of 0. Significantly lower CPU usage at equivalent performance settings, in my experience. Main features that w-okoda has that RVC realtime doesn't: No system to save model presets. Input/output gain is missing. Input noise reduction is less robust compared to w-okoda, which offers echo reduction and multiple noise suppression techniques. Unlike w-okoda, you cannot passthrough to the input mic, instead requiring the use of virtual audio cable to pass the cloned voice into voice calls and microphone recording programs. In w-okoda, when the mic loudness falls below the response threshold, the tool is paused until speech is once again loud enough, saving GPU and CPU resources. RVC realtime always passes audio whenever it is running. Unlike w-okoda, you cannot monitor the cloned voice while outputting it. You can work around this by using the "listen" feature in the Windows sounds panel on a virtual audio cable instead. No built-in recording functionality. Missing most of the settings in the w-okoda "advanced settings" menu. No way to choose which GPU to run the voice model on. You can get around this by setting CUDA_VISIBLE_DEVICES=# in a terminal before launching the tool from there, where # is the index of your target GPU (0, 1, 2, etc.).
319
Cat fishing went to a whole new level it's even anime.
263
I'm excited for AGI, personally. Genuinely feel blessed to probably be alive at one of the most life-changing points in human history, assuming corporate greed and government censorship don't ruin it all.
261
1 - Damn, hearing the voice of Ai Hoshino once again give me mixed emotions. I miss her so much 😭😭😭 2 - We need a voice model for Dmitry Yazov IMMEDIATELY
229
11:46 for anyone wondering the qr code is unscannable
205
Your videos are really well done, the descriptions and explanations are top tier. This is a game changer for me since I'm a DM at a D&D game that we play on Discord and this will help me make the sessions even more entertaining and will help the players feel like they're interacting with unique characters instead of just me acting different voices. I may also mess around a bit while gaming online. Just a bit.😝
193
I wonder if there is anything better I can do in my life than re-watching all of your videos (I am going crazy)
171
"call your tinder match on facetime to check if she really looks like her pics" this tech:
169
I respect the actual results you've shown and the moral of the whole video at the end. Great video!
161
Pokémon was just the beginning. Stay tuned for more! 🤖
138
This is genuinely such an amazingly well put together tutorial video, no side-tracks, no random edits or cuts that confuses a complete newcomer as well as also including disclaimers and showing some potential issues with the program, this earns a sub!
135
5:00 Short version: The "all or none" principle oversimplifies; both human and artificial neurons modulate signal strength beyond mere presence or absence, akin to adjusting "knobs" for nuanced communication. Longer version: The notion that neurotransmitters operate in a binary fashion oversimplifies the rich, nuanced communication within human neural networks, much like reducing the complexity of artificial neural networks (ANNs) to mere binary signals. In reality, the firing of a human neuron—while binary in the sense of action potential—carries a complexity modulated by neurotransmitter types and concentrations, similar to how ANNs adjust signal strength through weights, biases, and activation functions. This modulation allows for a spectrum of signal strengths, challenging the strict "all or none" interpretation. In both biological and artificial systems, "all" signifies the presence of a modulated signal, not a simple binary output, illustrating a nuanced parallel in how both types of networks communicate and process information.
132
The only youtube channel which dosent adds weird confusing coding things. The way you explain is fabulous. I have never seen someone on YouTube explaining hard things like this simple
129
The potential for this technology is staggering, but also the potential for misuse is even moreso.
128
This is honestly amazing, the ai is spot on and works perfectly for me. your tutorial is also perfect as well and was very easy to understand. i would love to see a video about how to get this to work on discord since i have had trouble doing the same thing with another voice changer before.
125
As the saying goes: if they show the public now, then they already had the technology years ago. 👍
120
THE AI OVERLORD HAS CAME BACK AGAIN WITH A GREAT TUTORIAL VIDEO!
117
I am waiting for the day i can make a movie with ai. I have so many ideas
116
Thanks! , I would be glad to have the manus invite.Tomorrow is my birthday and it can be the best present.😂 In any case, I am grateful for the wonderful reviews
111
Thanks dude!! You're SOOO much more different that other creators who just recommend tools that aren't free but claim to be in their channel. You just gained a sub 🎉
110
This is wild. What a time to be alive
106
those who want to know, the virtual character, Tsukuyomi - chan, the l0li character with white hair he used in beginning for demonstration. She's just a character for free material that can be used for commercial purposes, such as for this voice changer.
100
All these companies aren't gonna release their research tech until someone builds an open-source equivalent. They want someone else to open Pandora's box and absorb the legal liabilities first.
91
"Nobody cared who i was until i put the voice changer on." - Bane Chan
90
I know using Spongebob in the beginning was the demonstrate just how crazy you can go with it, but that took me out bro lmaoooo
89
Not often you find a perfect tutorial - hack - descriptive - helpful video combo, but here we have one TYSM.
88

Previous
1
Next
...
All