
Genshin Impact Voice Models + (Added Zhongli 80 Min ripped dataset)

Description
Genshin Voice Models (RVC) - Gorou, Xiao, Lisa, Childe (Tartaglia), Zhongli English VAs. Feel free to request others.

- 500 epochs, locally trained.
- Trained with ripped voice lines from the game with CREPE at 32 hops (Mangio RVC Beta).
- All samples normalized and denoised professionally with Waves/iZotope plugins using actual noise samples before training.
- Trained with pitch, should work well for both spoken word and songs.
- Would recommend inferring with CREPE; experiment with hop length and feature retrieval. A hop length of 128-192 with feature retrieval at 0.4-0.8 usually sounds good.
- Examples attached were just recorded with a mediocre headset mic, no mixing at all. I also have a low-pitched voice and sound nothing like these characters. Inferring on someone with a voice closer to the target model, plus a better mic, would give much better results.

Please credit me if used publicly anywhere, such as social media.

Discord: BigSoulja#8888
TikTok: @souljavr
YouTube:

You can credit any one of the above handles; it does not need to be all of them.

Xiao (500 Epoch RVC V1):
Childe (500 Epoch 32 Hop Length RVC V1):
Gorou (500 Epoch 32 Hop Length RVC V1):
Lisa (500 Epoch 32 Hop Length RVC V1):
Zhongli (750 Epoch 16 Hop Length RVC V2, Huge 80 Min Ripped Dataset):
Keqing (300 Epoch 16 Hop Length RVC V2):

Will be adding Genshin AI models to my huggingface repository too:
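To give a rough feel for the hop length numbers mentioned in the description, here is a minimal sketch that converts a hop length (in samples) into the time between pitch estimates and the number of pitch frames per minute of audio. The 16 kHz sample rate and the helper names `hop_to_ms` and `frames_per_minute` are assumptions for illustration only; they are not taken from the RVC code, so check your own setup before relying on the exact figures.

```python
# Assumption: pitch extraction runs on audio at 16 kHz.
# Change this if your pipeline uses a different rate.
SAMPLE_RATE_HZ = 16_000


def hop_to_ms(hop_length, sample_rate=SAMPLE_RATE_HZ):
    """Time between consecutive pitch estimates, in milliseconds."""
    return 1000.0 * hop_length / sample_rate


def frames_per_minute(hop_length, sample_rate=SAMPLE_RATE_HZ):
    """Number of pitch frames computed for one minute of audio."""
    return (60 * sample_rate) // hop_length


# Hop lengths mentioned on this page: 16 and 32 for training,
# 128-192 for inference.
for hop in (16, 32, 128, 192):
    print(f"hop {hop:>3}: {hop_to_ms(hop):5.1f} ms per frame, "
          f"{frames_per_minute(hop):>6} frames per minute")
```

Under that assumption, halving the hop length doubles the number of pitch frames to compute, which would be consistent with lower hops costing more time and memory during feature extraction.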
Comments


Roddy Ricch 'Ballin' using these voices

I'll update the links today

Added Zhongli (Keith Silverstein) trained on a huge 80 min dataset

Genshin Impact Voice Models (Added Zhongli 80 Min ripped dataset)

Genshin Impact Voice Models V1 + V2 (Added Zhongli 80 Min ripped dataset)

zhongli

Zhongli singing with this model: https://vm.tiktok.com/ZGJxRsaGC/

Added Keqing, 14 min dataset - seems to overtrain above 300 epoch. Will also be posting my models on my huggingface repository. https://huggingface.co/BigSoulja/Genshin-EN-AI-Voices/resolve/main/Keqing-CREPE-300epoch-By-BigSoulja%238888.zip

what song is this? it's really good!

Laufey - From The Start, thank you

yes, training at a lower hop length usually leads to better pitch shifting, even if you infer at a higher hop length afterwards. I have also noticed syllable pronunciation is more accurate with lower hops. Below 16 doesn't seem to make any difference.

lower hops will make the training time and the memory needed for feature extraction much higher though

in my opinion yes, but feel free to experiment. you are welcome.

lower doesn't seem to cause any downsides. you just eventually get no benefit. so might as well go as low as you can lol.
Can't download, asks for a username and password.

I don't understand how to use this?
to download the voice, use the three dots at the upper right of the images