Genshin Impact Voice Models + (Added Zhongli 80 Min ripped dataset)

RVCAnimeArtistEnglish

bigsoulja

2 years ago

Description

Genshin Voice Models (RVC): Gorou, Xiao, Lisa, Childe (Tartaglia), Zhongli English VAs. Feel free to request others.

- 500 epochs, locally trained.
- Trained with ripped voice lines from the game with CREPE at 32 hops (Mangio RVC Beta).
- All samples normalized and denoised professionally with Waves/iZotope plugins using actual noise samples before training.
- Trained with pitch, so should work well for both spoken word and songs.
- Would recommend inferring with CREPE; experiment with hop length and feature retrieval. Hop length 128-192 with feature retrieval 0.4-0.8 usually sounds good.
- Examples attached were just recorded with a mediocre headset mic, no mixing at all. I also have a low-pitched voice and sound nothing like these characters. Inferring on someone with a voice closer to the target model, plus a better mic, would give much better results.

Please credit me if used publicly anywhere such as social media etc.

Discord: BigSoulja#8888
TikTok: @souljavr
YouTube:

You can credit any one of the above handles, does not need to be all of them.

Xiao (500 Epoch RVC V1):
Childe (500 Epoch 32 Hop Length RVC V1):
Gorou (500 Epoch 32 Hop Length RVC V1):
Lisa (500 Epoch 32 Hop Length RVC V1):
Zhongli (750 Epoch 16 Hop Length RVC V2, Huge 80 Min Ripped Dataset):
Keqing (300 Epoch 16 Hop Length RVC V2):

Will be adding Genshin AI models to my huggingface repository too:

Comments

bigsoulja

2 years ago

https://youtu.be/HzJ7J41YZzA

bigsoulja

2 years ago

Roddy Ricch 'Ballin' using these voices

bigsoulja

2 years ago

I'll update the links today

bigsoulja

2 years ago

Added Zhongli (Keith Silverstein) trained on a huge 80 min dataset

bigsoulja

2 years ago

Genshin Impact Voice Models (Added Zhongli 80 Min ripped dataset)

bigsoulja

2 years ago

Genshin Impact Voice Models V1 + V2 (Added Zhongli 80 Min ripped dataset)

bigsoulja

2 years ago

zhongli

bigsoulja

2 years ago

Zhongli singing with this model: https://vm.tiktok.com/ZGJxRsaGC/

bigsoulja

2 years ago

Added Keqing, 14 min dataset - seems to overtrain above 300 epochs. Will also be posting my models on my huggingface repository. https://huggingface.co/BigSoulja/Genshin-EN-AI-Voices/resolve/main/Keqing-CREPE-300epoch-By-BigSoulja%238888.zip

xunnylee

2 years ago

what song is this? its really good!

bigsoulja

2 years ago

Laufey - From The Start, thank you

bigsoulja

2 years ago

yes, usually leads to better pitch shifting even if inferred at a higher hop length after. I have also noticed syllable pronunciation is more accurate with lower hops. Below 16 doesn't seem to make any difference.

bigsoulja

2 years ago

will make training time/memory needed for feature extraction much higher at lower hops though

bigsoulja

2 years ago

in my opinion yes, but feel free to experiment. you are welcome.

bigsoulja

2 years ago

lower doesn't seem to cause any downsides. you just eventually get no benefit. so might as well go as low as you can lol.
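
For anyone wondering why lower hops cost more time and memory, here is a rough sketch of the arithmetic. It assumes the hop length is counted in audio samples and that pitch extraction runs on 16 kHz audio. Both are assumptions on my part, not something stated in this thread, and the function names are made up for illustration. A smaller hop means more pitch frames for the same clip, so more work during feature extraction.

```python
import math

# Assumption: pitch extraction runs on audio resampled to 16 kHz.
ASSUMED_SAMPLE_RATE = 16000


def frame_ms(hop_length: int, sample_rate: int = ASSUMED_SAMPLE_RATE) -> float:
    """Time between two pitch estimates, in milliseconds."""
    return 1000.0 * hop_length / sample_rate


def pitch_frames(duration_s: float, hop_length: int,
                 sample_rate: int = ASSUMED_SAMPLE_RATE) -> int:
    """Approximate number of pitch frames for a clip of the given length."""
    return math.ceil(duration_s * sample_rate / hop_length)


if __name__ == "__main__":
    # Hop lengths mentioned in this thread: 16/32 for training, 128-192 for inference.
    for hop in (16, 32, 128, 192):
        print(f"hop {hop:>3}: one estimate every {frame_ms(hop):.1f} ms, "
              f"{pitch_frames(60, hop)} frames per minute of audio")
```

Under these assumptions, going from 128 to 16 hops means eight times as many pitch frames for the same audio, which lines up with the higher extraction cost mentioned above.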