Genshin Impact Voice Models + (Added Zhongli 80 Min ripped dataset)

Create

Genshin Impact Voice Models + (Added Zhongli 80 Min ripped dataset)

RVCAnimeArtistEnglish
bigsoulja user image
bigsoulja
2 years ago
👀

2k

👍

10

🪄

111

Description

Genshin Voice Models (RVC) - Gorou, Xiao, Lisa, Childe (Tartaglia), Zhongli English VA's. Feel free to request for others. - 500 epochs, locally trained - Trained with ripped voice lines from the game with CREPE at 32 hops (Mangio RVC Beta). - All samples normalized and denoised professionally with Waves/iZotope plugins using actual noise samples before training. - Trained with pitch, should work well for both spoken word and songs. - Would recommend inferring with CREPE, experiment with hop length and feature retrieval. 128-192 with 0.4-0.8 usually sounds good. - Examples attached were just recorded with a mediocre headset mic, no mixing at all. I also have a low pitch voice and sound nothing like these characters. Inferred on someone with a closer voice to the target model + better mic would give much better results. Please credit me if used publicly anywhere such as social media etc. Discord: BigSoulja#8888 TikTok: @souljavr YouTube: You can credit any one of the above handles, does not need to be all of them. Xiao (500 Epoch RVC V1): Childe (500 Epoch 32 Hop Length RVC V1): Gorou (500 Epoch 32 Hop Length RVC V1): Lisa (500 Epoch 32 Hop Length RVC V1): Zhongli (750 Epoch 16 Hop Length RVC V2, Huge 80 Min Ripped Dataset): Keqing (300 Epoch 16 Hop Length RVC V2): Will be adding Genshin AI models to my huggingface repository too:

Comments

bigsoulja user image
bigsoulja
2 years ago
bigsoulja user image
bigsoulja
2 years ago

Roddy Ricch 'Ballin' using these voices

bigsoulja user image
bigsoulja
2 years ago

I'll update the links today

bigsoulja user image
bigsoulja
2 years ago

Added Zhongli (Keith Silverstein) trained on a huge 80 min dataset

bigsoulja user image
bigsoulja
2 years ago

Genshin Impact Voice Models (Added Zhongli 80 Min ripped dataset)

bigsoulja user image
bigsoulja
2 years ago

Genshin Impact Voice Models V1 + V2 (Added Zhongli 80 Min ripped dataset)

bigsoulja user image
bigsoulja
2 years ago

zhongli

bigsoulja user image
bigsoulja
2 years ago

Zhongli singing with this model: https://vm.tiktok.com/ZGJxRsaGC/

bigsoulja user image
bigsoulja
2 years ago

Added Keqing, 14 min dataset - seems to overtrain above 300 epoch. Will also be posting my models on my huggingface repository. https://huggingface.co/BigSoulja/Genshin-EN-AI-Voices/resolve/main/Keqing-CREPE-300epoch-By-BigSoulja%238888.zip

xunnylee user image
xunnylee
2 years ago

what song is this? its really good!

bigsoulja user image
bigsoulja
2 years ago

Laufey - From The Start, thank you

bigsoulja user image
bigsoulja
2 years ago

yes, usually leads to better pitch shifting even if inferred at a higher length after. I have also noticed syllable pronounciation is more accurate with lower hops. Below 16 doesn't seem to make any difference.

bigsoulja user image
bigsoulja
2 years ago

will make training time/memory needed for feature extraction much higher at lower hops though

bigsoulja user image
bigsoulja
2 years ago

in my opinion yes, but feel free to experiment. you are welcome.

bigsoulja user image
bigsoulja
2 years ago

lower doesnt seem to cause any downsides. you just eventually get no benefit. so might as well go as low as you can lol.

NyctoDarkMatter user image
NyctoDarkMatter
1 year ago

Can't download, asks for a username and password.

KavehUke user image
KavehUke
8 months ago

I don't understand how to use this ?

Master Gamer user image
Master Gamer
6 months ago

to download the voice, there is three dots upper right to the images

Add a comment

Samples

New
Classic
1. Singing
Male
English
2. Singing
Female
English
3. Singing (Dry)
Female
English
4. Singing (High)
Female
English
5. Singing 2
Male
English
6. Singing (Dry)
Male
English
7. Singing (Dry, High)
Male
English

Pitch

Selected Audio
Selected Audio