Description
trained on 10 minutes of data from her guerilla stream (melband roformer + rx10 de-click + de-noise(?) ) the dataset has gone through severe de-clicking... pitch extraction: rmvpe steps: 15.5k-15.6k batch size: 7 pretrain: regular v2 huggingface:
Comments

sxndypz
i didn't get a sample yet and i got 2 reactions

CRoyce6448
sample when?

sxndypz
hold on let me get a sample rq

sxndypz
gimme a moment

mrm0dz
💀

sxndypz
i tried it on realtime, and yeah it doesn't work really well

sxndypz
ok *now* the next vtuber voice model is going to be dizzy dokuro

mrm0dz
probably transpose bc her voice is kidna tomboy-ish

sxndypz
i did transpose it tho

litsa_the_dancer
very good work, i like it

sxndypz
the hammy of the wik
Add a comment
Samples
New
Classic
1. Singing
Male
English
2. Singing
Female
English
3. Singing (Dry)
Female
English
4. Singing (High)
Female
English
5. Singing 2
Male
English
6. Singing (Dry)
Male
English
7. Singing (Dry, High)
Male
English
Pitch