Description
aka the great alchemical voice model trained on a same dataset as the ov2 one, but this one is trained on a regular v2 pretrain steps: 15.9k batch size: 7 pretrain: regular v2 / 40k huggingface:
Comments
realpikachuwu
ado

sxndypz
kit if you're going to update the space by adding muyu, just use this version ok??
Leo_Frixi
Sounds better than the OV2 version.
Leo_Frixi
But i think you should add more laughs to the dataset next time or use a bigger dataset.

sxndypz
i thought that the laughs might ruin the model
Leo_Frixi
As you can see, you can hear how the voice breaks when it tries to laugh on a sample audio.
Leo_Frixi
So, the fact that "laughs might ruin the model" isn't right at all.

sxndypz
next is probably nasa

sxndypz
but this time i'll try to train it locally

sxndypz
might update the dataset and retrain it
Add a comment
Samples
New
Classic
1. Singing
Male
English
2. Singing
Female
English
3. Singing (Dry)
Female
English
4. Singing (High)
Female
English
5. Singing 2
Male
English
6. Singing (Dry)
Male
English
7. Singing (Dry, High)
Male
English
Pitch