CGO - Adventure Time: Distant Lands BMO (Simone Giertz)

Create

CGO - Adventure Time: Distant Lands BMO (Simone Giertz)

EnglishRVC V2Fictional
Freaky98 user image
Freaky98
1 year ago
👀

577

👍

5

🪄

21

Description

Link - A 1-minute dataset. 20 kHz There aren't a lot of voice lines, so I had to squeeze as much clean data as I could, denoise, remove SFX, de-click, and spectral repair.

Comments

litsa_the_dancer user image
litsa_the_dancer
1 year ago

just a note spectral repair isnt that useful, ur better off just not using it. u can also compress ur audio with a ration of 2.0:1 to lower or remove some left over noise and make the dataset much more consistent. Also use phasing and de ess.

Freaky98 user image
Freaky98
1 year ago

I use spectral repair just to remove some tiny imperfections caused by SFX separation (they are not even audible)

litsa_the_dancer user image
litsa_the_dancer
1 year ago

Usually it does more harm then good cuz it introduces more inconsistent frequencies. GANs don't like that. You can test it out yourself. To compare u should look at the gradients and see how they'll adjust more smoothly compared to the repaired dataset. Just know that just bc u can't hear said repaired frequencies that doesn't mean that the GAN can't be confused by it

litsa_the_dancer user image
litsa_the_dancer
1 year ago

Also if u ripped from yt it's advisable to train with 32k rather than 40k :3

Freaky98 user image
Freaky98
1 year ago

I meant that even before repairing, it was at an inaudible level and outside of the voice frequency range. No, it was ripped from HMAX.

litsa_the_dancer user image
litsa_the_dancer
1 year ago

still u may not hear it but the gan sure can see it thru the mel spectrogram. Regardless mind showing me a close up of the spectrogram on rx?

litsa_the_dancer user image
litsa_the_dancer
1 year ago

oh yeah thats rough

litsa_the_dancer user image
litsa_the_dancer
1 year ago

u should def train with 32k

Freaky98 user image
Freaky98
1 year ago

What do you mean by rough?

litsa_the_dancer user image
litsa_the_dancer
1 year ago

the cut off is much lower than 40k

Add a comment

Samples

New
Classic
1. Singing
Male
English
2. Singing
Female
English
3. Singing (Dry)
Female
English
4. Singing (High)
Female
English
5. Singing 2
Male
English
6. Singing (Dry)
Male
English
7. Singing (Dry, High)
Male
English

Pitch

More to explore

Loading more

Selected Audio
Selected Audio