Morshu (From Link: The Faces of Evil)

Create

Morshu (From Link: The Faces of Evil)

RVC v2
analogspiderweb user image
analogspiderweb
2 years ago
👀

1.3k

👍

16

🪄

357

Description

Dataset was 1:41 (actually 14 seconds, but repeated several times).

Comments

makiligon user image
makiligon
2 years ago

okay

Autumn user image
Autumn
2 years ago

14 seconds dataset 😭

Autumn user image
Autumn
2 years ago

we need to make it sing the covers that were made with sentence mixing

Autumn user image
Autumn
2 years ago

im gonna do ai morshu scatman now

Autumn user image
Autumn
2 years ago

exactly

Autumn user image
Autumn
2 years ago

instead of sentence mixing it will be ai

Autumn user image
Autumn
2 years ago

if AI was this good in 2010 internet would have gone crazy

Autumn user image
Autumn
2 years ago

he cut EVERY SINGLE LETTER

Autumn user image
Autumn
2 years ago

since he has like 5 lines

makiligon user image
makiligon
2 years ago

wonder how it would sound like if you train it on sentence mixed voicelines and overfit it as much as possible

Autumn user image
Autumn
2 years ago

🤔 should i

makiligon user image
makiligon
2 years ago

if you know how to sentence mix an entire dataset (or take existing ones like the ltg one i made above lol) then sure

Autumn user image
Autumn
2 years ago

i know how to do it by muscle memory, that's what i always did in 2015

makiligon user image
makiligon
2 years ago

based

Autumn user image
Autumn
2 years ago

seems like this model was made on an old version of rvc

Autumn user image
Autumn
2 years ago

i hate this

Autumn user image
Autumn
2 years ago

ill retrain it on crepe

Autumn user image
Autumn
2 years ago

made dataset with a new technique

Autumn user image
Autumn
2 years ago

i dont know if this will make it better or worse

Autumn user image
Autumn
2 years ago

best loss rate ive ever seen in my entire life

analogspiderweb user image
analogspiderweb
2 years ago

And I trained it on crepe too

Autumn user image
Autumn
2 years ago

weirdly enough it doesn't work on the latest version of mangio's rvc fork

Autumn user image
Autumn
2 years ago

throws errors

analogspiderweb user image
analogspiderweb
2 years ago

Huh, that's odd

Autumn user image
Autumn
2 years ago

plus pretty weird since it has a total_fea file

analogspiderweb user image
analogspiderweb
2 years ago

That fork is used in the notebook

Autumn user image
Autumn
2 years ago

which got deprecated almost 3 weeks ago

Autumn user image
Autumn
2 years ago

ill post this V3, maybe will make an eventual V4 if it turns out good with sentence mixed vocals

Autumn user image
Autumn
2 years ago

i tried a new method for small datasets so let's see how it turns out, training to 300 epochs

analogspiderweb user image
analogspiderweb
2 years ago

I put V2 in the name because I trained it using the V2 notebook

analogspiderweb user image
analogspiderweb
2 years ago

So technically this is V1

Autumn user image
Autumn
2 years ago

rename it to v1 lol i thought this was a v2 of the model

Autumn user image
Autumn
2 years ago

Morshu (Link: The Faces of Evil) V1 RVC (300 Epochs)

analogspiderweb user image
analogspiderweb
2 years ago

I don't want to mislead anyone into thinking this was trained with RVC V1

Autumn user image
Autumn
2 years ago

rvc v1 doesn't exist

analogspiderweb user image
analogspiderweb
2 years ago

There should be an RVC V2 tag

Autumn user image
Autumn
2 years ago

what

Autumn user image
Autumn
2 years ago

there is no rvc v2?

Autumn user image
Autumn
2 years ago

i dont understand lol

Autumn user image
Autumn
2 years ago

you mean secondary base models?

analogspiderweb user image
analogspiderweb
2 years ago

Yes, I think it trains a different way as well

Autumn user image
Autumn
2 years ago

only for smaller datasets with smaller epoch count

Autumn user image
Autumn
2 years ago

it trains "faster"

Autumn user image
Autumn
2 years ago

like a v1 500 epochs is the same as v2 300 epochs

Autumn user image
Autumn
2 years ago

it needs less epochs (?)

Autumn user image
Autumn
2 years ago

im training with v1

Autumn user image
Autumn
2 years ago

just rename it to Morshu (Link: The Faces of Evil) RVC-2 (300 Epochs)

Autumn user image
Autumn
2 years ago

cause v2 is usually used for models

analogspiderweb user image
analogspiderweb
2 years ago

Morshu (Link: The Faces of Evil) RVC-2 (300 Epochs)

Autumn user image
Autumn
2 years ago

also the audio was pretty low quality so i used 32k sample target

analogspiderweb user image
analogspiderweb
2 years ago

If you want to make a better quality Morshu dataset, I recommend Elevenlabs

Autumn user image
Autumn
2 years ago

i will never touch elevenlabs, i hate it as a whole

analogspiderweb user image
analogspiderweb
2 years ago

Based

analogspiderweb user image
analogspiderweb
2 years ago

I think someone else did it before but I can't be bothered to find it

Autumn user image
Autumn
2 years ago

alright it's at 250 epochs, 50 left to go

Autumn user image
Autumn
2 years ago

mhm

makiligon user image
makiligon
2 years ago

Generate all phonemes and use that for sentence mixing

Autumn user image
Autumn
2 years ago

yes

Autumn user image
Autumn
2 years ago

high iq

Autumn user image
Autumn
2 years ago

it sounds like sentence mixing 😭

Autumn user image
Autumn
2 years ago

even tho it's ai

analogspiderweb user image
analogspiderweb
2 years ago

Morshu (From Link: The Faces of Evil) (RVC v2) 300 Epoch

Autumn user image
Autumn
2 years ago

Right

Autumn user image
Autumn
2 years ago

Could try

Add a comment

Samples

New
Classic
1. Singing
Male
English
2. Singing
Female
English
3. Singing (Dry)
Female
English
4. Singing (High)
Female
English
5. Singing 2
Male
English
6. Singing (Dry)
Male
English
7. Singing (Dry, High)
Male
English

Pitch

More to explore

Loading more

Selected Audio
Selected Audio