Description
initially i was going for 475e but then it sounded awful and the 350e version sounded a bit more stable so i went with that. it's also a test to see how the model trained on klm pretrain would handle lossy audio as dataset this character's featured in blue archive. trained on 11 minutes of audio data from both jp and global versions since both of them have different lines. (eq + spectral de-noise) CV. Tanezaki Atsumi pitch extraction: mangio-crepe / (applio) crepe - 64 hops steps: 14k batch size: 7 pretrain: klm 4 hfg / 40k hf
Comments
No comments yet. Start the conversation!
Add a comment
Samples
New
Classic
1. Singing
Male
English
2. Singing
Female
English
3. Singing (Dry)
Female
English
4. Singing (High)
Female
English
5. Singing 2
Male
English
6. Singing (Dry)
Male
English
7. Singing (Dry, High)
Male
English
Pitch
More to explore
Saiba Momoi (Blue Archive)

LordDavis778
451k
Ariana Grande AI

moonlight_18
127k
JENNIE of BLACKPINK [Strong Ver.]

leelo li
83k
Saiba Momoi (Blue Archive) (VA: Tokui Sora)

ryzusaku
76k
Hatsune Miku
293k
SpongeBob SquarePants (Talking And Singing)
Adrian Ramsey
188k
Takanashi Hoshino (from Blue Archive)

LordDavis778
149k
Satoru Gojo (JJK) [VA Yuichi Nakamura]

_phant0m
116k
ENHYPEN Heeseung

moonlight_18
100k
Sunaokami Shiroko (Blue Archive)

LordDavis778
82k
Villager (Minecraft)

AI Bakery
251k
Mortis [Brawl stars]
fergg209
155k
Jungkook (BTS)

mentosandrice
142k
Tendou Arisu (Blue Archive)

LordDavis778
93k
Kanye West
80k
Loading more