Description
don't mind the image tho lmao

trained on 16 minutes of her speaking (and maybe singing?) from her rabbit hole recording stream (the stream was public, but the vod is members only). audio cleaned with rx11 (denoise + de-click + de-plosive).

might sound gnarly in her high range bc the dataset has a 3 second clip that makes high notes sound gnarly in the model. will possibly retrain it once i've removed that clip.

pitch extraction: rmvpe
steps: 15.9k
batch size: 7
pretrain: klm 4.1 / 32k
huggingface: