Blaze (GPT-SoVITS)

Blaze (GPT-SoVITS)

⚠️
EnglishGPT Sovits
Kimid user image
Kimid
1 year ago
👀

65

👍

5

Description

For: @Cyanicide Dataset by: @Cyanicide Model

Comments

Kimid user image
Kimid
1 year ago
Kimid user image
Kimid
1 year ago

Transcript: My flame can penetrate anything! Let's go to the store. You have to use my power to stop it before it's fired, or use your power to grab it.

Kimid user image
Kimid
1 year ago

Blaze (Sonic 06) (GPT-SoVITS) (20 Epochs)

Th4t-Ai-Guy user image
Th4t-Ai-Guy
1 year ago

Bro @Cyanicide. You gotta stop beggin kimid for models everyday homie.

Surble Chev user image
Surble Chev
1 year ago

This is honestly so friggin good. Is there some tutorial I can follow to make a model like this? I've never tried GPT-SoVITS

Kimid user image
Kimid
1 year ago

Its pretty easy once you get the hang of it

Kimid user image
Kimid
1 year ago
Surble Chev user image
Surble Chev
1 year ago

Sweet So I'm seeing one for training & the other for inference. Which one is for what? & is there some instruction thing I could follow, like a video?

Kimid user image
Kimid
1 year ago

Training one is just for training and the inference is the one you have to use for inference

Surble Chev user image
Surble Chev
1 year ago

"Inference" as in?

Kimid user image
Kimid
1 year ago

You make a folder in the colab folder thing on the training one and upload your audio and copy the path and paste it into the thing

Kimid user image
Kimid
1 year ago

Using the model

Surble Chev user image
Surble Chev
1 year ago

I see; thanks for that

Kimid user image
Kimid
1 year ago

If you want to use this model you just copy the huggingface link and put it in the think where it says to paste the link into that

Kimid user image
Kimid
1 year ago

Assuming its already installed

Kimid user image
Kimid
1 year ago

And then you have to download the reference audio and put it into the files in colab and copy the path and transcript and put those into their respective boxes

Surble Chev user image
Surble Chev
1 year ago

So is there a way to use the model off of colab (maybe on a mac)? I know there's usually a 2-3 hr time limit on those links, for GPU usage

Kimid user image
Kimid
1 year ago

I think so but you would have to use docker

Surble Chev user image
Surble Chev
1 year ago

docker?

Kimid user image
Kimid
1 year ago

Yeah whatever that is

Kimid user image
Kimid
1 year ago

It just needs some commands I think

Kimid user image
Kimid
1 year ago

Or you could try running the .sh file in the GPT SoVITS folder

Surble Chev user image
Surble Chev
1 year ago

Where would I get the reference audio, like for this model?

Kimid user image
Kimid
1 year ago

Here

Surble Chev user image
Surble Chev
1 year ago

It's just 9 seconds? Don't models usually need like 10 min worth to sound alright?

Kimid user image
Kimid
1 year ago

Well its a TTS model thats been fine tuned so all it needs is that

Surble Chev user image
Surble Chev
1 year ago

Cool And would I have to change around any of the pre-set numbers on the colab links? (GPT settings, etc)

Kimid user image
Kimid
1 year ago

Not really it usually works with the normal settings for temperature and other ones

Surble Chev user image
Surble Chev
1 year ago

Ok, so does it matter which folder the reference audio gets uploaded onto? Or just stick it on a folder & copy-paste the path?

Kimid user image
Kimid
1 year ago

You can just upload it into the files but you dont need to put the reference in any folder just copy the path of the audio file and paste it

Surble Chev user image
Surble Chev
1 year ago

Understood Also, I'd prefer using flac files, but does training/reference audio/whatever other stuff require mp3 format?

Kimid user image
Kimid
1 year ago

It can be MP3 or WAV or FLAC

Add a comment

More to explore

Loading more

Selected Audio
Selected Audio