✨ Supervoice GPT

A GPT model that converts text into phonemes with durations, suitable as input to a voice synthesizer.

How it works

The model converts raw text into phonemes and their durations, compatible with Montreal Forced Aligner.

For example, it converts a string like "Hey, Vera, what time is it?" into a list of tuples, each holding a phoneme and its duration: [('ç', 9), ('iː', 7), ('v', 7), ('ɛ', 8), ('ɹ', 8), ('i', 7), ('w', 6), ('ɐ', 5), ('ʔ', 3), ('tʰ', 8), ('aj', 11), ('m', 7), ('ɪ', 6), ('z', 7), ('ɪ', 6), ('ʔ', 8)]
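
A minimal sketch of how a downstream consumer might handle this output. The helper names `total_duration` and `expand_to_frames` are hypothetical and not part of this repository, and the sketch assumes each duration is a count of fixed-size frames, which the README does not state.

```python
# Output copied from the example above for "Hey, Vera, what time is it?"
phonemes = [
    ('ç', 9), ('iː', 7), ('v', 7), ('ɛ', 8), ('ɹ', 8), ('i', 7),
    ('w', 6), ('ɐ', 5), ('ʔ', 3), ('tʰ', 8), ('aj', 11), ('m', 7),
    ('ɪ', 6), ('z', 7), ('ɪ', 6), ('ʔ', 8),
]


def total_duration(pairs):
    """Sum the durations of all (phoneme, duration) tuples."""
    return sum(duration for _, duration in pairs)


def expand_to_frames(pairs):
    """Repeat each phoneme once per duration unit.

    Assumption: one duration unit corresponds to one synthesizer frame.
    """
    frames = []
    for phoneme, duration in pairs:
        frames.extend([phoneme] * duration)
    return frames


print(total_duration(phonemes))         # 113
print(len(expand_to_frames(phonemes)))  # 113
```

The expanded list has one entry per duration unit, which is one common way to present aligned phonemes to a synthesizer that expects frame-level input.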

Dataset

This module requires extensive dataset preparation. To prepare all the needed data, run the following commands in order:

datasets sync to download datasets
python ./datasets_prepare.py to preprocess audio files and extract texts from datasets
./datasets_align.sh to generate alignments
python ./datasets_mix.py to mix all data together
python ./train_tokenizer.py to train tokenizer on alignments
python ./datasets_tokenize.py to tokenize datasets
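
The steps above can be chained in a small driver script. This is only a sketch: `run_pipeline` and `STEPS` are hypothetical names that do not exist in the repository, and it assumes every command is run from the repository root and that the `datasets` CLI is already installed and on the PATH.

```python
import shlex
import subprocess

# Commands listed above, in the order given.
STEPS = [
    "datasets sync",                  # download datasets
    "python ./datasets_prepare.py",   # preprocess audio, extract texts
    "./datasets_align.sh",            # generate alignments
    "python ./datasets_mix.py",       # mix all data together
    "python ./train_tokenizer.py",    # train tokenizer on alignments
    "python ./datasets_tokenize.py",  # tokenize datasets
]


def run_pipeline(steps=STEPS, runner=subprocess.run):
    """Run each preparation step in order, stopping at the first failure.

    `runner` is injectable so the sequence can be dry-run or tested
    without executing anything.
    """
    for step in steps:
        print(f"Running: {step}")
        # check=True makes subprocess.run raise CalledProcessError
        # on a non-zero exit code, so later steps are not run.
        runner(shlex.split(step), check=True)
```

Calling `run_pipeline()` from the repository root would execute all six steps. Stopping at the first failure matters here because each step consumes the output of the previous one.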

Training

To train the network, run:

./train.sh

License

MIT
