VI-SVS icon indicating copy to clipboard operation
VI-SVS copied to clipboard

Singing Voice Synthesis based on VITS, different from VISinger

Init

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger, It is just VITS without MAS and DurationPredictor.

本项目基于

https://github.com/jaywalnut310/vits

https://github.com/MoonInTheRiver/DiffSinger

https://wenet.org.cn/opencpop/

采样率转换

使用16K节约内存,方便模型修改

数据预处理

cd VISinger/

export PYTHONPATH=.

python prepare/data_vits.py

输出

1,生成文件../VISinger_data/label_vits/XXX._label.npy|XXX_score.npy|XXX_pitch.npy|XXX_slurs.npy

2,生成文件filelists/vits_file.txt; 内容格式:wave path|label path|score path|pitch path|slurs path;

训练集随机打乱

python prepare/preprocess.py

VITS训练

python train.py -c configs/singing_base.json -m singing_base

测试验证

1,训练集生成验证:F0根据音频提取

python vsinging_debug.py

2,推理验证:F0根据规则生成

python vsinging_infer.py

3,完整歌曲合成(使用release模型

pyton vsinging_song.py

4,F0的问题可以额外训练F0预测器,或者使用UTAU绘制pit曲线

LOSS值 MEL谱

样例音频

vits_singing_样例.wav