VI-SVS
VI-SVS copied to clipboard
Singing Voice Synthesis based on VITS, different from VISinger
Init
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger, It is just VITS without MAS and DurationPredictor.
本项目基于
https://github.com/jaywalnut310/vits
https://github.com/MoonInTheRiver/DiffSinger
https://wenet.org.cn/opencpop/
采样率转换
使用16K节约内存,方便模型修改
数据预处理
cd VISinger/
export PYTHONPATH=.
python prepare/data_vits.py
输出
1,生成文件../VISinger_data/label_vits/XXX._label.npy|XXX_score.npy|XXX_pitch.npy|XXX_slurs.npy
2,生成文件filelists/vits_file.txt; 内容格式:wave path|label path|score path|pitch path|slurs path;
训练集随机打乱
python prepare/preprocess.py
VITS训练
python train.py -c configs/singing_base.json -m singing_base
测试验证
1,训练集生成验证:F0根据音频提取
python vsinging_debug.py
2,推理验证:F0根据规则生成
python vsinging_infer.py
3,完整歌曲合成(使用release模型)
pyton vsinging_song.py
4,F0的问题可以额外训练F0预测器,或者使用UTAU绘制pit曲线

样例音频
vits_singing_样例.wav