Skip to content

Conversation

@KakaruHayate
Copy link

9c10dc0b6ad09559fb3418c1a09aec1d

@yxlllc yxlllc merged commit ffc686b into openvpi:lynxnet Dec 13, 2024
yxlllc added a commit that referenced this pull request Jan 16, 2025
* Change the injection method of conditions on lynxnet (#225)

* update configurations for new-lynxnet

* update configurations for new-lynxnet

* update configurations for new-lynxnet

---------

Co-authored-by: KakaruHayate <97896816+KakaruHayate@users.noreply.github.com>
yxlllc added a commit that referenced this pull request Jan 16, 2025
* Change the injection method of conditions on lynxnet (#225)

* update configurations for new-lynxnet

* update configurations for new-lynxnet

* update configurations for new-lynxnet

---------

Co-authored-by: KakaruHayate <97896816+KakaruHayate@users.noreply.github.com>
@KakaruHayate KakaruHayate deleted the lynxnet-s2 branch January 17, 2025 09:03
yqzhishen added a commit that referenced this pull request Mar 29, 2025
* Add multi-dictionary preprocessing and training

* Fix lang_map.json copy

* Add language embed (inject to txt_embed) for acoustic models

* Save language sequence in variance preprocessing

* Display merged phoneme groups properly in distribution plots

* Add multi-dictionary inference

* Save original phoneme texts for duration plots

* Fix duration plots displaying bug

* Explicit `languages` argument passing

* Add language embed (inject to txt_embed) for variance models

* Fix argument passing

* Add log for lang_map.json copy

* Add language embedding scale

* Add language embedding type

* Preprocessing: only apply lang embed on cross-lingual phonemes

* Inference: only apply lang embed on cross-lingual phonemes

* Revert "Add language embedding type"

This reverts commit 655e9ba.

* Revert lang_embed_scale

* Adapt ONNX exporters for multi-language models

* Refactor configuration schemas for datasets

* Add check of existence for merged phonemes

* Fix spk_id assignment

* Fix languages.json filename

* Fix `languages` key in dsconfig.yaml

* Set `use_lang_id` to false if there are no cross-lingual phonemes

* Support defining extra phonemes

* Refactor configs

* Prefer file copies in work_dir when loading dictionaries

* Fix cannot locate dictionary

* Fix unexpected loading error when dictionary changes

* Update toplevel.py (#219)

* Fix unexpected config passing

* Update lynxnet backbone (#228)

* Change the injection method of conditions on lynxnet (#225)

* update configurations for new-lynxnet

* update configurations for new-lynxnet

* update configurations for new-lynxnet

---------

Co-authored-by: KakaruHayate <97896816+KakaruHayate@users.noreply.github.com>

* Improve fastspeech2 encoder using Rotary Position Embedding (RoPE) in multi-head self-attention (#234)

* update multi-head self attention with RoPE

* RoPE onnx (#230)

* fix requirements.txt (#233)

* fix rope for melody encoder

* support swiglu activation for ffn

* update dependencies

---------

Co-authored-by: KakaruHayate <97896816+KakaruHayate@users.noreply.github.com>

* support mini-nsf-hifigan vocoder

* discard negative pad

* fix MHA inference using low torch version

* Fix missing phoneme list sorting

* Fix single-language dictionary parsing language tag

* Add `pitch_controllable` flag to vocoder exporter

(cherry picked from commit a6deb6b)

* support noise injection

* Allow merging global phonemes and language-specific phonemes

* Check for conflicts between short names and global tags

* Finish documentation for multi-dictionary

---------

Co-authored-by: Anjo <87346264+AnAndroNerd@users.noreply.github.com>
Co-authored-by: yxlllc <33565655+yxlllc@users.noreply.github.com>
Co-authored-by: KakaruHayate <97896816+KakaruHayate@users.noreply.github.com>
Co-authored-by: yxlllc <llc1995@sina.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants