This repo contains FineMotions's solution to the GENEA 2022 Challange
Our submission as well as some renders of our experiments could be found here
The whole repo contains code of the models along with various scripts:
process_motion.py- extracts motion features from bvh-data.pymorequired.process_audio.py- extracts features from audioprocess_text.py- generetes one-hot encoddings for symbols of text transcriptsnormalize.py- normalize motion features and store mean and max poses to npz filealign_data.py- aligns motion, audio and text features to create dataset for modelstrain.py- train one of the model by it's name:wav2gest,recell,recellseq,feedforward,seq2seq,lstminfer.py- inference model, smooth, denormalize results and generate bvh
module src contains code of the models and some utils:
basecontains baseDataModuleto operate with various datafeedforward- simple model generates motion by frame from window of features. Based on Kucherenko et al. 2019lstm- predicts sequence of poses from aligned sequence of features via simple rnnrecell- auto-regressive ReCell model, contains two systems and datasets for one-frame generation and for short sequences which allows teacher-forcing and zeroing techniquesseq2seq- sequence-to-sequence model from https://github.com/FineMotion/GENEA_2020seqae- unfinished autoencoder for sequences, has not been usedwav2gest- modification ofseq2seqallowing different lengths for input and output sequencesauto-encoder- windowed auto-encoder