Journal article

Improving generative modelling in VAEs using Multimodal Prior

Abstract:: In this paper we propose a conditional generative modelling (CGM) approach for unsupervised disentangled representation learning using variational autoencoder (VAE). CGM employs a multimodal/categorical conditional prior distribution in the latent space to learn global uncertainty in data by modelling the variations at local level. Thus, the proposed framework enforces the model to independently estimate the inherent patterns within each category, which improves the interpretability of the latent representations learned by the VAE model. The evidence lower bound objective for training the generative model is maximized using a mutual information criterion between the global latent categorical variable and the encoded inputs. Further, the approach has a built-in mechanism for bounding the information flow between the encoder and the decoder which addresses the problems of posterior collapse in conventional VAE models. Experiments on a variety of datasets demonstrate that our objective can learn disentangled representations and the proposed approach achieves competitive results on various task such as generative modelling, image classification and image denoising.

Publication status:: Accepted

Peer review status:: Peer reviewed

Actions

Email

Email this record

Send the bibliographic details of this record to your email address.

Your Email
Please enter the email address that the record information will be sent to.

-
Your message (optional)
Please add any additional information to be included within the email.
Cite

Cite this record

APA Style

Abrol, V., Sharma, P., & Patra, A. (2020). Improving generative modelling in VAEs using Multimodal Prior. IEEE Transactions on Multimedia, 23, 2153–2161.

MLA Style

Abrol, V., et al. “Improving Generative Modelling in VAEs Using Multimodal Prior.” IEEE Transactions on Multimedia, vol. 23, Institute of Electrical and Electronics Engineers, 2020, pp. 2153–61.

Chicago Style

Abrol, V, P Sharma, and A Patra. 2020. “Improving Generative Modelling in VAEs Using Multimodal Prior.” IEEE Transactions on Multimedia 23: 2153–61.
Share
Print

Access Document

Files:: AbroletalAAM2020.pdf

(Preview, Accepted manuscript, 1.8MB, Terms of use)

Publisher copy:: 10.1109/TMM.2020.3008053

Authors

+ Abrol, V More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Mathematical Institute
Sub department:: Mathematical Institute
Role:: Author
ORCID:: 0000-0001-8149-8151

+ Sharma, P More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Sub department:: Engineering Science
Role:: Author

+ Patra, A More by this author

Institution:: University of Oxford
Division:: MPLS
Department:: Engineering Science
Sub department:: Engineering Science
Role:: Author

Publisher:: Institute of Electrical and Electronics Engineers
Journal:: IEEE Transactions on Multimedia More from this journal
Volume:: 23
Pages:: 2153 - 2161
Publication date:: 2020-07-08
Acceptance date:: 2020-06-23
DOI:: 10.1109/TMM.2020.3008053
EISSN:: 1941-0077
ISSN:: 1520-9210

Language:: English
Keywords:: FFR
Pubs id:: 1116905
Local pid:: pubs:1116905
Deposit date:: 2020-07-17
ARK identifier:: ark:/29072/ora_d4a7306e6d5c4e8495254363723328f8

Terms of use

Copyright holder:: IEEE
Copyright date:: 2020
Rights statement:: © IEEE 2020
Notes:: This is the accepted manuscript version of the article. The final version is available online from IEEE at: https://doi.org/10.1109/TMM.2020.3008053

Licence:: Terms and Conditions of Use for Oxford University Research Archive

Views and Downloads

About views and downloads

If you are the owner of this record, you can report an update to it here: Report update to this record

TO TOP