Learning from heterogeneous data poses challenges such as combining data from various sources and of different types. In real-world applications, heterogeneous data are also often partially missing because of the heterogeneity and noise of the input sources. In this work, we propose the variational selective autoencoder (VSAE), a general framework for learning representations from partially-observed heterogeneous data. VSAE learns the latent dependencies in heterogeneous data by modeling the joint distribution of the observed data, the unobserved data, and the imputation mask, which represents how the data are missing. The result is a unified model for various downstream tasks, including data generation and imputation. Evaluation on both low-dimensional and high-dimensional heterogeneous datasets for these two tasks shows improvement over state-of-the-art models.
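To make the three quantities in the abstract concrete, the following is a minimal sketch of what a partially-observed heterogeneous record and its mask might look like. It is an illustration only, not the VSAE model or the authors' code: the attribute names, the `record` and `mask` variables, the `split_by_mask` helper, and the convention that 1 means "observed" are all assumptions made for this example.

```python
# Illustrative toy example; all names here are hypothetical, not from the paper.
# One record with mixed attribute types (numeric and categorical).
# None marks a value that was not observed.
record = {"age": 42.0, "blood_type": "A", "heart_rate": None}

# Assumed convention: mask[k] == 1 means attribute k is observed, 0 means missing.
mask = {name: int(value is not None) for name, value in record.items()}


def split_by_mask(record, mask):
    """Partition a record into observed values and names of unobserved attributes."""
    observed = {k: v for k, v in record.items() if mask[k] == 1}
    unobserved = [k for k in record if mask[k] == 0]
    return observed, unobserved


observed, unobserved = split_by_mask(record, mask)
```

A model of the kind described above would treat `observed`, the values behind `unobserved`, and `mask` as jointly distributed; this sketch only shows the data layout, not how that distribution is learned.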
BibTeX
@misc{gong2021variational,
title={Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data},
author={Yu Gong and Hossein Hajimirsadeghi and Jiawei He and Thibaut Durand and Greg Mori},
year={2021},
eprint={2102.12679},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
Related Research
- Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation
  K. Arora, L. El Asri, H. Bahuleyan, and J. Chi Kit Cheung. Association for Computational Linguistics (ACL)
- Polarized-VAE: Proximity Based Disentangled Representation Learning for Text Generation
  V. Balasubramanian, I. Kobyzev, H. Bahuleyan, and O. Vechtomova. Conference of the European Chapter of the Association for Computational Linguistics (EACL)
- Wavelet Flow: Fast Training of High Resolution Normalizing Flows
  J. Yu, K. Derpanis, and M. Brubaker. Conference on Neural Information Processing Systems (NeurIPS)