Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data

Learning from heterogeneous data poses challenges such as combining data from various sources and of different types. Meanwhile, heterogeneous data are often associated with missingness in real-world applications due to heterogeneity and noise of input sources. In this work, we propose the variational selective autoencoder (VSAE), a general framework to learn representations from partially-observed heterogeneous data. VSAE learns the latent dependencies in heterogeneous data by modeling the joint distribution of observed data, unobserved data, and the imputation mask which represents how the data are missing. It results in a unified model for various downstream tasks including data generation and imputation. Evaluation on both low-dimensional and high-dimensional heterogeneous datasets for these two tasks shows improvement over state-of-the-art models.

Bibtex

@misc{gong2021variational,
title={Variational Selective Autoencoder: Learning from Partially-Observed Heterogeneous Data},
author={Yu Gong and Hossein Hajimirsadeghi and Jiawei He and Thibaut Durand and Greg Mori},
year={2021},
eprint={2102.12679},
archivePrefix={arXiv},
primaryClass={cs.LG}
}

Related Research

Mastering Language Models: A Comprehensive Collection of Tutorials

Mastering Language Models: A Comprehensive Collection of Tutorials

S. Prince.

Generative Models

Research
Training and fine-tuning large language models

Training and fine-tuning large language models

S. Prince.

Generative Models

Research
ACL 2023 Recommended Reading List

ACL 2023 Recommended Reading List

P. Forsyth, K. Tang, and W. Zi.

Causality; Generative Models; Natural Language Processing

Research

Cookies Settings

Bibtex

Related Research