Heterogeneous Multi-task Learning with Expert Diversity

Predicting multiple heterogeneous biological and medical targets is a challenge for traditional deep learning models. In contrast to single-task learning, in which a separate model is trained for each target, multi-task learning (MTL) optimizes a single model to predict multiple related targets simultaneously. To address this challenge, we propose the Multi-gate Mixture-of-Experts with Exclusivity (MMoEEx). Our work aims to tackle the heterogeneous MTL setting, in which the same model optimizes multiple tasks with different characteristics. Such a scenario can overwhelm current MTL approaches due to the challenges in balancing shared and task-specific representations and the need to optimize tasks with competing optimization paths. Our method makes two key contributions: first, we introduce an approach to induce more diversity among experts, thus creating representations more suitable for highly imbalanced and heterogenous MTL learning; second, we adopt a two-step optimization [6, 11] approach to balancing the tasks at the gradient level. We validate our method on three MTL benchmark datasets, including Medical Information Mart for Intensive Care (MIMIC-III) and PubChem BioAssay (PCBA).

Bibtex

@article{DBLP:journals/corr/abs-2106-10595,
author  = {Raquel Aoki and
        Frederick Tung and
        Gabriel L. Oliveira},
title   = {Heterogeneous Multi-task Learning with Expert Diversity},
journal = {CoRR},
volume  = {abs/2106.10595},
year   = {2021},
url    = {https://arxiv.org/abs/2106.10595},
archivePrefix = {arXiv},
eprint  = {2106.10595},
timestamp = {Tue, 29 Jun 2021 16:55:04 +0200},
biburl  = {https://dblp.org/rec/journals/corr/abs-2106-10595.bib},
bibsource = {dblp computer science bibliography, https://dblp.org}
}

Related Research

A High-level Overview of Large Language Models

A High-level Overview of Large Language Models

W. Zi, L. El Asri, and S. Prince.

Learning And Generalization; Natural Language Processing

Research
DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning

DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning

E. Rahimian, G. Javadi, F. Tung, and G. Oliveira. Workshop at The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR)

Computer Vision; Multi-task Learning

Publications
Borealis AI at International Conference on Learning Representations (ICLR): Machine Learning for a better financial future

Borealis AI at International Conference on Learning Representations (ICLR): Machine Learning for a better financial future

Learning And Generalization; Natural Language Processing; Time series Modelling

Research

Cookies Settings

Bibtex

Related Research