About this item:

118 Views | 92 Downloads

Author Notes:

maywang@gatech.edu; ltong9@gatech.edu

Contributed equally: LM and KC

The authors would like to thank Mr. Hang Wu for his kind suggestions on the experiment design and analysis.

L.T. and M.D.W conceived of and organized the study. L.T. developed the theory and performed the experiments. L.T., J.M, K.C., and M.D.W contributed to the analysis. L.T., J.M, K.C., and M.D.W wrote the manuscript and made the figures. All authors discussed the results and contributed to the final manuscript.

The authors declare that they have no competing interests.

Subjects:

Research Funding:

The work was supported in part by grants from the National Institute of Health (NIH) under Award R01CA163256, Giglio Breast Cancer Research Fund, Petit Institute Faculty Fellow and Carol Ann and David D. Flanagan Faculty Fellow Research Fund, and Georgia Cancer Coalition Distinguished Cancer Scholar award. This work was also supported in part by the scholarship from China Scholarship Council (CSC) under the Grant CSC NO. 201406010343. The content of this article is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Keywords:

  • Science & Technology
  • Life Sciences & Biomedicine
  • Medical Informatics
  • Multi-omics integration
  • Breast Cancer
  • Survival analysis
  • Deep learning

Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis

Tools:

Journal Title:

BMC MEDICAL INFORMATICS AND DECISION MAKING

Volume:

Volume 20, Number 1

Publisher:

, Pages 225-225

Type of Work:

Article | Final Publisher PDF

Abstract:

Background: Breast cancer is the most prevalent and among the most deadly cancers in females. Patients with breast cancer have highly variable survival lengths, indicating a need to identify prognostic biomarkers for personalized diagnosis and treatment. With the development of new technologies such as next-generation sequencing, multi-omics information are becoming available for a more thorough evaluation of a patient's condition. In this study, we aim to improve breast cancer overall survival prediction by integrating multi-omics data (e.g., gene expression, DNA methylation, miRNA expression, and copy number variations (CNVs)). Methods: Motivated by multi-view learning, we propose a novel strategy to integrate multi-omics data for breast cancer survival prediction by applying complementary and consensus principles. The complementary principle assumes each -omics data contains modality-unique information. To preserve such information, we develop a concatenation autoencoder (ConcatAE) that concatenates the hidden features learned from each modality for integration. The consensus principle assumes that the disagreements among modalities upper bound the model errors. To get rid of the noises or discrepancies among modalities, we develop a cross-modality autoencoder (CrossAE) to maximize the agreement among modalities to achieve a modality-invariant representation. We first validate the effectiveness of our proposed models on the MNIST simulated data. We then apply these models to the TCCA breast cancer multi-omics data for overall survival prediction. Results: For breast cancer overall survival prediction, the integration of DNA methylation and miRNA expression achieves the best overall performance of 0.641 ± 0.031 with ConcatAE, and 0.63 ± 0.081 with CrossAE. Both strategies outperform baseline single-modality models using only DNA methylation (0.583 ± 0.058) or miRNA expression (0.616 ± 0.057). Conclusions: In conclusion, we achieve improved overall survival prediction performance by utilizing either the complementary or consensus information among multi-omics data. The proposed ConcatAE and CrossAE models can inspire future deep representation-based multi-omics integration techniques. We believe these novel multi-omics integration models can benefit the personalized diagnosis and treatment of breast cancer patients.

Copyright information:

© The Author(s) 2020

This is an Open Access work distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
Export to EndNote