Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data Journal Article


Authors: Hao, J.; Kim, Y.; Mallavarapu, T.; Oh, J. H.; Kang, M.
Article Title: Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data
Abstract: Background: Understanding the complex biological mechanisms of cancer patient survival using genomic and clinical data is vital, not only to develop new treatments for patients, but also to improve survival prediction. However, highly nonlinear and high-dimension, low-sample size (HDLSS) data cause computational challenges to applying conventional survival analysis. Results: We propose a novel biologically interpretable pathway-based sparse deep neural network, named Cox-PASNet, which integrates high-dimensional gene expression data and clinical data on a simple neural network architecture for survival analysis. Cox-PASNet is biologically interpretable where nodes in the neural network correspond to biological genes and pathways, while capturing the nonlinear and hierarchical effects of biological pathways associated with cancer patient survival. We also propose a heuristic optimization solution to train Cox-PASNet with HDLSS data. Cox-PASNet was intensively evaluated by comparing the predictive performance of current state-of-the-art methods on glioblastoma multiforme (GBM) and ovarian serous cystadenocarcinoma (OV) cancer. In the experiments, Cox-PASNet showed out-performance, compared to the benchmarking methods. Moreover, the neural network architecture of Cox-PASNet was biologically interpreted, and several significant prognostic factors of genes and biological pathways were identified. Conclusions: Cox-PASNet models biological mechanisms in the neural network by incorporating biological pathway databases and sparse coding. The neural network of Cox-PASNet can identify nonlinear and hierarchical associations of genomic and clinical data to cancer patient survival. The open-source code of Cox-PASNet in PyTorch implemented for training, evaluation, and model interpretation is available at: https://github.com/DataX-JieHao/Cox-PASNet. © 2019 The Author(s).
Keywords: cancer survival; survival analysis; survival rate; cancer growth; ovarian cancer; ovary adenocarcinoma; gene expression; risk factor; glioblastoma; genomics; glioblastoma multiforme; cancer prognosis; human; priority journal; article; patient risk; deep neural network; cox-pasnet; ovarian serous cystadenocarcinoma
Journal Title: BMC Medical Genomics
Volume: 12
Issue: Suppl. 10
ISSN: 1755-8794
Publisher: Biomed Central Ltd  
Date Published: 2019-12-23
Start Page: 189
Language: English
DOI: 10.1186/s12920-019-0624-2
PUBMED: 31865908
PROVIDER: scopus
PMCID: PMC6927105
DOI/URL:
Notes: This paper is an updated and peer-reviewed follow-up to the conference paper presented at the IEEE BIBM International Conference on Bioinformatics & Biomedicine 2018: Medical Genomics (2020 Dec 3-6, Madrid, Spain) -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Jung Hun Oh
    187 Oh