Predicting cancer risk from germline whole-exome sequencing data using a novel context-based variant aggregation approach Journal Article


Authors: Guan, Z.; Begg, C. B.; Shen, R.
Article Title: Predicting cancer risk from germline whole-exome sequencing data using a novel context-based variant aggregation approach
Abstract: Many studies have shown that the distributions of the genomic, nucleotide, and epigenetic contexts of somatic variants in tumors are informative of cancer etiology. Recently, a new direction of research has focused on ex-tracting signals from the contexts of germline variants and evidence has emerged that patterns defined by these factors are associated with onco-genic pathways, histologic subtypes, and prognosis. It remains an open question whether aggregating germline variants using meta-features cap-turing their genomic, nucleotide, and epigenetic contexts can improve cancer risk prediction. This aggregation approach can potentially increase statistical power for detecting signals from rare variants, which have been hypothesized to be a major source of the missing heritability of can-cer. Using germline whole-exome sequencing data from the UK Biobank, we developed risk models for 10 cancer types using known risk vari-ants (cancer-associated SNPs and pathogenic variants in known cancer predisposition genes) as well as models that additionally include the meta-features. The meta-features did not improve the prediction accuracy of models based on known risk variants. It is possible that expanding the approach to whole-genome sequencing can lead to gains in prediction accuracy.Significance: There is evidence that cancer is partly caused by rare genetic variants that have not yet been identified. We investigate this issue using novel statistical methods and data from the UK Biobank.
Keywords: association; missing heritability; mutational signatures
Journal Title: Cancer Research Communications
Volume: 3
Issue: 3
ISSN: 2767-9764
Publisher: American Association for Cancer Research  
Date Published: 2023-03-01
Start Page: 483
End Page: 488
Language: English
ACCESSION: WOS:001031551700001
DOI: 10.1158/2767-9764.Crc-22-0355
PROVIDER: wos
PMCID: PMC10032232
PUBMED: 36969913
Notes: Article -- Source: Wos
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Colin B Begg
    307 Begg
  2. Ronglai Shen
    207 Shen
  3. Zoe Guan
    7 Guan