Mining mutation contexts across the cancer genome to map tumor site of origin Journal Article


Authors: Chakraborty, S.; Martin, A.; Guan, Z.; Begg, C. B.; Shen, R.
Article Title: Mining mutation contexts across the cancer genome to map tumor site of origin
Abstract: The vast preponderance of somatic mutations in a typical cancer are either extremely rare or have never been previously recorded in available databases that track somatic mutations. These constitute a hidden genome that contrasts the relatively small number of mutations that occur frequently, the properties of which have been studied in depth. Here we demonstrate that this hidden genome contains much more accurate information than common mutations for the purpose of identifying the site of origin of primary cancers in settings where this is unknown. We accomplish this using a projection-based statistical method that achieves a highly effective signal condensation, by leveraging DNA sequence and epigenetic contexts using a set of meta-features that embody the mutation contexts of rare variants throughout the genome. © 2021, The Author(s).
Keywords: genetics; mutation; neoplasm; neoplasms; dna repair; logistic models; dna; genome; tumor; statistical model; genetic database; databases, genetic; machine learning; cancer; humans; human
Journal Title: Nature Communications
Volume: 12
ISSN: 2041-1723
Publisher: Nature Publishing Group  
Date Published: 2021-05-24
Start Page: 3051
Language: English
DOI: 10.1038/s41467-021-23094-z
PUBMED: 34031376
PROVIDER: scopus
PMCID: PMC8144407
DOI/URL:
Notes: Article -- Export Date: 1 July 2021 -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Colin B Begg
    306 Begg
  2. Ronglai Shen
    204 Shen
  3. Axel Stephen Martin
    19 Martin
  4. Zoe Guan
    7 Guan