Next-generation sequencing technologies provide a powerful tool for studying genome evolution during progression of advanced diseases such as cancer. Although many recent studies have employed new sequencing technologies to detect mutations across multiple, genetically related tumors, current methods do not exploit available phylogenetic information to improve the accuracy of their variant calls. Here, we present a novel algorithm that uses somatic single nucleotide variations (SNVs) in multiple, related tissue samples as lineage markers for phylogenetic tree reconstruction. Our method then leverages the inferred phylogeny to improve the accuracy of SNV discovery. Experimental analyses demonstrate that our method achieves up to 32% improvement for somatic SNV calling of multiple related samples over the accuracy of GATK's Unified Genotyper, the state of the art multisample SNV caller. © 2013 Springer-Verlag.
|Original language||English (US)|
|Title of host publication||Research in Computational Molecular Biology|
|Publisher||Springer Science + Business Media|
|Number of pages||15|
|State||Published - 2013|
Bibliographical noteKAUST Repository Item: Exported on 2020-10-01
Acknowledgements: RS was supported by NSERC postdoctoral fellowship (PDF). DKH was supported by a STMicroelectronics Stanford Graduate Fellowship. SS and DK were supported by Stanford CURIS program. DEN was supported by training grant from NIH/NLM and a Bio-X Stanford Interdisciplinary Graduate Fellowship. This work was funded by a grant from KAUST to SB, and the Sequencing Initiative of the Stanford Department of Pathology to RW and AS.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.