Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data
7.295
Zitationen
1
Autoren
2011
Jahr
Abstract
MOTIVATION: Most existing methods for DNA sequence analysis rely on accurate sequences or genotypes. However, in applications of the next-generation sequencing (NGS), accurate genotypes may not be easily obtained (e.g. multi-sample low-coverage sequencing or somatic mutation discovery). These applications press for the development of new methods for analyzing sequence data with uncertainty. RESULTS: We present a statistical framework for calling SNPs, discovering somatic mutations, inferring population genetical parameters and performing association tests directly based on sequencing data without explicit genotyping or linkage-based imputation. On real data, we demonstrate that our method achieves comparable accuracy to alternative methods for estimating site allele count, for inferring allele frequency spectrum and for association mapping. We also highlight the necessity of using symmetric datasets for finding somatic mutations and confirm that for discovering rare events, mismapping is frequently the leading source of errors. AVAILABILITY: http://samtools.sourceforge.net. CONTACT: hengli@broadinstitute.org.
Ähnliche Arbeiten
PLINK: A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses
2007 · 35.757 Zit.
WGCNA: an R package for weighted correlation network analysis
2008 · 28.625 Zit.
A global reference for human genetic variation
2015 · 19.703 Zit.
The variant call format and VCFtools
2011 · 17.439 Zit.
Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows
2010 · 16.476 Zit.