This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
MUSCLE: multiple sequence alignment with high accuracy and high throughput
Citations: 45,857
Authors: 1
Year: 2004
Abstract
We describe MUSCLE, a new computer program for creating multiple alignments of protein sequences. Elements of the algorithm include fast distance estimation using kmer counting, progressive alignment using a new profile function we call the log-expectation score, and refinement using tree-dependent restricted partitioning. The speed and accuracy of MUSCLE are compared with T-Coffee, MAFFT and CLUSTALW on four test sets of reference alignments: BAliBASE, SABmark, SMART and a new benchmark, PREFAB. MUSCLE achieves the highest, or joint highest, rank in accuracy on each of these sets. Without refinement, MUSCLE achieves average accuracy statistically indistinguishable from T-Coffee and MAFFT, and is the fastest of the tested methods for large numbers of sequences, aligning 5000 sequences of average length 350 in 7 min on a current desktop computer. The MUSCLE program, source code and PREFAB test data are freely available at http://www.drive5.com/muscle.
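To make "distance estimation using kmer counting" concrete, here is a minimal Python sketch of one common k-mer distance: the fraction of k-mers two sequences share, subtracted from 1. The function names, the default `k = 3`, and the normalization by the shorter sequence are assumptions chosen for illustration. The abstract does not specify the formula, and this is not presented as MUSCLE's actual implementation.

```python
from collections import Counter


def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping k-mer in seq (hypothetical helper)."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def kmer_distance(a: str, b: str, k: int = 3) -> float:
    """Return 1 - (shared k-mers / number of k-mers in the shorter sequence).

    Assumed formulation for illustration only; no alignment is computed.
    """
    if min(len(a), len(b)) < k:
        raise ValueError("both sequences must have at least k residues")
    counts_a, counts_b = kmer_counts(a, k), kmer_counts(b, k)
    # Counter & Counter keeps the minimum count of each shared k-mer.
    shared = sum((counts_a & counts_b).values())
    return 1.0 - shared / (min(len(a), len(b)) - k + 1)
```

Because no alignment is needed, each pair costs time roughly linear in sequence length. Pairwise distances of this kind are typically used to build the guide tree that drives progressive alignment.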
Similar works
Cleavage of Structural Proteins during the Assembly of the Head of Bacteriophage T4
1970 · 251,175 citations
Basic local alignment search tool
1990 · 93,395 citations
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
1997 · 74,078 citations
Fiji: an open-source platform for biological-image analysis
2012 · 68,453 citations
Trimmomatic: a flexible trimmer for Illumina sequence data
2014 · 67,574 citations