This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Ensemble machine learning on gene expression data for cancer classification.
Citations: 410
Authors: 2
Year: 2003
Abstract
Whole-genome RNA expression studies permit systematic approaches to understanding the correlation between gene expression profiles and disease states or different developmental stages of a cell. Microarray analysis provides quantitative information about the complete transcription profile of cells, which facilitates drug and therapeutics development, disease diagnosis, and the understanding of basic cell biology. One of the challenges in microarray analysis, especially for cancer gene expression profiles, is to identify genes or groups of genes that are highly expressed in tumour cells but not in normal cells, and vice versa. Previously, we have shown that ensemble machine learning consistently performs well in classifying biological data. In this paper, we focus on three supervised machine learning techniques for cancer classification: the C4.5 decision tree, bagged decision trees, and boosted decision trees. We performed classification tasks on seven publicly available cancer microarray data sets and compared the classification/prediction performance of these methods. We observed that ensemble learning (bagged and boosted decision trees) often performs better than single decision trees in this classification task.
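The comparison between a single tree and a bagged ensemble can be illustrated with a minimal sketch. It is not the authors' code or setup. It makes these assumptions: the data are synthetic (a hypothetical `make_toy_data` helper with one informative "gene"), a one-split decision stump stands in for C4.5, and only bagging is shown (boosting is omitted). All function names are illustrative.

```python
import random
from collections import Counter

def make_toy_data(n_samples=60, n_genes=20, seed=0):
    """Synthetic 'expression' matrix (assumption): gene 0 is shifted up in class 1."""
    rng = random.Random(seed)
    X, y = [], []
    for i in range(n_samples):
        label = i % 2  # alternate classes so both appear in every split
        row = [rng.gauss(0.0, 1.0) for _ in range(n_genes)]
        row[0] += 2.0 * label  # the single informative gene
        X.append(row)
        y.append(label)
    return X, y

def fit_stump(X, y):
    """One-split tree (stand-in for C4.5): choose the gene, threshold and
    polarity with the fewest training errors."""
    best = None
    for g in range(len(X[0])):
        for t in sorted({row[g] for row in X}):
            for pol in (0, 1):
                # Rule: predict `pol` if expression > t, otherwise 1 - pol.
                errors = sum((pol if row[g] > t else 1 - pol) != label
                             for row, label in zip(X, y))
                if best is None or errors < best[0]:
                    best = (errors, g, t, pol)
    _, g, t, pol = best
    return g, t, pol

def predict_stump(stump, row):
    g, t, pol = stump
    return pol if row[g] > t else 1 - pol

def fit_bagged(X, y, n_estimators=15, seed=0):
    """Bagging: fit one stump per bootstrap sample (drawn with replacement)."""
    rng = random.Random(seed)
    n = len(X)
    stumps = []
    for _ in range(n_estimators):
        idx = [rng.randrange(n) for _ in range(n)]
        stumps.append(fit_stump([X[i] for i in idx], [y[i] for i in idx]))
    return stumps

def predict_bagged(stumps, row):
    """Majority vote over the ensemble (odd size, so no ties for two classes)."""
    votes = Counter(predict_stump(s, row) for s in stumps)
    return votes.most_common(1)[0][0]

def accuracy(predict, model, X, y):
    return sum(predict(model, row) == label for row, label in zip(X, y)) / len(y)

X, y = make_toy_data()
X_train, y_train = X[:40], y[:40]
X_test, y_test = X[40:], y[40:]

single = fit_stump(X_train, y_train)
bagged = fit_bagged(X_train, y_train)
acc_single = accuracy(predict_stump, single, X_test, y_test)
acc_bagged = accuracy(predict_bagged, bagged, X_test, y_test)
print(f"single stump: {acc_single:.2f}, bagged stumps: {acc_bagged:.2f}")
```

On such a small toy problem the two accuracies may well coincide. The sketch only shows the mechanics (bootstrap resampling plus majority voting), not the paper's empirical result.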
Similar works
Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2−ΔΔCT Method
2001 · 179,507 citations
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
2005 · 55,861 citations
edgeR: a Bioconductor package for differential expression analysis of digital gene expression data
2009 · 43,987 citations
limma powers differential expression analyses for RNA-sequencing and microarray studies
2015 · 42,220 citations
clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters
2012 · 37,365 citations