Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Improved Random Forest for Classification
442
Zitationen
6
Autoren
2018
Jahr
Abstract
We propose an improved random forest classifier that performs classification with minimum number of trees. The proposed method iteratively removes some unimportant features. Based on the number of important and unimportant features, we formulate a novel theoretical upper limit on the number of trees to be added to the forest to ensure improvement in classification accuracy. Our algorithm converges with a reduced but important set of features. We prove that further addition of trees or further reduction of features does not improve classification performance. The efficacy of the proposed approach is demonstrated through experiments on benchmark datasets. We further use the proposed classifier to detect mitotic nuclei in the histopathological datasets of breast tissues. We also apply our method on the industrial dataset of dual phase steel microstructures to classify different phases. Results of our method on different datasets show significant reduction in average classification error compared to a number of competing methods.
Ähnliche Arbeiten
Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2−ΔΔCT Method
2001 · 179.656 Zit.
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
2005 · 55.912 Zit.
<tt>edgeR</tt> : a Bioconductor package for differential expression analysis of digital gene expression data
2009 · 44.029 Zit.
limma powers differential expression analyses for RNA-sequencing and microarray studies
2015 · 42.262 Zit.
clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters
2012 · 37.426 Zit.