Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Integrating Feature Selection and Feature Extraction Methods With Deep Learning to Predict Clinical Outcome of Breast Cancer
164
Zitationen
4
Autoren
2018
Jahr
Abstract
In many microarray studies, classifiers have been constructed based on gene signatures to predict clinical outcomes for various cancer sufferers. However, signatures originating from different studies often suffer from poor robustness when used in the classification of data sets independent from which they were generated from. In this paper, we present an unsupervised feature learning framework by integrating a principal component analysis algorithm and autoencoder neural network to identify different characteristics from gene expression profiles. As the foundation for the obtained features, an ensemble classifier based on the AdaBoost algorithm (PCA-AE-Ada) was constructed to predict clinical outcomes in breast cancer. During the experiments, we established an additional classifier with the same classifier learning strategy (PCA-Ada) in order to perform as a baseline to the proposed method, where the only difference is the training inputs. The area under the receiver operating characteristic curve index, Matthews correlation coefficient index, accuracy, and other evaluation parameters of the proposed method were tested on several independent breast cancer data sets and compared with representative gene signature-based algorithms including the baseline method. Experimental results demonstrate that the proposed method using deep learning techniques performs better than others.
Ähnliche Arbeiten
Analysis of Relative Gene Expression Data Using Real-Time Quantitative PCR and the 2−ΔΔCT Method
2001 · 180.021 Zit.
Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles
2005 · 56.040 Zit.
<tt>edgeR</tt> : a Bioconductor package for differential expression analysis of digital gene expression data
2009 · 44.133 Zit.
limma powers differential expression analyses for RNA-sequencing and microarray studies
2015 · 42.410 Zit.
clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters
2012 · 37.591 Zit.