Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.

Using the machine learning approach to predict patient survival from high-dimensional survival data

2016·28 Zitationen

Volltext beim Verlag öffnen

Zitationen

Autoren

2016

Jahr

Abstract

Survival analysis with high-dimensional data deals with the prediction of patient survival based on their gene expression data and clinical data. A crucial task for the accuracy of survival analysis in this context is to select the features highly correlated with the patient's survival time. Since the information about class labels is hidden, existing feature selection methods in machine learning are not applicable. In contrast to classical statistical methods which address this issue with the Cox score, we propose to tackle this problem by discretizing the survival time of patients into a suitable number of subgroups via silhouettes clustering validity. To cope with patients' censoring, we use “k-nearest neighbor” based on clinical parameters. Feature selection is then accomplished using Fast Correlation-Based Filtering approach from machine learning community. The effectiveness and efficiency of the proposed method are demonstrated through comparisons with classical statistical methods on real-world datasets and simulation datasets.

Autoren

Institutionen

Memorial University of Newfoundland(CA)

Themen

Machine Learning in HealthcareStatistical Methods and InferenceAI in cancer detection

Volltext beim Verlag öffnen

Using the machine learning approach to predict patient survival from high-dimensional survival data

Abstract

Ähnliche Arbeiten

Autoren

Institutionen

Themen