This is an overview page with metadata for this scientific paper. The full article is available from the publisher.
Using the Fleming-Harrington Estimator Method to Process Censored Data in Machine Learning: A Methodological Study
Citations: 0
Authors: 2
Year: 2024
Abstract
Cox regression is the method generally used to model censored data. As data volumes have grown, alternative methods have been sought. This study aims to reclassify censored data using the Fleming-Harrington method so that machine learning techniques can be applied, thereby conducting survival analysis through machine learning classification methods. In the application, censored data from acute leukemia patients were used, and four distinct sample sizes were simulated using a correlation matrix obtained from this acute leukemia dataset. The data were adapted to the machine learning algorithms using the Fleming-Harrington method. Four classification algorithms were applied to the datasets: Naïve Bayes, Decision Tree, Random Forest, and Support Vector Machines. Accuracy, the area under the ROC curve (AUC), and the F score were used as performance metrics to compare these algorithms. The Random Forest algorithm performed best on the actual dataset, while the Naïve Bayes algorithm produced the best outcomes on the simulated data. Across the machine learning results, the values were close, with Naïve Bayes outperforming other algorithms in all situations. Comparing the AUC values of the Cox regression method and the Naïve Bayes algorithm on these datasets revealed similar outcomes. However, as the sample size increased, the performance of the Cox regression method decreased, while the performance of the machine learning algorithms increased. Machine learning algorithms can therefore provide valuable insights into cancer patients' mortality status or the likelihood of disease recurrence in studies incorporating survival analyses, especially when the sample size is large.
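The abstract does not spell out how the Fleming-Harrington method is used to reclassify censored observations, so the following is only a minimal sketch under stated assumptions, not the authors' procedure. It computes the standard Fleming-Harrington survival estimate, S(t) = exp(-H(t)), where H(t) is the Nelson-Aalen cumulative hazard (the sum of events divided by the number at risk at each event time), using the simple d/n form for tied event times. The functions `survival_at` and `reclassify`, and the 0.5 threshold, are hypothetical illustrations of one way a censored case could be assigned a binary class label for a classifier.

```python
import math


def fleming_harrington(times, events):
    """Fleming-Harrington survival estimate at each distinct event time.

    times  : observed follow-up times
    events : 1 if the event was observed, 0 if the case was censored
    Returns a list of (time, S(time)) pairs with S(t) = exp(-H(t)),
    where H(t) is the Nelson-Aalen cumulative hazard.
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    cum_hazard = 0.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all cases sharing the same observed time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            # Simple tie handling: d / n (no tie-corrected variant).
            cum_hazard += deaths / n_at_risk
            curve.append((t, math.exp(-cum_hazard)))
        n_at_risk -= removed
    return curve


def survival_at(curve, t):
    """Step-function lookup of the estimated survival probability at time t."""
    s = 1.0
    for time, surv in curve:
        if time <= t:
            s = surv
        else:
            break
    return s


def reclassify(times, events, threshold=0.5):
    """Hypothetical relabeling rule (an assumption, not taken from the paper).

    Observed events keep label 1. A censored case gets label 1 when its
    estimated survival probability at the censoring time falls below
    `threshold`, and label 0 otherwise.
    """
    curve = fleming_harrington(times, events)
    labels = []
    for t, e in zip(times, events):
        if e == 1:
            labels.append(1)
        else:
            labels.append(1 if survival_at(curve, t) < threshold else 0)
    return labels
```

The resulting labels could then be passed, together with the covariates, to any standard classifier such as Naïve Bayes or Random Forest. The threshold is a design choice that would need to be justified for a given dataset.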