Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Diagnosis of Breast Cancer Using Random Forests
57
Zitationen
2
Autoren
2023
Jahr
Abstract
Breast cancer was the most diagnosed form of cancer in 2020. Early diagnosis of breast cancer results in a significant improvement in long-term survival rates. Current methods require consultation with experts, which is expensive and time-consuming and thus may not be accessible to all. This paper seeks to train and evaluate supervised machine learning models for the accurate and efficient detection of breast cancer. The Wisconsin Breast Cancer Database dataset describes 30 attributes of cell nuclei, including, but not limited to, their radius, texture, and concavity. It contains 569 instances, 212 of which are malignant tumors. The Random Forest algorithm outperforms other algorithms in classifying breast tumors as either malignant or benign and is thus selected as our primary model. It is trained on two different subsets of the dataset having 16 and 8 features, respectively, identified with the help of multiple feature selection methods. The Random Forest models are tested post hyperparameter tuning on a holdout set, and accuracies of 100% and 99.30% respectively. The models are also compared with four other machine learning classification algorithms: Support Vector Machine (SVM), Decision Tree, Multilayer Perceptron, and K-Nearest Neighbors. The results confirm that Random Forest is the superior method for breast cancer diagnosis.
Ähnliche Arbeiten
A survey on deep learning in medical image analysis
2017 · 13.877 Zit.
pROC: an open-source package for R and S+ to analyze and compare ROC curves
2011 · 13.749 Zit.
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.437 Zit.
A survey on Image Data Augmentation for Deep Learning
2019 · 12.029 Zit.
QuPath: Open source software for digital pathology image analysis
2017 · 8.377 Zit.