Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Interpretable AI for Skin Cancer Prediction using Clinical Indicators for Indian Patients
0
Zitationen
2
Autoren
2026
Jahr
Abstract
Skin cancer is a serious public health challenge, and early diagnosis is crucial for effective treatment. We present a machine-learning pipeline to classify patients as “Cancer” vs “Non-Cancer” using clinical and demographic features. The study uses a real-world dataset (<tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\mathbf{n}=\mathbf{7 0 4}$</tex>) with 20 predictors (age, gender, lesion site, lifestyle factors, etc.) and an extremely imbalanced target (696 non-cancer vs 8 cancer cases), fortunately, due to less cancer patients observed in India. We perform thorough preprocessing (including handling missing values and categorical encoding), exploratory data analysis, and compare five classifiers: Logistic Regression, Random Forest, Gradient Boosting, Support Vector Machine (RBF kernel), and K-Nearest Neighbors. A stratified 5-fold cross-validation ensures reliable model comparison and overfitting control. The best model (Logistic Regression) achieved near-perfect cross-validated performance (ROC-AUC <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\boldsymbol{\approx} \mathbf{1. 0}$</tex>) and very high test accuracy <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$(99.3 \%)$</tex>, with precision 66.7% and recall 100% on the rare cancer class. Performance metrics (accuracy, precision, recall, <tex xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">$\text{F1}$</tex>, ROC-AUC), confusion matrix, ROC curve, and feature importance (from the tree-based models) are reported. We include visualizations from EDA (class balance, age distribution, lesion site distribution, correlation heatmap) and evaluation (cross-validation bar chart, confusion matrix, ROC curve, and feature importances). The study highlights the predictive power of routine clinical features and provides insights for interpretable skin cancer risk assessment using SHAP and LIME.
Ähnliche Arbeiten
Dermatologist-level classification of skin cancer with deep neural networks
2017 · 13.394 Zit.
Tumor Angiogenesis: Therapeutic Implications
1971 · 10.104 Zit.
Improved Survival with Vemurafenib in Melanoma with BRAF V600E Mutation
2011 · 7.661 Zit.
Final Version of 2009 AJCC Melanoma Staging and Classification
2009 · 4.558 Zit.
Technical Details of Intraoperative Lymphatic Mapping for Early Stage Melanoma
1992 · 4.401 Zit.