Data analytics approaches for breast cancer survivability: comparison of data mining methods
Authors
Advisors
Dag, Ali
Issue Date
Type
Keywords
Citation
Abstract
In the early stages of breast cancer, inefficient treatment methods, as well as the patient's health condition may impact the patient's lifetime expectancy. In this study, given a set of explanatory variables that include the patient's demographics, health condition, and cancer treatment regimen, our objective is to investigate the performance of four different machine learning methods including an artificial neural network, classification and regression tree, logistic regression, and Bayesian belief network. We utilize these four methods with a ten-fold cross validation to predict the ten-year survivability of a breast cancer patient after initial diagnosis. The results of each method are compared with respect to accuracy, sensitivity, specificity, and area-under-the-curve(AUC) metrics. We observe that the logistic regression method shows better performance compared to the others with respect to the AUC metric. In all prediction models, the stage of the cancer is the most important predictor of breast cancer survivability.
Table of Contents
Description
Research completed in the Department of Industrial, Systems and Manufacturing Engineering, Wichita State University and Beacom School of Business, University of South Dakota
Publisher
Journal
Book Title
Series
v. 13