Using Logistic Regression and Support Vector Classification to Predict Cancer

Authors

  • Bowen Zhang

DOI:

https://doi.org/10.54097/bkvnxg90

Keywords:

Logistic Regression, SVC, Prediction.

Abstract

This study investigates the application of machine learning (ML) algorithms in the early diagnosis of breast cancer, focusing on logistic regression and Support Vector Classification (SVC). Utilizing a dataset from Kaggle, which includes diverse clinical features from breast mass samples, the research conducts a comparative analysis of these models in terms of accuracy and interpretability. Our findings reveal that both logistic regression and SVC demonstrate high precision in distinguishing between benign and malignant tumors, with SVC showing a marginally superior performance due to its higher sensitivity and lower rate of false negatives. The study emphasizes the potential of ML in enhancing cancer diagnostic processes, highlighting the importance of non-invasive, cost-effective, and accurate diagnostic alternatives. It also addresses the challenges of model interpretability and the need for more transparent ML applications in clinical settings. This research paves the way for future advancements in medical diagnostics, offering promising directions for integrating ML algorithms into clinical decision-making and patient care.

Downloads

Download data is not yet available.

References

Meurer, William J., and J. Tolles. Logistic Regression Diagnostics: Understanding How Well a Model Predicts Outcomes. JAMA, 2017.

Chen, H., et al. Classification Prediction of Breast Cancer Based on Machine Learning."Computational Intelligence and Neuroscience, 2023, 6530719 - 9.

Wieczorek, J., C. Guerin, and T. McMahon. K‐fold Cross‐validation for Complex Sample Surveys. Stat (International Statistical Institute), 2022, 11 (1).

Thivakaran, T. K., and M. Ramesh. Exploratory Data Analysis and Sales Forecasting of Bigmart Dataset Using Supervised and ANN Algorithms. Measurement. Sensors, 2022, 23, 100388.

Cheng, J., et al. A Variable Selection Method Based on Mutual Information and Variance Inflation Factor. Spectrochimica Acta. Part A, Molecular and Biomolecular Spectroscopy, 2022, 268, 120652.

Liu, Yuxia, et al. Influencing Factors and Prediction Methods of Radiotherapy and Chemotherapy in Patients with Lung Cancer Based on Logistic Regression Analysis. Scientific Reports, 2022, 12 (1), 21094 - 21094.

Downloads

Published

10-04-2024

How to Cite

Zhang, B. (2024). Using Logistic Regression and Support Vector Classification to Predict Cancer. Highlights in Science, Engineering and Technology, 92, 288-294. https://doi.org/10.54097/bkvnxg90