Lung Cancer Risk Analysis and Prediction Using Machine Learning Techniques

Authors

  • Hongyi Ding
  • Qi Tong
  • Hongran Wang
  • Zhan Zheng

DOI:

https://doi.org/10.54097/hset.v39i.6525

Keywords:

Lang Cancer; Random Forest; Logistic Regression; Machine Learning.

Abstract

In this work, the main challenges are to find the factors for lung cancer and to use machine learning techniques to analyze the risk of lung cancer. Lung cancer is a malignant tumor, usually arising from the bronchial mucosa or glands of the lungs. The death rate of patients is very rapid. The incidence and death rates of lung cancer are increasing year by year in many countries. Over the past 50 years, many countries have reported significant increases in lung cancer morbidity and mortality. The incidence and mortality of lung cancer in men rank first among all malignant tumors, and the incidence and mortality in women rank second. The random forest and logistic regression are used to predict lung cancer risk based on patients' symptomatic and behavioral features.

Downloads

Download data is not yet available.

References

Mayo Clinic, "Lung cancer - Symptoms and causes," Mayo Clinic, Mar. 23, 2021.

B. E. Johnson, "Second lung cancers in patients after treatment for an initial lung cancer," JNCI: Journal of the National Cancer Institute, vol. 90, no. 18, pp. 1335–1345, 1998.

American Cancer Society, "What Causes Lung Cancer?," Cancer.org, 2010.

American Cancer Society, "How to Detect Non-small Cell Lung Cancer | Lung Cancer Tests," www.cancer.org, Jun. 01, 2021.

S. M. Farber, M. A. Benioff, and J. D. Smith, "Diagnostic problems of cancer of the lung," California Medicine, vol. 76, no. 5, p. 328, 1952.

D. G. Kleinbaum, K. Dietz, M. Gail, M. Klein, and M. Klein, Logistic regression. Springer, 2002.

R. E. Wright, "Logistic regression." 1995.

Z.-Q. Hong and J.-Y. Yang, "Optimal discriminant plane for a small number of samples and design method of classifier on the plane," pattern recognition, vol. 24, no. 4, pp. 317–324, 1991.

Wikipedia Contributors. "Logistic Regression." Wikipedia, Wikimedia Foundation, 12 Apr. 2019.

P. Schober and T. R. Vetter, "Logistic regression in medical research," Anesthesia and analgesia, vol. 132, no. 2, p. 365, 2021.

"Lung Cancer," www.kaggle.com. https://www.kaggle.com/datasets/nancyalaswad90/lung-cancer (accessed Sep. 10, 2022).

D. W. Hosmer Jr, S. Lemeshow, and R. X. Sturdivant, Applied logistic regression. John Wiley & Sons, 2013, vol. 398.

S. J. Rigatti, "Random Forest," Journal of Insurance Medicine, vol. 47, no. 1, pp. 31–39, 2017.

L. Breiman, "Random forests," Machine learning, vol. 45, no. 1, pp. 5–32, 2001.

M. Schonlau and R. Y. Zou, "The random forest algorithm for statistical learning," The Stata Journal, vol. 20, no. 1, pp. 3–29, 2020.

Downloads

Published

01-04-2023

How to Cite

Ding, H., Tong, Q., Wang, H., & Zheng, Z. (2023). Lung Cancer Risk Analysis and Prediction Using Machine Learning Techniques. Highlights in Science, Engineering and Technology, 39, 195-200. https://doi.org/10.54097/hset.v39i.6525