Research on Prediction of Liver Disease Based on Machine Learning Models
DOI:
https://doi.org/10.54097/hset.v68i.11926
Keywords:
machine learning; dataset; missing values; encoding categorical variables; normalizing features
Abstract
Liver disease attracts attention worldwide. Conditions such as cirrhosis and liver cancer are common causes of death, and many liver diseases show no obvious symptoms in their early stages, so they are easily overlooked. Effective treatment relies heavily on early diagnosis and management. Because the diagnostic process is costly and complex, this study assessed the effectiveness of several machine learning approaches for identifying liver disease. Five machine learning models were trained to predict the presence of liver disease from a patient's medical records, using an Indian liver patient record dataset. The dataset was prepared for model training through data preprocessing and analysis, including handling missing values, encoding categorical variables, and normalizing features. Of the five algorithms evaluated, Random Forest performed best, with an accuracy of 73.56% on the test set. This study contributes to the field by demonstrating the potential of machine learning to accurately predict liver disease, aiding early diagnosis and treatment.
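The three preprocessing steps named in the abstract (handling missing values, encoding categorical variables, normalizing features) can be illustrated with a minimal sketch. The abstract does not say which imputation or scaling method was used, so median imputation and min-max scaling are assumptions here. The toy records, the field names, and the helper functions `impute_median`, `encode_gender`, and `min_max_normalize` are hypothetical and are not taken from the paper or the dataset.

```python
from statistics import median

# Hypothetical toy records; field names are illustrative, not the dataset's schema.
records = [
    {"age": 65, "gender": "Female", "total_bilirubin": 0.7, "ag_ratio": 0.9},
    {"age": 62, "gender": "Male", "total_bilirubin": 10.9, "ag_ratio": 0.74},
    {"age": 58, "gender": "Male", "total_bilirubin": 1.0, "ag_ratio": None},
    {"age": 72, "gender": "Male", "total_bilirubin": 3.9, "ag_ratio": 0.4},
]

NUMERIC = ["age", "total_bilirubin", "ag_ratio"]


def impute_median(rows, columns):
    """Handle missing values: replace None with the column median (assumed method)."""
    out = [dict(r) for r in rows]  # copy so the input is not mutated
    for col in columns:
        observed = [r[col] for r in out if r[col] is not None]
        fill = median(observed)
        for r in out:
            if r[col] is None:
                r[col] = fill
    return out


def encode_gender(rows):
    """Encode the categorical 'gender' field as 0/1 (assumed encoding)."""
    out = [dict(r) for r in rows]
    for r in out:
        r["gender"] = 1 if r["gender"] == "Male" else 0
    return out


def min_max_normalize(rows, columns):
    """Normalize features: rescale each numeric column to [0, 1] (assumed method)."""
    out = [dict(r) for r in rows]
    for col in columns:
        values = [r[col] for r in out]
        lo, hi = min(values), max(values)
        span = hi - lo
        for r in out:
            # A constant column has zero span; map it to 0.0 to avoid dividing by zero.
            r[col] = (r[col] - lo) / span if span else 0.0
    return out


prepared = min_max_normalize(encode_gender(impute_median(records, NUMERIC)), NUMERIC)
```

In a real experiment the median and the min/max values would normally be computed on the training split only and then reused on the test split, so that no test-set information leaks into preprocessing. A library such as scikit-learn would usually replace these hand-written helpers.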
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







