Red wine Quality Prediction by Supervised Learning

Authors

  • Peini Li

DOI:

https://doi.org/10.54097/9xh7x607

Keywords:

Red wine; quality; supervised learning.

Abstract

In this essay, various machine learning techniques are carried out to model the red wine dataset, which contains 1599 samples and 11 physical features, and predict its quality. Data processing techniques including dropping duplicate values and removing outliers are first applied to normalize the features and bring them to a similar scale. Five algorithms, KNN, Multinominal Logistic Regression, Bagging, Random Forest, and Gradient Boosting are then applied. Cross-validation is performed to identify the best parameters for each model. Additionally, feature selection is also done in Logistic Regression based on the calculated p-values. All five models perform well, with test accuracies roughly around 60%, indicating that machine learning has enormous potential on red wine quality prediction to facilitate the whole industry. Among them, Logistic Regression and Random Forest have slightly higher accuracies. This paper highlights the importance of machine learning methods of Logistic Regression and Random Forest in red wine quality prediction.

Downloads

Download data is not yet available.

References

Chen, J., Li, R., & Yang, J. A Prediction Model for Quality of Red Wine through Explainable Artificial Intelligence. In 2022 IEEE International Conference on Sensing, Diagnostics, Prognostics, and Control, 2022: 268-272.

Kumar, S., Agrawal, K., & Mandan, N. Red Wine Quality Prediction Using Machine Learning Techniques. In 2020 International Conference on Computer Communication and Informatics, 2020: 1-6.

Mohana, R., Sharma, P., & Sharma, A. Ensemble Framework for Red Wine Quality Prediction. Food Anal. Methods, 2023, 16: 30-44.

Aich, S., Al-Absi, A. A., Hui, K. L., & Sain, M. Prediction of Quality for Different Type of Wine based on Different Feature Sets Using Supervised Machine Learning Techniques. In 2019 21st International Conference on Advanced Communication Technology, 2019: 1122-1127.

Aich, S., Al-Absi, A. A., Hui, K. L., Lee, J. T., & Sain, M. A classification approach with different feature sets to predict the quality of different types of wine using machine learning techniques. In 2018 20th International Conference on Advanced Communication Technology, 2018: 139-143).

Sun, Y., Wong, A. K. C., & Kamel, M. S. Classification of imbalanced data: A review. International Journal of Pattern Recognition and Artificial Intelligence, 2009, 23(4): 687-719.

Tanha, J., Abdi, Y., Samadi, N., et al. Boosting methods for multi-class imbalanced data classification: An experimental review. Journal of Big Data, 2020, 7: 70.

Dahal, K. R., Dahal, J. N., Banjade, H., et al. Prediction of wine quality using machine learning algorithms. Open Journal of Statistics, 2021, 11(2): 278-289.

Kumar, V., & Minz, S. Feature selection: A literature review. SmartCR, 2014, 4(3): 211-229.

Gupta, Y. Selection of important features and predicting wine quality using machine learning techniques. Procedia Computer Science, 2018, 125: 305-312.

Downloads

Published

15-12-2023

How to Cite

Li, P. (2023). Red wine Quality Prediction by Supervised Learning. Highlights in Science, Engineering and Technology, 72, 1121-1125. https://doi.org/10.54097/9xh7x607