Study for Iris Classification Based on Multiple Machine Learning Models

Authors

  • Yihan Zhou

DOI:

https://doi.org/10.54097/hset.v23i.3620

Keywords:

MLP; KNN; SVM; Random Forest model; Neural Network; Iris dataset.

Abstract

Classification algorithms in machine learning aim to classify data into different kinds, which is important in data mining. It is important and challenging to select the classification model with high accuracy and efficiency. To address this, this paper compares and analyzes Multilayer Perceptron (MLP)with different machine learning methods, including K-NearestNeighbor (KNN), Support Vector Machine (SVM), Logistic regression, decision tree, and random Forest. All models are trained to classify different kinds of Iris to show further comprehension of different models. All the related models are evaluated with metrics called accuracy and confusion matrices. The experimental results show that the confusion matrices based on various models are similar but MLP overperforms other models in terms of accuracy. With sufficient nodes per hidden layer for backpropagation, MLP has a remarkable ability to analyze and provide projections. With this superiority, it is expected that the neural network can have better performance in classification soon. After understanding the strength and limitations of various classification algorithms, the optimization models are expected to be developed to better solve real-world problems.

Downloads

Download data is not yet available.

References

S. Ali, K. A. Smith, On learning algorithm selection for classification. Applied Soft Computing, 2006, 6(2): 119-138.

J. M. Keller, M. R. Gray and J. A. Givens, A fuzzy K-nearest neighbor algorithm. IEEE Transactions on Systems, Man, and Cybernetics, 1985, SMC-15(4): 580-585.

N. Cristianini and J. Shawe-Taylor, An introduction to support vector machines and other kernel-based learning methods. Cambridge: Cambridge University Press, 2000.

S. Dreiseitl, L. Ohno-Machado, Logistic regression and artificial neural network classification models: a methodology review. Journal of Biomedical Informatics, 2002, 35: 352-359.

Y. Y. Song, Y. Lu, Decision tree methods: applications for classification and prediction. Shanghai Arch Psychiatry, 2015,27(2): 130-135.

L. Breiman, Random Forests. Machine Learning, 2001, 45: 5-32.

M. W Gardner, S. R Dorling, Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmospheric Environment, 1998, 32(14-15): 2627-2636.

Y. LeCun, Y. Bengio, G. Hinton, Deep learning, Nature, 2015, 521: 436-444.

Information from: https://archive.ics.uci.edu/ml/datasets/iris

S. K. Pal, S. Mitra, Multilayer perceptron, fuzzy sets, and classification, IEEE Transactions on Neural Networks.1992, 3(5): 683-697.

D. W. Ruck, S.K. Rogers, M. Kabrisky, Feature selection using a multilayer perceptron. Neural Network Comput, 1990, 2: 40-48.

Z. Y. Deng, X. S. Zhu, D. B. Chenget. Efficient kNN classification algorithm for big data, Neurocomputing, 2016, 195: 143-148.

A. Shokrzade, M. Ramezani, F. Akhlaghian, A novel extreme learning machine based KNN classification method for dealing with big data. Expert Systems with Applications, 2021.183: 115293.

J. X. Deng. B. Xie. D. D. You, Process parameters design of squeeze casting through an improved KNN algorithm and existing data. Journal of Manufacturing Processes, 2022, 84 (1320-1330)

T. G. Dietterich, an experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Machine Learning, 2020 40: 139-157.

Downloads

Published

03-12-2022

How to Cite

Zhou, Y. (2022). Study for Iris Classification Based on Multiple Machine Learning Models. Highlights in Science, Engineering and Technology, 23, 342-349. https://doi.org/10.54097/hset.v23i.3620