Mobile Phone Price Prediction with Feature Reduction

Menghan Chen

doi:10.54097/hset.v34i.5440

Keywords:

Machine Learning, Classification, Feature Reduction, Correlation, PCA.

Abstract

Feature reduction lowers data dimensionality and shrinks model size, which lets the model focus on the most relevant data and produce predictions faster. This paper explores the performance and effectiveness of feature reduction methods used together with a Multilayer Perceptron (MLP) classifier to predict mobile phone price range. Pearson’s correlation and Principal Component Analysis (PCA) are chosen as the feature reduction techniques. The experiment ranks the features by significance using each of the two methods. Three experimental groups remove features five at a time, and a control group uses no feature selection. All groups are then trained and tested on an open dataset with the MLP, and their accuracy and loss are compared. The results indicate that selecting features by correlation coefficient improves the accuracy of the classification model. With PCA, performance improves slightly when only a few features are removed, but degrades sharply when more are eliminated. Pearson’s correlation outperforms PCA in this experiment, achieving 95.8% accuracy and validating the effectiveness of the feature reduction method.
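The correlation-based ranking described in the abstract can be illustrated with a minimal sketch. This is not the paper's code: the function names (`pearson`, `rank_features`, `drop_weakest`), the feature names, and the toy values are all hypothetical and chosen only for illustration. The sketch assumes features are ranked by the absolute value of their Pearson correlation with the price-range label, and that the lowest-ranked ones are dropped before training.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column has no linear relation to anything
    return cov / math.sqrt(var_x * var_y)


def rank_features(columns, target):
    """Return feature names sorted by |r| with the target, strongest first."""
    scores = {name: abs(pearson(values, target)) for name, values in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)


def drop_weakest(columns, target, n_drop):
    """Keep all but the n_drop features with the weakest correlation."""
    ranked = rank_features(columns, target)
    keep = ranked[: len(ranked) - n_drop]
    return {name: columns[name] for name in keep}


# Toy data (made up for illustration; not taken from the paper's dataset).
toy_columns = {
    "ram": [512, 1024, 2048, 4096, 3072, 256],
    "battery_power": [800, 1500, 1300, 1700, 1400, 1100],
    "noise": [3, 1, 3, 1, 2, 2],
}
toy_price_range = [0, 1, 2, 3, 3, 0]

ranking = rank_features(toy_columns, toy_price_range)
reduced = drop_weakest(toy_columns, toy_price_range, n_drop=1)
```

Here `ranking` orders the toy features from most to least correlated with the label, and `reduced` holds the columns that would be passed on to a classifier. In the setting the abstract describes, the same ranking step would be repeated with a larger `n_drop` for each experimental group.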

