House Price Prediction and Analysis Based on Random Forest and XGBoost Models

Authors

  • Han Li

DOI:

https://doi.org/10.54097/hbem.v21i.14837

Keywords:

House price prediction; Random Forest; XGBoost.

Abstract

Accurate prediction of house price is important in housing market. It’s difficult to forecast housing price because it’s influenced by many factors. There has been many discussions on housing  price prediction by kinds of machine learning algorithm. This paper attempt to predict housing price by Random Forest and XGBoost models, and compares the performance between them. In this paper, missing values processing, correlation analysis and standardization of samples are carried on the initial data at first, then two machine learning models are constructed, trained and test on the same dataset. The Kaggle house price dataset (“House Prices-Advanced Regression Techniques”) is used in the paper. The dataset contains 1420 samples with 79 features that represent almost all characteristics of house. The results show that XGBoost algorithm achieve higher R square score of 89% , indicating that XGBoost model can be more  efficient and accurate on measurement and prediction of housing sales prices.

Downloads

Download data is not yet available.

References

Abigail Bola Adetunji, Oluwatobi Noah Akande, Funmilola Alaba Ajala, etal. House Price Prediction using Random Forest Machine Learning Technique. The 8th International Conference on Information Technology and Quantitative Management, 2020&2021

Akash Dagar, Shreya Kapoor. A Comparative Study on House Price Prediction. International Journal for Modern Trends in Science and Technology, 2020, 6(12):103-107.

Xiaotong Li, Xuan Guo, Chengjie Wang. Analysis of Beijing Second-Hand House Price Based on Random Forest. Hans Journal of Data Mining, 2017, 7(2), 37-45

José-María Montero, Román Mínguez and Gema Fernández-Avilés. Housing price prediction: parametric versus semi-parametric spatial hedonic models. Journal of Geographical Systems, 2018, 20, 27–55

Wang Dongxue, Guo Xiujuan. Housing price prediction model based on XGBoost. Northern Architecture, 2021.6, 6(3): 79-82.

Taoran. Optimized housing price prediction based on XGBoost. Journal of Sichuan University(Natural Science Edition), 2022.5, 59(3): (037001)1-18.

B.Vijay Kumar, B.Ashritha, CH.Teja, M.Vineeth. House Price Prediction Using Gradient Boost Regression Model. International Journal of Research and Analytical Reviews, 2020, 7(1), 52-55

Chen Lei. Forecast of Commercial Housing Prices in Nanjing—A Comparative Analysis Based on ARIMA Model and SARIMA Model. Statistics and Applications, 2022, 11(2): 280-287.

Breiman I. Random forests. Machine Learning, 2001, 45:5.

Chen T, Guestrin C. Xgboost: a scaled booting system. Proceedings of the 22nd acmsigkdd integrational conference on knowledge discovery and data mining, 2016.

Downloads

Published

12-12-2023

How to Cite

Li, H. (2023). House Price Prediction and Analysis Based on Random Forest and XGBoost Models. Highlights in Business, Economics and Management, 21, 934-938. https://doi.org/10.54097/hbem.v21i.14837