Comparative Analysis of Linear and Polynomial Regression Models in Predicting Shanghai Housing Prices
DOI:
https://doi.org/10.54097/ncz9xh08Keywords:
Housing Price Prediction, Linear Regression, Polynomial Regression, Shanghai Real Estate, Model Comparison.Abstract
This study presents a comparative analysis of linear and polynomial regression models for predicting housing prices in Shanghai's dynamic real estate market. Utilizing a dataset of 100 second-hand housing listings sourced from Lianjia between 2020 and 2023, with living area as the primary predictive feature, we rigorously train and evaluate both models. Our findings demonstrate that a second-degree polynomial regression model (MSE = 0. 58, R² = 0. 89)significantly outperforms a simple linear regression model (MSE = 0. 97, R² = 0. 78)in predictive accuracy and explanatory power. Residual analysis further reveals that the linear model suffers from heteroscedasticity, indicating its inadequacy in capturing the market's complexity, particularly for high-value properties. The results underscore the importance of selecting appropriately complex models for economic forecasting. We conclude that polynomial regression offers a superior fit for the non- linear trends prevalent in Shanghai's housing market. This study provides practical insights for stakeholders in real estate investment, policy formulation, and urban economics, while also acknowledging limitations related to dataset size and feature selection, suggesting directions for future research involving multivariate approaches and regularization techniques.
Downloads
References
[1] Lianjia. Shanghai Housing Dataset. 2025, sh.lianjia.com/ershoufang/.
[2] Li, Jian, and Yiming Zhang. "Linear Models for Housing Valuation in Chinese Cities." Journal of Real Estate Research, vol. 40, no. 3, 2018, pp. 345–362.
[3] Kumar, Sanjay, and Rajesh Singh. "Polynomial Regression in Housing Price Forecasting." International Journal of Data Science, vol. 15, no. 2, 2020, pp. 112–125.
[4] Ng, Andrew. Bias-Variance Tradeoff Explained. Stanford Machine Learning Lecture Notes, 2012.
[5] Tanaka, Kenji. "Geographic Segmentation in Urban Price Modeling." Urban Economics Review, vol. 55, no. 1, 2021, pp. 88–104.
[6] Wu, Lei, and Xuan He. "Deep Learning Applications in Chinese Real Estate." Applied AI, vol. 7, no. 4, 2023, pp. 201–220.
[7] Xie, Dong, et al. "Kernel Regression in Real Estate." China Economic Journal, vol. 13, no. 2, 2020, pp. 234–250.
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

