Comparative Analysis of Linear and Polynomial Regression Models in Predicting Shanghai Housing Prices

Authors

  • Zhengnan Wang Ulink college of Shanghai, Shanghai, 201600, China

DOI:

https://doi.org/10.54097/ncz9xh08

Keywords:

Housing Price Prediction, Linear Regression, Polynomial Regression, Shanghai Real Estate, Model Comparison.

Abstract

This study presents a comparative analysis of linear and polynomial regression models for predicting housing prices in Shanghai's dynamic real estate market. Utilizing a dataset of 100 second-hand housing listings sourced from Lianjia between 2020 and 2023, with living area as the primary predictive feature, we rigorously train and evaluate both models. Our findings demonstrate that a second-degree polynomial regression model (MSE = 0. 58, R² = 0. 89)significantly outperforms a simple linear regression model (MSE = 0. 97, R² = 0. 78)in predictive accuracy and explanatory power. Residual analysis further reveals that the linear model suffers from heteroscedasticity, indicating its inadequacy in capturing the market's complexity, particularly for high-value properties. The results underscore the importance of selecting appropriately complex models for economic forecasting. We conclude that polynomial regression offers a superior fit for the non- linear trends prevalent in Shanghai's housing market. This study provides practical insights for stakeholders in real estate investment, policy formulation, and urban economics, while also acknowledging limitations related to dataset size and feature selection, suggesting directions for future research involving multivariate approaches and regularization techniques.

Downloads

Download data is not yet available.

References

[1] Lianjia. Shanghai Housing Dataset. 2025, sh.lianjia.com/ershoufang/.

[2] Li, Jian, and Yiming Zhang. "Linear Models for Housing Valuation in Chinese Cities." Journal of Real Estate Research, vol. 40, no. 3, 2018, pp. 345–362.

[3] Kumar, Sanjay, and Rajesh Singh. "Polynomial Regression in Housing Price Forecasting." International Journal of Data Science, vol. 15, no. 2, 2020, pp. 112–125.

[4] Ng, Andrew. Bias-Variance Tradeoff Explained. Stanford Machine Learning Lecture Notes, 2012.

[5] Tanaka, Kenji. "Geographic Segmentation in Urban Price Modeling." Urban Economics Review, vol. 55, no. 1, 2021, pp. 88–104.

[6] Wu, Lei, and Xuan He. "Deep Learning Applications in Chinese Real Estate." Applied AI, vol. 7, no. 4, 2023, pp. 201–220.

[7] Xie, Dong, et al. "Kernel Regression in Real Estate." China Economic Journal, vol. 13, no. 2, 2020, pp. 234–250.

Downloads

Published

13-03-2026

Issue

Section

Articles

How to Cite

Wang, Z. (2026). Comparative Analysis of Linear and Polynomial Regression Models in Predicting Shanghai Housing Prices. Journal of Innovation and Development, 14(3), 297-301. https://doi.org/10.54097/ncz9xh08