Predicting housing prices and analyzing real estate market in the Chicago suburbs using Machine Learning
Kevin Xu, Hieu Nguyen

TL;DR
This study applies various machine learning models to predict housing prices in Chicago suburbs, demonstrating that XGBoost outperforms others in volatile post-pandemic market conditions, and analyzes influential factors using SHAP values.
Contribution
It introduces a comparative analysis of multiple machine learning models for housing price prediction in a volatile market and evaluates variable importance with SHAP.
Findings
XGBoost achieved the lowest error metrics among models.
Machine learning models effectively predict prices in volatile markets.
SHAP analysis identified key factors influencing house prices.
Abstract
The pricing of housing properties is determined by a variety of factors. However, post-pandemic markets have experienced volatility in the Chicago suburb area, which have affected house prices greatly. In this study, analysis was done on the Naperville/Bolingbrook real estate market to predict property prices based on these housing attributes through machine learning models, and to evaluate the effectiveness of such models in a volatile market space. Gathering data from Redfin, a real estate website, sales data from 2018 up until the summer season of 2022 were collected for research. By analyzing these sales in this range of time, we can also look at the state of the housing market and identify trends in price. For modeling the data, the models used were linear regression, support vector regression, decision tree regression, random forest regression, and XGBoost regression. To analyze…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHousing Market and Economics
MethodsMasked autoencoder
