This project cleans and analyzes real estate listings scraped from Immoweb, a popular Belgian property listing website. The goal is to derive insights about property prices, surface areas, and other relevant features across Belgian municipalities, covering both Flanders and Wallonia.
- Outlier Detection: Identified and plotted outliers in property prices and surface areas.
- Municipality Price Analysis: Determined the most and least expensive municipalities in Belgium, Flanders, and Wallonia based on average price, median price, and price per square meter.
- Property Distribution: Visualized the distribution of properties according to their surface area using histograms.
- Variable Importance: Evaluated and ranked the importance of various features influencing property prices.
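As a rough illustration of the outlier detection and municipality price steps listed above, here is a minimal sketch. It is not the code from this repository: the sample listings are made up, the field names (`municipality`, `price`, `surface_m2`) and both helper functions are hypothetical, and Tukey's 1.5 × IQR rule is only an assumed choice of outlier criterion. It uses the Python standard library alone so it runs without the project's dependencies.

```python
from collections import defaultdict
from statistics import mean, median, quantiles

# Made-up sample data; the field names are assumptions, not the project's schema.
listings = [
    {"municipality": "Gent", "price": 300_000, "surface_m2": 100},
    {"municipality": "Gent", "price": 400_000, "surface_m2": 200},
    {"municipality": "Namur", "price": 200_000, "surface_m2": 100},
    {"municipality": "Namur", "price": 220_000, "surface_m2": 110},
    {"municipality": "Knokke-Heist", "price": 900_000, "surface_m2": 150},
    {"municipality": "Knokke-Heist", "price": 5_000_000, "surface_m2": 200},
]


def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule)."""
    q1, _, q3 = quantiles(values, n=4)  # the three quartile cut points
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def price_stats_by_municipality(rows):
    """Group listings by municipality and compute mean, median, and mean price per m2."""
    groups = defaultdict(list)
    for row in rows:
        groups[row["municipality"]].append(row)

    stats = {}
    for name, items in groups.items():
        prices = [r["price"] for r in items]
        per_m2 = [r["price"] / r["surface_m2"] for r in items]
        stats[name] = {
            "mean_price": mean(prices),
            "median_price": median(prices),
            "mean_price_per_m2": mean(per_m2),
        }
    return stats


price_outliers = iqr_outliers([r["price"] for r in listings])
stats = price_stats_by_municipality(listings)

# Rank municipalities by median price to find the most and least expensive.
ranked = sorted(stats, key=lambda name: stats[name]["median_price"], reverse=True)
most_expensive, least_expensive = ranked[0], ranked[-1]

print("Price outliers:", price_outliers)
print("Most expensive:", most_expensive, "| Least expensive:", least_expensive)
```

Ranking by median rather than mean keeps a single extreme listing from dominating a municipality's position. The same `stats` dictionary can be re-sorted by `mean_price` or `mean_price_per_m2` for the other two comparisons.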
- Clone the repository:
git clone https://github.com/yourusername/immoweb-data-analysis.git
- Navigate to the project directory:
cd immoweb-data-analysis
- Install the required packages:
pip install -r requirements.txt
- Run the analysis scripts in the src/ directory or explore the Jupyter notebook in notebooks/.