This project analyzes house sales data in the USA to identify pricing trends and relationships between housing features and sale prices using Python.
- Python
- Pandas, NumPy
- Matplotlib, Seaborn
- Scikit-learn
- Jupyter Notebook
- Data cleaning and preprocessing
- Exploratory Data Analysis (EDA)
- Feature scaling using StandardScaler
- Polynomial regression using Scikit-learn Pipeline
- Data visualization for insights and trends
- Identified key factors influencing house prices
- Built regression models to analyze non-linear relationships
- Visualized correlations and pricing patterns
- Clone the repository
- Install dependencies: pip install pandas numpy matplotlib seaborn scikit-learn
- Open the notebook: jupyter notebook
Rajat Dungarwal