Found 2 repositories(showing 2)
Given Ames Housing dataset, the project started with an exploratory data analysis (EDA) to identify the missing values, suspicious data, and redundant variables. Then I performed a mixed stepwise selection to reduce the set of variables and select the best model based on AIC, BIC, and adjust R-squared. With the best model selected, the model assumptions were checked regarding normality, homoscedasticity, collinearity, and linearity between response and predictors. Several solutions were proposed to solve the assumption violation. The model was then tested on unseen data and scored on Root-Mean-Squared-Error (RMSE).
A Python based machine learning project applying multiple linear regression to predict house sale prices in Ames, Iowa. The project demonstrates dataset preparation, exploratory data analysis, model training, coefficient interpretation, and performance evaluation using error metrics and residual plots.
All 2 repositories loaded