top of page

Predicting Housing Prices in Cook County


In this, you will specify and fit a linear model to a few features of the housing data to predict housing prices. Next, we will analyze the error of the model and brainstorm ways to improve the model’s performance. Finally, we’ll delve deeper into the implications of predictive modeling within the Cook County Assessor’s Office (CCAO) case study, especially because statistical modeling is how the CCAO valuates properties. Given the history of racial discrimination in housing policy and property taxation in Cook County, consider the impacts of your modeling results as you work through this assignment - and think about what fairness might mean to property owners in Cook County. After this homework, you should be comfortable with: - Implementing a data processing pipeline using pandas - Using scikit-learn to build and fit linear models

Import All Related Libraries

import numpy as np 

import pandas as pd 
from pandas.api.types import CategoricalDtype 

%matplotlib inline 
import matplotlib.pyplot as plt 
import seaborn as sns 

import warnings 

import zipfile 
import os 

from ds100_utils import run_linear_regression_test 

# Plot settings 
plt.rcParams['figure.figsize'] = (12, 9) 
plt.rcParams['font.size'] = 12

Read Data

# Reload the data 
full_data = pd.read_csv("cook_county_train.csv") 

Split dataset

# Process the data using the pipeline for the first model
train_m1, test_m1 = train_test_split(full_data)

# A custom function that applies log transformation
def log_transform(data, col):
  data['Log ' + col] = np.log(data[col])
return data

m1_pipelines = [
(remove_outliers, None, {
'variable': 'Sale Price',
'lower': 499,

(log_transform, None, {'col': 'Sale Price'}),
(add_total_bedrooms, None, None),
(select_columns, ['Log Sale Price', 'Bedrooms'], None)

X_train_m1, y_train_m1 = process_data_gm(train_m1, m1_pipelines, 'Log Sale Price')
X_test_m1, y_test_m1 = process_data_gm(test_m1, m1_pipelines, 'Log Sale Price')

Linear Model

#creating linear model
from sklearn.linear_model import LinearRegression
reg = LinearRegression()

Fit Into Model, y_train_m1)

Find the score

reg.score(x_test_m1, y_test_m1)

If you need any programming assignment help in Machine Learning, Machine Learning project or Machine Learning homework then we are ready to help you.

Send your request at and get instant help with an affordable price.

We are always focus to delivered unique or without plagiarism code which is written by our highly educated professional which provide well structured code within your given time frame. If you are looking other programming language help like C, C++, Java, Python, PHP, Asp.Net, NodeJs, ReactJs, etc. with the different types of databases like MySQL, MongoDB, SQL Server, Oracle, etc. then also contact us.


bottom of page