SlideShare une entreprise Scribd logo
1  sur  16
Télécharger pour lire hors ligne
Agenda
★ Introduction & Context
★ What? - Research Question
★ Data Description
★ Exploratory Data Analysis
★ Data Cleaning and Curation
★ Regression Analysis
★ Conclusion
WHYthis project
8,277,829
$14.3 billion
property crime offenses
in the nation in 2014.
estimated losses caused by
property crimes in 2014.
Source: https://www.fbi.gov/about-us/cjis/ucr/crime-in-the-u.s/2014/crime-in-the-u.s.-2014/offenses-known-to-law-enforcement/property-crime
WHATare we doing
Does crime in a neighborhood
affect the home values over time?
RESEARCH QUESTION
DATAdescription
★ 1.14 million records
★ Timeline 2010-15
★ 19 variables
★ 46 crime types
★ Time-series data
★ Dimensions: 5692 x 88
★ Year, Month, ZHVI,
Neighborhood, Region
Seattle 911 Response Zillow Data Research
EXPLORATORY
Data Analysis
EXPLORATORY
Data Analysis
DATACleaning & Curation Methods
Getting rid of unwanted records and columns
Reverse Geocoding Latitudes & Longitudes ~ High
Scale Conversion using Shapefiles in R
Data Transformation ~ Tidy Data (Method applied
Hadley Wickham - Journal of Statistical Software,
vol. 59, 2014.)
Data Aggregation by Year, Month & Neighborhood
Merging the clean datasets by Aggregated Matching
Ready for
Regression Analysis
DATACleaning - R Packages used
gpclib - General Polygon Clipping Library for R
rgeos - Interface to Geometry Engine - Open Source
(GEOS)
maptools - Reading / Handling Spatial Objects
rgdal - Bindings for the Geospatial Data Abstraction
Library
spatialEco - Spatial Analysis and Modelling
tidyr dplyrggplot2
Over 300 lines of R code written for the
project:
DATACleaning & Curation numbers
1.14 Million records narrowed down to 229K values
Reverse geocoded 229K different Latitude-
Longitude combinations and mapped them to 82
neighborhood zones
Zillow data transformed from dim [5692 x 88] to
[20592 x 4] dataframes (Reduce & Transpose)
229K values aggregated to 5904 rows (5 Years, 60
Months, 82 Neighborhoods, 5 Crime types)
Final Merged Dataset to append the Zillow Home
Value to each of the corresponding aggregated 911
value
Ready for
Regression Analysis
Data Analysis
Phew, Finally
Predicting home values as a function of
property crimes
WHAT IS OUR MODEL?
Crimes aggregated by month, year and
neighborhood with corresponding value
of house during the same period and
neighborhood
✔ Linear Relationship between
data
✔ Nearly normally-distributed
residuals
✔ Constant variability
✔ Independent observations
Data Analysis
Collinearity
Data Analysis
Multiple Regression
home.value = β0
+ β1
* Assault.Count + β2
* Arrest.Count + β3
* Burglary.Count + β4
* Disturbance.Count + β5
* Robbery.Count
Estimate Pr (>|t|) 95% Confidence Interval
(Intercept) 469028.28 < 2e-16 461599.3, 476457.2
Assault.Count 1401.19 5.18e-09 931.7, 1870.6
Arrest.Count -2278.03 < 2e-16 -2531.6, -2024.4
Burglary.Count -114.53 0.2973 -329.9, 100.87
Disturbance.Count 56.38 0.0273 6.3, 106.4
Robbery.Count -2413.51 < 2e-16 -2876.40, -1950.6
Multiple R-squared: 0.1106
Assault Count $ 1401.19
Arrest Count $ 2278.03
Burglary Count $ 114.53
Disturbance Count $ 56.38
Robbery Count $ 2413.51
Data Analysis
Model Interpretation
Data Analysis
Special Inferences
Since 2010, 4471 of total 17523 assault
cases reported in Downtown Seattle
Disturbance complaints have less or no correlation
with time or neighborhood ~ evenly distributed
SeattlersliketoParty!
Burglary - Unlawful entry to a structure to commit theft or a felony. In
order for burglary to take place, a victim does not have to be present.
Robbery - Take something from someone that has value by utilizing
intimidation, force or threat. In order for robbery to take place, a
victim must be present at the scene
You are less likely to be
arrested if you stay in
an expensive house!
Other Factors include:
■ Distance from Workplace,
UW and SeaTac Airport.
■ Access to Transportation &
Departmental Stores.
■ History of neighborhood
and past data.
Project Scope
Limitations and Key Learnings
Key Learnings
✔ Data Cleaning
✔ R Programming
✔ Open-ended
questions
✔ Data Science
Thank you!

Contenu connexe

Similaire à Home Value Patrol

Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyCOGS Presentations
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseAzavea
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...Azavea
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime AnalysisAzavea
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataAzavea
 

Similaire à Home Value Patrol (6)

Analyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis CountyAnalyzing the Spatial Distribution of Crime in Annapolis County
Analyzing the Spatial Distribution of Crime in Annapolis County
 
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San JoseForecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
Forecasting Space-Time Events - Strata + Hadoop World 2015 San Jose
 
HackerOne_Report
HackerOne_ReportHackerOne_Report
HackerOne_Report
 
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
2011 NIJ Crime Mapping Conference - Data Mining and Risk Forecasting in Web-b...
 
10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis10 Steps to Optimize Your Crime Analysis
10 Steps to Optimize Your Crime Analysis
 
Data Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) DataData Philly Meetup - Big (Geo) Data
Data Philly Meetup - Big (Geo) Data
 

Dernier

Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxolyaivanovalion
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramMoniSankarHazra
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangaloreamitlee9823
 

Dernier (20)

Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
ALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptxALSO dropshipping via API with DroFx.pptx
ALSO dropshipping via API with DroFx.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts ServiceCall Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
Call Girls In Shalimar Bagh ( Delhi) 9953330565 Escorts Service
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
ELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptxELKO dropshipping via API with DroFx.pptx
ELKO dropshipping via API with DroFx.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Capstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics ProgramCapstone Project on IBM Data Analytics Program
Capstone Project on IBM Data Analytics Program
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Begur Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 

Home Value Patrol

  • 1.
  • 2. Agenda ★ Introduction & Context ★ What? - Research Question ★ Data Description ★ Exploratory Data Analysis ★ Data Cleaning and Curation ★ Regression Analysis ★ Conclusion
  • 3. WHYthis project 8,277,829 $14.3 billion property crime offenses in the nation in 2014. estimated losses caused by property crimes in 2014. Source: https://www.fbi.gov/about-us/cjis/ucr/crime-in-the-u.s/2014/crime-in-the-u.s.-2014/offenses-known-to-law-enforcement/property-crime
  • 4. WHATare we doing Does crime in a neighborhood affect the home values over time? RESEARCH QUESTION
  • 5. DATAdescription ★ 1.14 million records ★ Timeline 2010-15 ★ 19 variables ★ 46 crime types ★ Time-series data ★ Dimensions: 5692 x 88 ★ Year, Month, ZHVI, Neighborhood, Region Seattle 911 Response Zillow Data Research
  • 8. DATACleaning & Curation Methods Getting rid of unwanted records and columns Reverse Geocoding Latitudes & Longitudes ~ High Scale Conversion using Shapefiles in R Data Transformation ~ Tidy Data (Method applied Hadley Wickham - Journal of Statistical Software, vol. 59, 2014.) Data Aggregation by Year, Month & Neighborhood Merging the clean datasets by Aggregated Matching Ready for Regression Analysis
  • 9. DATACleaning - R Packages used gpclib - General Polygon Clipping Library for R rgeos - Interface to Geometry Engine - Open Source (GEOS) maptools - Reading / Handling Spatial Objects rgdal - Bindings for the Geospatial Data Abstraction Library spatialEco - Spatial Analysis and Modelling tidyr dplyrggplot2 Over 300 lines of R code written for the project:
  • 10. DATACleaning & Curation numbers 1.14 Million records narrowed down to 229K values Reverse geocoded 229K different Latitude- Longitude combinations and mapped them to 82 neighborhood zones Zillow data transformed from dim [5692 x 88] to [20592 x 4] dataframes (Reduce & Transpose) 229K values aggregated to 5904 rows (5 Years, 60 Months, 82 Neighborhoods, 5 Crime types) Final Merged Dataset to append the Zillow Home Value to each of the corresponding aggregated 911 value Ready for Regression Analysis
  • 11. Data Analysis Phew, Finally Predicting home values as a function of property crimes WHAT IS OUR MODEL? Crimes aggregated by month, year and neighborhood with corresponding value of house during the same period and neighborhood
  • 12. ✔ Linear Relationship between data ✔ Nearly normally-distributed residuals ✔ Constant variability ✔ Independent observations Data Analysis Collinearity
  • 13. Data Analysis Multiple Regression home.value = β0 + β1 * Assault.Count + β2 * Arrest.Count + β3 * Burglary.Count + β4 * Disturbance.Count + β5 * Robbery.Count Estimate Pr (>|t|) 95% Confidence Interval (Intercept) 469028.28 < 2e-16 461599.3, 476457.2 Assault.Count 1401.19 5.18e-09 931.7, 1870.6 Arrest.Count -2278.03 < 2e-16 -2531.6, -2024.4 Burglary.Count -114.53 0.2973 -329.9, 100.87 Disturbance.Count 56.38 0.0273 6.3, 106.4 Robbery.Count -2413.51 < 2e-16 -2876.40, -1950.6 Multiple R-squared: 0.1106
  • 14. Assault Count $ 1401.19 Arrest Count $ 2278.03 Burglary Count $ 114.53 Disturbance Count $ 56.38 Robbery Count $ 2413.51 Data Analysis Model Interpretation
  • 15. Data Analysis Special Inferences Since 2010, 4471 of total 17523 assault cases reported in Downtown Seattle Disturbance complaints have less or no correlation with time or neighborhood ~ evenly distributed SeattlersliketoParty! Burglary - Unlawful entry to a structure to commit theft or a felony. In order for burglary to take place, a victim does not have to be present. Robbery - Take something from someone that has value by utilizing intimidation, force or threat. In order for robbery to take place, a victim must be present at the scene You are less likely to be arrested if you stay in an expensive house!
  • 16. Other Factors include: ■ Distance from Workplace, UW and SeaTac Airport. ■ Access to Transportation & Departmental Stores. ■ History of neighborhood and past data. Project Scope Limitations and Key Learnings Key Learnings ✔ Data Cleaning ✔ R Programming ✔ Open-ended questions ✔ Data Science Thank you!