SlideShare une entreprise Scribd logo
1  sur  13
New NYC Business
Incorporation 2005-2013
An Exploration of Non-Minority and
Minority-Owned Enterprise Creation
By Shelby Ahern
stahern@gmail.com
NYC Data Science Academy
Student Demo day 07-21-2014
R005: Data Science by R(Beginner level)
Explore
• New Business Incorporation in NYC between 2005-2013, and
• New Business Incorporation, by Minority and Non-Minority Ownership
Data Sources
• Active NewYork Corporations: Beginning in 18001
• NYC Online Directory of Certified Businesses: Minority-Owned Business Enterprises
(MBE)2,3
• U.S. Census Population Estimates4
• EntityType:
• Domestic BusinessCorporation
• Domestic Cooperative Corporation
• Domestic Professional Corporation
Parameters and Notes
• 2005-2013 (9 years)
• Borough = County (ie. Manhattan: NewYork County, Brooklyn: Kings County,
Queens = Queens County, Bronx = Bronx County, Staten Island = Richmond County
Create Data Frames of Data from Each Source
Run Summary Statistics forValidation
Split by Borough and Combine DFs from Different Sources
Perform Calculations ie. New Incorporations per Capita
DataViz!
Test:
“Density” of New MBE Corps for Minority Population ≠ “Density” of New
Non-MBECorps per Non-Minority Population
An Initial Review of the Summaries of the Corporation Data and MBE-Certified
Corporations show…
Major disparity between the Number of Incorporations per year, and number of
MBE’s established in that year.
Why?
- Data Quality: Change in Ownership Structure, Restrictions to MBE Certifications, and/or Filing Lag
- !!What the Data actually represent: MBE application purpose & process
>MH_Corps
County year NewCorps NewMBECorps Tot_Pop MBE_pop1 NwCorpsperCap NwMBECorpsperCap NwMBECorpsperMBECap
NwNonMBECor
psperCap
1NEW YORK 2005 5101 35 1529774 690696 0.0033 2.30E-05 5.10E-05 0.006
2NEW YORK 2006 5395 42 1611581 738221 0.0033 2.60E-05 5.70E-05 0.0061
3NEW YORK 2007 5373 39 1620867 724926 0.0033 2.40E-05 5.40E-05 0.006
4NEW YORK 2008 5602 38 1634795 696413 0.0034 2.30E-05 5.50E-05 0.0059
5NEW YORK 2009 7617 39 1629054 669583 0.0047 2.40E-05 5.80E-05 0.0079
6NEW YORK 2010 9872 34 1585873 674800 0.0062 2.10E-05 5.00E-05 0.0108
7NEW YORK 2011 9909 24 1601948 703250 0.0062 1.50E-05 3.40E-05 0.011
8NEW YORK 2012 10326 15 1619090 697407 0.0064 9.30E-06 2.20E-05 0.0112
9NEW YORK 2013 10345 3 1585873 546732 0.0065 1.90E-06 5.50E-06 0.01
After merging data from different data frames, we are able to calculate the number of
new corporations filed per capita, on a yearly basis.
Further, we calculate the number of new corporations filed per capita of certain
populations, like MBEs/Minority and Non-MBE’s/Non-Minority populations.
Example Data Frame, Manhattan
$NwCorpsperCap
$NwMBECorpsperCap
Incorporations per Capita and MBE Incorporations per capita, 2005- 2013
MBE Incorporations per Capita, 2005- 2013
$NwCorpsperCap
$NwMBECorpsperCap
Findings:
The per-capita incidence of incorporations increased across all
boroughs, from 2005 - 2013.
Manhattan, Queens, and Brooklyn had the highest per-capita
incorporations.
Queens appears to have the steepest increase in corporation
filings.
MBE incorporations per capita are a thousands of times smaller
than the general level of per-capita-incorporation.
The per-capita incidence of MBE incorporations varied by borough
(led by Manhattan), and trended downward after 2009.
Hypothesis:The number of MBE incorporations per non-white
person is not equal to the number of non-MBE incorporations
per white person.
The approach:
1. SelectValue to test:
 MBE Corps per Minority capita
 Non-MBE Corps per Non-Minority capita
▪ Utilize data from all years and boroughs(5 boroughs x 9 years x 2 categories = 90 obs.)
2. Evaluate which test(s) to conduct.
 Parametric vs. Non-parametric
 Means test vs. Other
3. Conduct test and analyze results.
Histogram, MBE Incorporations per Minority capita QQplot, MBE Incorporations per Minority capita
Histogram,
Non-MBE Incorporations per Non-Minority capita
QQplot, Non-MBE Incorporations per Non-Minority
capita
Neither MBE nor Non-MBE per capita data appear to be normally distributed.
Hence, we’ll consider the following two non-parametric tests:
Mood’s MedianTest
A nonparametric test where the null hypothesis of
the medians of the populations from which two or
more samples are drawn are identical. (Wikipedia)
H0: Medians of MBE - Minority cap and Non-MBE --
Non-Minority cap are equivalent.
H1: Medians of MBE - Minority cap and Non-MBE --
Non-Minority cap are NOT equivalent.
Mann-Whitney-Wilcoxon Test
A nonparametric test of the null
hypothesis that two populations are the same
against an alternative hypothesis, especially that a
particular population tends to have larger values
than the other. (Wikipedia)
H0: MBE - Minority cap and Non-MBE -- Non-
Minority cap could be representative of the
same set of data.
H1: MBE - Minority cap and Non-MBE -- Non-
Minority cap could NOT be representative of the
same set of data.
In both tests of parity, the null hypothesis is
rejected, thus we find that the incidence of
new business incorporations per capita are
different between the two populations.

Contenu connexe

Similaire à NYC Business Incorporation Disparities

Default of Credit Card Payments
Default of Credit Card PaymentsDefault of Credit Card Payments
Default of Credit Card PaymentsVikas Virani
 
GraphingCharting ProjectBelow is data obtained from the Natio.docx
GraphingCharting ProjectBelow is data obtained from the Natio.docxGraphingCharting ProjectBelow is data obtained from the Natio.docx
GraphingCharting ProjectBelow is data obtained from the Natio.docxbenjaminjames21681
 
Lecture 5 Data Visualisation
Lecture 5 Data VisualisationLecture 5 Data Visualisation
Lecture 5 Data Visualisationpaul.hawking
 
Descriptive Statistics, Numerical Description
Descriptive Statistics, Numerical DescriptionDescriptive Statistics, Numerical Description
Descriptive Statistics, Numerical Descriptiongetyourcheaton
 
Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...AdekunleJoseph4
 
MRMW N America 2016 presentation kelly and zanutto naxion
MRMW N America 2016 presentation kelly and zanutto naxionMRMW N America 2016 presentation kelly and zanutto naxion
MRMW N America 2016 presentation kelly and zanutto naxionMichael Kelly
 
Data Science process
Data Science processData Science process
Data Science processbigdata trunk
 
New tools and new evidence on business dynamics. Chiara Criscuolo
 New tools and new evidence on business dynamics. Chiara Criscuolo  New tools and new evidence on business dynamics. Chiara Criscuolo
New tools and new evidence on business dynamics. Chiara Criscuolo enterpriseresearchcentre
 
Basic Analytics Module for Sponsors
Basic Analytics Module for SponsorsBasic Analytics Module for Sponsors
Basic Analytics Module for SponsorsDee Daley
 
Crossing the Borders towards Entrepreneurship:
Crossing the Borders towards Entrepreneurship: Crossing the Borders towards Entrepreneurship:
Crossing the Borders towards Entrepreneurship: enterpriseresearchcentre
 
BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12
BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12
BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12SWAMI NATH SATPAL
 
ONS presentation at RSS South Wales poverty & inequality stats event
ONS presentation at RSS South Wales poverty & inequality stats eventONS presentation at RSS South Wales poverty & inequality stats event
ONS presentation at RSS South Wales poverty & inequality stats eventRichard Tonkin
 
Presentation of Project and Critique.pptx
Presentation of Project and Critique.pptxPresentation of Project and Critique.pptx
Presentation of Project and Critique.pptxBillyMoses1
 

Similaire à NYC Business Incorporation Disparities (20)

Default of Credit Card Payments
Default of Credit Card PaymentsDefault of Credit Card Payments
Default of Credit Card Payments
 
GraphingCharting ProjectBelow is data obtained from the Natio.docx
GraphingCharting ProjectBelow is data obtained from the Natio.docxGraphingCharting ProjectBelow is data obtained from the Natio.docx
GraphingCharting ProjectBelow is data obtained from the Natio.docx
 
Lecture 5 Data Visualisation
Lecture 5 Data VisualisationLecture 5 Data Visualisation
Lecture 5 Data Visualisation
 
Descriptive Statistics, Numerical Description
Descriptive Statistics, Numerical DescriptionDescriptive Statistics, Numerical Description
Descriptive Statistics, Numerical Description
 
Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...Construction of a robust prediction model to forecast the likelihood of a cre...
Construction of a robust prediction model to forecast the likelihood of a cre...
 
Estimation & Adjustment in Census 2021
Estimation & Adjustment in Census 2021Estimation & Adjustment in Census 2021
Estimation & Adjustment in Census 2021
 
MD poverty indexes
MD poverty indexesMD poverty indexes
MD poverty indexes
 
11
1111
11
 
MRMW N America 2016 presentation kelly and zanutto naxion
MRMW N America 2016 presentation kelly and zanutto naxionMRMW N America 2016 presentation kelly and zanutto naxion
MRMW N America 2016 presentation kelly and zanutto naxion
 
Data Science process
Data Science processData Science process
Data Science process
 
New tools and new evidence on business dynamics. Chiara Criscuolo
 New tools and new evidence on business dynamics. Chiara Criscuolo  New tools and new evidence on business dynamics. Chiara Criscuolo
New tools and new evidence on business dynamics. Chiara Criscuolo
 
Qnt275 qnt 275
Qnt275 qnt 275Qnt275 qnt 275
Qnt275 qnt 275
 
Basic Analytics Module for Sponsors
Basic Analytics Module for SponsorsBasic Analytics Module for Sponsors
Basic Analytics Module for Sponsors
 
What Is Your LMIQ
What Is Your LMIQWhat Is Your LMIQ
What Is Your LMIQ
 
Crossing the Borders towards Entrepreneurship:
Crossing the Borders towards Entrepreneurship: Crossing the Borders towards Entrepreneurship:
Crossing the Borders towards Entrepreneurship:
 
Lmi Demystify62910
Lmi Demystify62910Lmi Demystify62910
Lmi Demystify62910
 
Week_2_Lecture.pdf
Week_2_Lecture.pdfWeek_2_Lecture.pdf
Week_2_Lecture.pdf
 
BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12
BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12
BPPIMT(VIPRoad)-Big Data Analytics with R-Gr12
 
ONS presentation at RSS South Wales poverty & inequality stats event
ONS presentation at RSS South Wales poverty & inequality stats eventONS presentation at RSS South Wales poverty & inequality stats event
ONS presentation at RSS South Wales poverty & inequality stats event
 
Presentation of Project and Critique.pptx
Presentation of Project and Critique.pptxPresentation of Project and Critique.pptx
Presentation of Project and Critique.pptx
 

Plus de Vivian S. Zhang

Career services workshop- Roger Ren
Career services workshop- Roger RenCareer services workshop- Roger Ren
Career services workshop- Roger RenVivian S. Zhang
 
Nycdsa wordpress guide book
Nycdsa wordpress guide bookNycdsa wordpress guide book
Nycdsa wordpress guide bookVivian S. Zhang
 
We're so skewed_presentation
We're so skewed_presentationWe're so skewed_presentation
We're so skewed_presentationVivian S. Zhang
 
Wikipedia: Tuned Predictions on Big Data
Wikipedia: Tuned Predictions on Big DataWikipedia: Tuned Predictions on Big Data
Wikipedia: Tuned Predictions on Big DataVivian S. Zhang
 
A Hybrid Recommender with Yelp Challenge Data
A Hybrid Recommender with Yelp Challenge Data A Hybrid Recommender with Yelp Challenge Data
A Hybrid Recommender with Yelp Challenge Data Vivian S. Zhang
 
Kaggle Top1% Solution: Predicting Housing Prices in Moscow
Kaggle Top1% Solution: Predicting Housing Prices in Moscow Kaggle Top1% Solution: Predicting Housing Prices in Moscow
Kaggle Top1% Solution: Predicting Housing Prices in Moscow Vivian S. Zhang
 
Data mining with caret package
Data mining with caret packageData mining with caret package
Data mining with caret packageVivian S. Zhang
 
Streaming Python on Hadoop
Streaming Python on HadoopStreaming Python on Hadoop
Streaming Python on HadoopVivian S. Zhang
 
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorKaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorVivian S. Zhang
 
Nyc open-data-2015-andvanced-sklearn-expanded
Nyc open-data-2015-andvanced-sklearn-expandedNyc open-data-2015-andvanced-sklearn-expanded
Nyc open-data-2015-andvanced-sklearn-expandedVivian S. Zhang
 
Nycdsa ml conference slides march 2015
Nycdsa ml conference slides march 2015 Nycdsa ml conference slides march 2015
Nycdsa ml conference slides march 2015 Vivian S. Zhang
 
THE HACK ON JERSEY CITY CONDO PRICES explore trends in public data
THE HACK ON JERSEY CITY CONDO PRICES explore trends in public dataTHE HACK ON JERSEY CITY CONDO PRICES explore trends in public data
THE HACK ON JERSEY CITY CONDO PRICES explore trends in public dataVivian S. Zhang
 
Max Kuhn's talk on R machine learning
Max Kuhn's talk on R machine learningMax Kuhn's talk on R machine learning
Max Kuhn's talk on R machine learningVivian S. Zhang
 
Winning data science competitions, presented by Owen Zhang
Winning data science competitions, presented by Owen ZhangWinning data science competitions, presented by Owen Zhang
Winning data science competitions, presented by Owen ZhangVivian S. Zhang
 
Using Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York TimesUsing Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York TimesVivian S. Zhang
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rVivian S. Zhang
 

Plus de Vivian S. Zhang (20)

Why NYC DSA.pdf
Why NYC DSA.pdfWhy NYC DSA.pdf
Why NYC DSA.pdf
 
Career services workshop- Roger Ren
Career services workshop- Roger RenCareer services workshop- Roger Ren
Career services workshop- Roger Ren
 
Nycdsa wordpress guide book
Nycdsa wordpress guide bookNycdsa wordpress guide book
Nycdsa wordpress guide book
 
We're so skewed_presentation
We're so skewed_presentationWe're so skewed_presentation
We're so skewed_presentation
 
Wikipedia: Tuned Predictions on Big Data
Wikipedia: Tuned Predictions on Big DataWikipedia: Tuned Predictions on Big Data
Wikipedia: Tuned Predictions on Big Data
 
A Hybrid Recommender with Yelp Challenge Data
A Hybrid Recommender with Yelp Challenge Data A Hybrid Recommender with Yelp Challenge Data
A Hybrid Recommender with Yelp Challenge Data
 
Kaggle Top1% Solution: Predicting Housing Prices in Moscow
Kaggle Top1% Solution: Predicting Housing Prices in Moscow Kaggle Top1% Solution: Predicting Housing Prices in Moscow
Kaggle Top1% Solution: Predicting Housing Prices in Moscow
 
Data mining with caret package
Data mining with caret packageData mining with caret package
Data mining with caret package
 
Xgboost
XgboostXgboost
Xgboost
 
Streaming Python on Hadoop
Streaming Python on HadoopStreaming Python on Hadoop
Streaming Python on Hadoop
 
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its authorKaggle Winning Solution Xgboost algorithm -- Let us learn from its author
Kaggle Winning Solution Xgboost algorithm -- Let us learn from its author
 
Xgboost
XgboostXgboost
Xgboost
 
Nyc open-data-2015-andvanced-sklearn-expanded
Nyc open-data-2015-andvanced-sklearn-expandedNyc open-data-2015-andvanced-sklearn-expanded
Nyc open-data-2015-andvanced-sklearn-expanded
 
Nycdsa ml conference slides march 2015
Nycdsa ml conference slides march 2015 Nycdsa ml conference slides march 2015
Nycdsa ml conference slides march 2015
 
THE HACK ON JERSEY CITY CONDO PRICES explore trends in public data
THE HACK ON JERSEY CITY CONDO PRICES explore trends in public dataTHE HACK ON JERSEY CITY CONDO PRICES explore trends in public data
THE HACK ON JERSEY CITY CONDO PRICES explore trends in public data
 
Max Kuhn's talk on R machine learning
Max Kuhn's talk on R machine learningMax Kuhn's talk on R machine learning
Max Kuhn's talk on R machine learning
 
Winning data science competitions, presented by Owen Zhang
Winning data science competitions, presented by Owen ZhangWinning data science competitions, presented by Owen Zhang
Winning data science competitions, presented by Owen Zhang
 
Using Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York TimesUsing Machine Learning to aid Journalism at the New York Times
Using Machine Learning to aid Journalism at the New York Times
 
Introducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with rIntroducing natural language processing(NLP) with r
Introducing natural language processing(NLP) with r
 
Bayesian models in r
Bayesian models in rBayesian models in r
Bayesian models in r
 

Dernier

Prach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism CommunityPrach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism Communityprachaibot
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfChristianCDAM
 
Robotics Group 10 (Control Schemes) cse.pdf
Robotics Group 10  (Control Schemes) cse.pdfRobotics Group 10  (Control Schemes) cse.pdf
Robotics Group 10 (Control Schemes) cse.pdfsahilsajad201
 
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Sumanth A
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating SystemRashmi Bhat
 
Novel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending ActuatorsNovel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending ActuatorsResearcher Researcher
 
Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________Romil Mishra
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Erbil Polytechnic University
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxsiddharthjain2303
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating SystemRashmi Bhat
 
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书rnrncn29
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communicationpanditadesh123
 
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithmComputer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithmDeepika Walanjkar
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxRomil Mishra
 
signals in triangulation .. ...Surveying
signals in triangulation .. ...Surveyingsignals in triangulation .. ...Surveying
signals in triangulation .. ...Surveyingsapna80328
 
TEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACHTEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACHSneha Padhiar
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solidnamansinghjarodiya
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONjhunlian
 
CS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdfCS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdfBalamuruganV28
 

Dernier (20)

Prach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism CommunityPrach: A Feature-Rich Platform Empowering the Autism Community
Prach: A Feature-Rich Platform Empowering the Autism Community
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdf
 
Robotics Group 10 (Control Schemes) cse.pdf
Robotics Group 10  (Control Schemes) cse.pdfRobotics Group 10  (Control Schemes) cse.pdf
Robotics Group 10 (Control Schemes) cse.pdf
 
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
Robotics-Asimov's Laws, Mechanical Subsystems, Robot Kinematics, Robot Dynami...
 
Designing pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptxDesigning pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptx
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating System
 
Novel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending ActuatorsNovel 3D-Printed Soft Linear and Bending Actuators
Novel 3D-Printed Soft Linear and Bending Actuators
 
Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________Gravity concentration_MI20612MI_________
Gravity concentration_MI20612MI_________
 
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
Comparative study of High-rise Building Using ETABS,SAP200 and SAFE., SAFE an...
 
Energy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptxEnergy Awareness training ppt for manufacturing process.pptx
Energy Awareness training ppt for manufacturing process.pptx
 
Input Output Management in Operating System
Input Output Management in Operating SystemInput Output Management in Operating System
Input Output Management in Operating System
 
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
『澳洲文凭』买麦考瑞大学毕业证书成绩单办理澳洲Macquarie文凭学位证书
 
multiple access in wireless communication
multiple access in wireless communicationmultiple access in wireless communication
multiple access in wireless communication
 
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithmComputer Graphics Introduction, Open GL, Line and Circle drawing algorithm
Computer Graphics Introduction, Open GL, Line and Circle drawing algorithm
 
Mine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptxMine Environment II Lab_MI10448MI__________.pptx
Mine Environment II Lab_MI10448MI__________.pptx
 
signals in triangulation .. ...Surveying
signals in triangulation .. ...Surveyingsignals in triangulation .. ...Surveying
signals in triangulation .. ...Surveying
 
TEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACHTEST CASE GENERATION GENERATION BLOCK BOX APPROACH
TEST CASE GENERATION GENERATION BLOCK BOX APPROACH
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solid
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
 
CS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdfCS 3251 Programming in c all unit notes pdf
CS 3251 Programming in c all unit notes pdf
 

NYC Business Incorporation Disparities

  • 1. New NYC Business Incorporation 2005-2013 An Exploration of Non-Minority and Minority-Owned Enterprise Creation By Shelby Ahern stahern@gmail.com NYC Data Science Academy Student Demo day 07-21-2014 R005: Data Science by R(Beginner level)
  • 2. Explore • New Business Incorporation in NYC between 2005-2013, and • New Business Incorporation, by Minority and Non-Minority Ownership Data Sources • Active NewYork Corporations: Beginning in 18001 • NYC Online Directory of Certified Businesses: Minority-Owned Business Enterprises (MBE)2,3 • U.S. Census Population Estimates4 • EntityType: • Domestic BusinessCorporation • Domestic Cooperative Corporation • Domestic Professional Corporation Parameters and Notes • 2005-2013 (9 years) • Borough = County (ie. Manhattan: NewYork County, Brooklyn: Kings County, Queens = Queens County, Bronx = Bronx County, Staten Island = Richmond County
  • 3. Create Data Frames of Data from Each Source Run Summary Statistics forValidation Split by Borough and Combine DFs from Different Sources Perform Calculations ie. New Incorporations per Capita DataViz! Test: “Density” of New MBE Corps for Minority Population ≠ “Density” of New Non-MBECorps per Non-Minority Population
  • 4. An Initial Review of the Summaries of the Corporation Data and MBE-Certified Corporations show… Major disparity between the Number of Incorporations per year, and number of MBE’s established in that year. Why? - Data Quality: Change in Ownership Structure, Restrictions to MBE Certifications, and/or Filing Lag - !!What the Data actually represent: MBE application purpose & process
  • 5. >MH_Corps County year NewCorps NewMBECorps Tot_Pop MBE_pop1 NwCorpsperCap NwMBECorpsperCap NwMBECorpsperMBECap NwNonMBECor psperCap 1NEW YORK 2005 5101 35 1529774 690696 0.0033 2.30E-05 5.10E-05 0.006 2NEW YORK 2006 5395 42 1611581 738221 0.0033 2.60E-05 5.70E-05 0.0061 3NEW YORK 2007 5373 39 1620867 724926 0.0033 2.40E-05 5.40E-05 0.006 4NEW YORK 2008 5602 38 1634795 696413 0.0034 2.30E-05 5.50E-05 0.0059 5NEW YORK 2009 7617 39 1629054 669583 0.0047 2.40E-05 5.80E-05 0.0079 6NEW YORK 2010 9872 34 1585873 674800 0.0062 2.10E-05 5.00E-05 0.0108 7NEW YORK 2011 9909 24 1601948 703250 0.0062 1.50E-05 3.40E-05 0.011 8NEW YORK 2012 10326 15 1619090 697407 0.0064 9.30E-06 2.20E-05 0.0112 9NEW YORK 2013 10345 3 1585873 546732 0.0065 1.90E-06 5.50E-06 0.01 After merging data from different data frames, we are able to calculate the number of new corporations filed per capita, on a yearly basis. Further, we calculate the number of new corporations filed per capita of certain populations, like MBEs/Minority and Non-MBE’s/Non-Minority populations. Example Data Frame, Manhattan
  • 6. $NwCorpsperCap $NwMBECorpsperCap Incorporations per Capita and MBE Incorporations per capita, 2005- 2013
  • 7. MBE Incorporations per Capita, 2005- 2013 $NwCorpsperCap $NwMBECorpsperCap
  • 8. Findings: The per-capita incidence of incorporations increased across all boroughs, from 2005 - 2013. Manhattan, Queens, and Brooklyn had the highest per-capita incorporations. Queens appears to have the steepest increase in corporation filings. MBE incorporations per capita are a thousands of times smaller than the general level of per-capita-incorporation. The per-capita incidence of MBE incorporations varied by borough (led by Manhattan), and trended downward after 2009.
  • 9. Hypothesis:The number of MBE incorporations per non-white person is not equal to the number of non-MBE incorporations per white person. The approach: 1. SelectValue to test:  MBE Corps per Minority capita  Non-MBE Corps per Non-Minority capita ▪ Utilize data from all years and boroughs(5 boroughs x 9 years x 2 categories = 90 obs.) 2. Evaluate which test(s) to conduct.  Parametric vs. Non-parametric  Means test vs. Other 3. Conduct test and analyze results.
  • 10. Histogram, MBE Incorporations per Minority capita QQplot, MBE Incorporations per Minority capita
  • 11. Histogram, Non-MBE Incorporations per Non-Minority capita QQplot, Non-MBE Incorporations per Non-Minority capita
  • 12. Neither MBE nor Non-MBE per capita data appear to be normally distributed. Hence, we’ll consider the following two non-parametric tests: Mood’s MedianTest A nonparametric test where the null hypothesis of the medians of the populations from which two or more samples are drawn are identical. (Wikipedia) H0: Medians of MBE - Minority cap and Non-MBE -- Non-Minority cap are equivalent. H1: Medians of MBE - Minority cap and Non-MBE -- Non-Minority cap are NOT equivalent. Mann-Whitney-Wilcoxon Test A nonparametric test of the null hypothesis that two populations are the same against an alternative hypothesis, especially that a particular population tends to have larger values than the other. (Wikipedia) H0: MBE - Minority cap and Non-MBE -- Non- Minority cap could be representative of the same set of data. H1: MBE - Minority cap and Non-MBE -- Non- Minority cap could NOT be representative of the same set of data.
  • 13. In both tests of parity, the null hypothesis is rejected, thus we find that the incidence of new business incorporations per capita are different between the two populations.

Notes de l'éditeur

  1. Notes: https://data.ny.gov/Economic-Development/Active-Corporations-Beginning-1800/g5xh-vgry. Accessed 7/1/14. http://mtprawvwsbswtp1-1.nyc.gov/Search.aspx. Accessed 7/9/14. Under Article 15-A of the Executive Law, an MBE is a business enterprise in which at least fifty-one percent (51%) is owned, operated and controlled by citizens or permanent resident aliens who are meeting the ethnic definitions: Black, Hispanic, Asian-Pacific, Asian-Indian Subcontinent, Native American. http://www.esd.ny.gov/MWBE/Qualifications.html. Accessed 7/16/2014. 2005: Source: U.S. Census Bureau, 2005 American Community Survey, DP01, General Demographic Characteristics: 2005. 2006: http://www.socialexplorer.com/tables/ACS2006/R10763189. Accessed 7/15/2014. 2007: http://www.socialexplorer.com/tables/ACS2007/R10763198. Accessed 7/15/2014. 2008: http://www.socialexplorer.com/tables/ACS2008/R10763200. Accessed 7/15/2014. 2009: http://www.socialexplorer.com/tables/ACS2009/R10763202. Accessed 7/15/2014. 2010: http://www.socialexplorer.com/tables/C2010/R10763203. Accessed 7/15/2014. 2011: http://www.socialexplorer.com/tables/ACS2011/R10763211. Accessed 7/15/2014. 2012: http://www.socialexplorer.com/tables/ACS2012/R10763214. Accessed 7/15/2014. 2013: Population Estimates, County Characteristics: Vintage 2013. “Annual Estimages of the Resident Population by Sex, Race, and Hispanic Origin: April 1, 2010 to July 1, 2013. http://www.census.gov/popest/data/counties/asrh/2013/index.html. Accessed 7/15/2014.
  2. 1. “MBE_pop” denotes minority population. It is the sum of the populations of the following groups: Black or African American Alone, American Indian and Alaska Native Alone, Asian Alone, Native Hawaiian and Other Pacific Islander Alone, Some Other Race Alone, and “Two or More races”. Note: in 2013, the U.S. Census Bureau stopped the practice of bucketing “Some Other Race Alone,” which is a variation in the data between 2005-2012 and 2013.