SlideShare une entreprise Scribd logo
1  sur  45
Télécharger pour lire hors ligne
What	
  Managers	
  
Need	
  to	
  Know	
  
about	
  Data	
  Science	
  
Annie	
  Flippo	
  
Outline	
  
•  What is data science
•  Industry trends
•  What is data
•  The Optimal Data Scientist
•  The Optimal Manager
•  Topics in Data Science
•  Topics in Cloud Computing
Who	
  am	
  I?	
  
Annie Flippo
Data Scientist
Software Engineer
Product Manager
Database Developer
What	
  is	
  Data	
  Science?	
  
Usage	
  of	
  Data	
  Science	
  
Finance:	
  fraud	
  detecAon,	
  score	
  
buying	
  habits,	
  calculate	
  risks	
  
Insurance:	
  inspect	
  driving	
  habits,	
  
assess	
  risks,	
  determine	
  premiums	
  
Usage	
  of	
  Data	
  Science	
  
Biometrics:	
  wearable	
  devices	
  to	
  
monitor	
  and	
  improve	
  health	
  
Digital	
  MarkeAng:	
  recommender	
  
systems,	
  audience	
  segmentaAon,	
  
retargeAng,	
  churn	
  predicAon	
  
Usage	
  of	
  Data	
  Science	
  
Retail:	
  Walmart	
  launches	
  
compeAAon	
  to	
  solve	
  business	
  
problems	
  and	
  to	
  recruit	
  talent	
  
Online:	
  NeHlix	
  launched	
  $1	
  
million	
  prize	
  to	
  improve	
  
recommendaAon	
  system	
  
Usage	
  of	
  Data	
  Science	
  
Healthcare:	
  Heritage	
  Network	
  	
  
launched	
  a	
  compeAAon	
  to	
  
predict	
  the	
  probability	
  of	
  
hospitalizaAon	
  of	
  paAents.	
  
ScienAfic:	
  NaAonal	
  Data	
  Science	
  
Bowl	
  to	
  predict	
  ocean	
  health:	
  
one	
  plankton	
  at	
  a	
  Ame	
  
Why	
  Should	
  YOU	
  Care?	
  
According	
  to	
  McKinsey1	
  (2011),	
  Big	
  Data:	
  
The	
  next	
  fron5er	
  for	
  innova5on,	
  
compe55on,	
  and	
  produc5vity.	
  
“By	
  2018,	
  the	
  United	
  States	
  alone	
  could	
  
face	
  a	
  shortage	
  of	
  140,000	
  to	
  190,000	
  
people	
  with	
  deep	
  analyAcal	
  skills	
  as	
  well	
  
as	
  1.5	
  million	
  managers	
  and	
  analysts	
  with	
  
the	
  know-­‐how	
  to	
  use	
  the	
  analysis	
  of	
  big	
  
data	
  to	
  make	
  effecAve	
  decisions”	
  
Why	
  Should	
  YOU	
  Care?	
  
According	
  to	
  Forbes2	
  (Oct	
  2015),	
  The	
  Hunt	
  
For	
  Unicorn	
  Data	
  Scien5sts	
  LiCs	
  Salaries	
  For	
  
All	
  Data	
  Analy5cs	
  Professionals	
  
•  Experienced	
  data	
  scienAsts	
  are	
  paid	
  more	
  than	
  $200k	
  
per	
  year	
  
•  Median	
  salary	
  for	
  data	
  scienAst	
  increased	
  from	
  
$115,250	
  to	
  $125,000	
  in	
  one	
  year	
  
•  Managers	
  managing	
  large	
  teams	
  can	
  expect	
  a	
  median	
  
salary	
  of	
  $235,000	
  
Because	
  it’s	
  a	
  
growing	
  and	
  
exciAng	
  field	
  
with	
  high	
  
compensaAon!	
  
Explosion	
  of	
  Data	
  Science
Why	
  now?	
  
•  Storage	
  cost	
  has	
  decreased	
  dramaAcally	
  
•  CompuAng	
  power	
  has	
  increased	
  exponenAally	
  
•  People	
  are	
  carrying	
  smartphones,	
  mini	
  
supercomputers	
  in	
  their	
  pockets	
  
•  Perfect	
  intersecAon	
  of	
  data	
  availability	
  and	
  
compuAng	
  power	
  for	
  analyAcs
Massive	
  amount	
  of	
  data
Streaming	
  into	
  your	
  company	
  …	
  
What	
  is	
  data?
It	
  can	
  be	
  raw	
  web	
  traffic	
  logs	
  …	
  
What	
  is	
  data?
Semi-­‐structured	
  data	
  from	
  APIs	
  …	
  
What	
  is	
  data?
Or,	
  structured	
  data	
  from	
  databases…	
  
…	
  what	
  to	
  do	
  with	
  all	
  this	
  data?	
  
The	
  Data	
  ScienAst
Can	
  wrangle	
  
data	
  from	
  
many	
  
sources	
  or	
  
formats	
  
The	
  Data	
  ScienAst
do	
  deep	
  data	
  
exploraAons	
  …	
  
and	
  perform	
  
thorough	
  analyses	
  	
  
DS	
  Skills	
  Inferred	
  by	
  Job	
  Openings
•  Ph.D.	
  in	
  math,	
  staAsAcs,	
  engineering	
  or	
  
physical	
  science	
  (Is	
  it	
  really	
  required?)	
  
•  Has	
  5+	
  years	
  in	
  programming	
  experience	
  in	
  
Java,	
  Scala,	
  Python,	
  R,	
  SQL,	
  MapReduce,	
  etc.	
  
•  Has	
  5+	
  years	
  experience	
  in	
  most	
  of	
  the	
  
Apache	
  Open	
  Source	
  Technologies	
  (e.g.	
  
Hadoop,	
  Spark,	
  Hive,	
  Pig,	
  Kaka,	
  etc)*	
  
•  Tell	
  a	
  story	
  like	
  a	
  novelist	
  (coherently	
  and	
  
beauAfully)	
  
*	
  By	
  the	
  Ame	
  you	
  read	
  this	
  footnote,	
  the	
  Apache	
  stack	
  has	
  already	
  grown.	
  
The	
  OpAmal	
  Data	
  ScienAst
Is	
  a	
  person	
  with	
  deep	
  staAsAcal	
  and	
  machine	
  
learning	
  knowledge,	
  extensive	
  somware	
  
engineering	
  skills	
  and	
  well-­‐versed	
  in	
  business	
  
strategy!	
  
The	
  OpAmal	
  Data	
  ScienAst	
  –	
  Take	
  2
Personality	
  Traits3	
  
•  Compulsive	
  
•  Propulsive	
  laziness	
  
•  Drive	
  to	
  create	
  and	
  learn	
  
•  Irritable	
  determinaAon	
  
•  InsensiAvity	
  to	
  pain	
  (hmm…)	
  
•  Integrity	
  
•  Humility	
  
The	
  OpAmal	
  DS	
  Manager
•  Former	
  data	
  scienAst	
  (good	
  to	
  have	
  but	
  not	
  
necessary;	
  that’s	
  just	
  asking	
  for	
  another	
  
unicorn!)	
  
•  Actually	
  interested	
  in	
  managing	
  people	
  
•  Thirst	
  to	
  learn	
  	
  
•  Apt	
  in	
  managing	
  different	
  projects	
  
•  PaAent	
  and	
  diplomaAc	
  to	
  manage	
  a	
  diverse	
  
group	
  of	
  data	
  scienAsts	
  and	
  business	
  owners	
  
•  Understand	
  when	
  to	
  go	
  with	
  an	
  80/20	
  
approach	
  	
  
Data	
  ScienAsts:	
  The	
  Challenge	
  of	
  Managing	
  
Stubbornly	
  Autonomous	
  Experts4	
  
	
  
“I	
  no5ced	
  …	
  that	
  data	
  
scien5sts,	
  but	
  also	
  sta5s5cians	
  
and	
  top	
  coders,	
  oCen	
  have	
  
difficul5es	
  accep5ng	
  orders	
  
from	
  managers	
  who	
  don’t	
  
have	
  technical	
  skills	
  
themselves.”	
  -­‐	
  Istvan	
  Hajnal	
  
Journey	
  to	
  become	
  a	
  DS	
  Manager	
  
	
  
Nate	
  Silver	
  on	
  Finding	
  a	
  Mentor,	
  Teaching	
  
Yourself	
  StaAsAcs,	
  and	
  Not	
  Sesling	
  in	
  Your	
  
Career5	
  
•  Find	
  a	
  Mentor	
  (Yes,	
  even	
  if	
  you’re	
  already	
  a	
  
senior	
  manager)	
  
•  Teach	
  Yourself	
  (online	
  resources,	
  MOOCs)	
  
•  Understand	
  the	
  life-­‐cycle	
  of	
  a	
  data-­‐driven	
  
project	
  
•  Just	
  do	
  it!	
  
Why	
  Just	
  Do	
  It?	
  
	
  
Why	
  do	
  I	
  need	
  to	
  learn	
  about	
  data	
  science	
  
and	
  manage	
  data	
  projects?	
  
	
  
“I	
  have	
  [insert	
  #	
  of	
  years]	
  years	
  of	
  
experience	
  in	
  [insert	
  my	
  industry].	
  	
  
I’m	
  comfortable	
  and	
  successful	
  
being	
  a	
  [insert	
  your	
  Atle	
  here].”	
  
Company	
  Structures
Data	
  Sources
Data	
  projects	
  are	
  lurking	
  everywhere	
  …	
  
Machine	
  Learning
Machine	
  Learning
Machine	
  Learning
Machine	
  Learning
Google	
  X	
  laboratory5	
  
Machine	
  Learning
Google	
  Research6	
  
Data	
  Science	
  Concepts
PredicAve	
  AnalyAcs	
  
ClassificaAon	
  
RecommendaAon	
  Systems	
  
Big	
  Data	
  Technology
Topics	
  in	
  Cloud	
  CompuAng
New	
  services	
  added:	
  
Your	
  Job:	
  Provide	
  Guidance
Tell	
  us	
  a	
  data	
  story	
  	
  
…	
  about	
  your	
  business	
  
Do	
  you	
  understand	
  
the	
  outcome?	
  
	
  
What	
  is	
  your	
  
recommendaAon	
  to	
  
the	
  business?	
  
Gezng	
  Started:	
  Locally
Meetups	
  
•  LA	
  R	
  users	
  group	
  
•  LA	
  Machine	
  Learning	
  
•  LA	
  Data	
  Warehouse,	
  BI	
  &	
  AnalyAcs	
  
•  LA	
  Big	
  Data	
  Users	
  Group	
  
Conferences:	
  
•  datascience.la	
  
•  bigdatadayla.org	
  
Gezng	
  Started:	
  Podcasts
dataskepAc.com	
   thetalkingmachines.com	
  
Gezng	
  Started:	
  MOOCs
Good	
  Places	
  to	
  Start
Data	
  Science	
  for	
  Business	
  
	
  
by	
  Foster	
  Provost	
  	
  
&	
  Tom	
  Fawces	
  
Good	
  Places	
  to	
  Start
Doing	
  Data	
  Science	
  
	
  
by	
  Rachel	
  Schus	
  &	
  Cathy	
  
O’Neil	
  (mathbabe.org)	
  
	
  
Free	
  at	
  
www.columbiadatascience.com	
  
Good	
  Places	
  to	
  Start
The	
  Art	
  of	
  Data	
  Science	
  	
  
	
  
by	
  Roger	
  Peng	
  &	
  Elizabeth	
  
Matsui	
  
	
  
hsps://leanpub.com/artofdatascience	
  
Get	
  Kids	
  Started
scratch.mit.edu	
   www.ixl.com	
  
Thank	
  You!
Annie	
  Flippo	
   @ACflippo	
  
Slides	
  are	
  available	
  at	
  goo.gl/1X2NMH	
  
References	
  
	
  1.  hsp://www.mckinsey.com/insights/business_technology/
big_data_the_next_fronAer_for_innovaAon	
  
2.  hsp://www.forbes.com/sites/gilpress/2015/10/09/the-­‐hunt-­‐for-­‐unicorn-­‐
data-­‐scienAsts-­‐lims-­‐salaries-­‐for-­‐all-­‐data-­‐analyAcs-­‐professionals/	
  
3.  hsp://cdn.oreillystaAc.com/en/assets/1/event/119/Data%20Science
%20Bootcamp%20PresentaAon.pdf	
  
4.  hsp://www.ibmbigdatahub.com/blog/data-­‐scienAsts-­‐challenge-­‐managing-­‐
stubbornly-­‐autonomous-­‐experts	
  
5.  hsps://hbr.org/2013/09/nate-­‐silver-­‐on-­‐finding-­‐a-­‐mentor-­‐teaching-­‐yourself-­‐
staAsAcs-­‐and-­‐not-­‐sesling-­‐in-­‐your-­‐career/	
  
6.  hsp://www.nyAmes.com/2012/06/26/technology/in-­‐a-­‐big-­‐network-­‐of-­‐
computers-­‐evidence-­‐of-­‐machine-­‐learning.html	
  
7.  hsp://research.google.com/archive/unsupervised_icml2012.html	
  

Contenu connexe

Tendances

Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
mark madsen
 
Data Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data MeetupData Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data Meetup
David Johnston
 
Solve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for HumansSolve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for Humans
mark madsen
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
mark madsen
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...
mark madsen
 

Tendances (20)

Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 
Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field Managing Data Science | Lessons from the Field
Managing Data Science | Lessons from the Field
 
H2O World - Machine Learning for non-data scientists
H2O World - Machine Learning for non-data scientistsH2O World - Machine Learning for non-data scientists
H2O World - Machine Learning for non-data scientists
 
How to understand trends in the data & software market
How to understand trends in the data & software marketHow to understand trends in the data & software market
How to understand trends in the data & software market
 
Data Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data MeetupData Science Consulting at ThoughtWorks -- NYC Open Data Meetup
Data Science Consulting at ThoughtWorks -- NYC Open Data Meetup
 
Solve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for HumansSolve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for Humans
 
How to Build Data Science Teams
How to Build Data Science TeamsHow to Build Data Science Teams
How to Build Data Science Teams
 
How to Build Successful Data Team - Dataiku ?
How to Build Successful Data Team -  Dataiku ? How to Build Successful Data Team -  Dataiku ?
How to Build Successful Data Team - Dataiku ?
 
Intro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data ScientistsIntro to Data Science for Non-Data Scientists
Intro to Data Science for Non-Data Scientists
 
The Emerging Role of the Data Lake
The Emerging Role of the Data LakeThe Emerging Role of the Data Lake
The Emerging Role of the Data Lake
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
 
An AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven OrganizationAn AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven Organization
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist Toolbox
 
Data Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of PeopleData Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of People
 
Knowledge Graphs for a Connected World - AI, Deep & Machine Learning Meetup
Knowledge Graphs for a Connected World - AI, Deep & Machine Learning MeetupKnowledge Graphs for a Connected World - AI, Deep & Machine Learning Meetup
Knowledge Graphs for a Connected World - AI, Deep & Machine Learning Meetup
 
Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019
 
Data science as a professional career
Data science as a professional careerData science as a professional career
Data science as a professional career
 
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data WrongThe Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
The Heart of Data Modeling: 7 Ways Your Agile Project is Managing Data Wrong
 
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ...
Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...Michael Stonebraker:  Big Data, Disruption, and the 800 Pound Gorilla in the ...
Michael Stonebraker: Big Data, Disruption, and the 800 Pound Gorilla in the ...
 

Similaire à What Managers Need to Know about Data Science

Similaire à What Managers Need to Know about Data Science (20)

Which institute is best for data science?
Which institute is best for data science?Which institute is best for data science?
Which institute is best for data science?
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
Data science training institute in hyderabad
Data science training institute in hyderabadData science training institute in hyderabad
Data science training institute in hyderabad
 
Data science training in Hyderabad
Data science  training in HyderabadData science  training in Hyderabad
Data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
Data science training in hyd ppt (1)
Data science training in hyd ppt (1)Data science training in hyd ppt (1)
Data science training in hyd ppt (1)
 
data science training and placement
data science training and placementdata science training and placement
data science training and placement
 
online data science training
online data science trainingonline data science training
online data science training
 
Data science online training in hyderabad
Data science online training in hyderabadData science online training in hyderabad
Data science online training in hyderabad
 
data science online training in hyderabad
data science online training in hyderabaddata science online training in hyderabad
data science online training in hyderabad
 
Best data science training in Hyderabad
Best data science training in HyderabadBest data science training in Hyderabad
Best data science training in Hyderabad
 
Data science training Hyderabad
Data science training HyderabadData science training Hyderabad
Data science training Hyderabad
 
Lean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science teamLean Analytics: How to get more out of your data science team
Lean Analytics: How to get more out of your data science team
 
Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)Data science training in hyd ppt converted (1)
Data science training in hyd ppt converted (1)
 
Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)Data science training in hyd pdf converted (1)
Data science training in hyd pdf converted (1)
 
Data science training in hydpdf converted (1)
Data science training in hydpdf  converted (1)Data science training in hydpdf  converted (1)
Data science training in hydpdf converted (1)
 
Data science presentation
Data science presentationData science presentation
Data science presentation
 
Best Selenium certification course
Best Selenium certification courseBest Selenium certification course
Best Selenium certification course
 

Plus de Annie Flippo

Plus de Annie Flippo (6)

How to Start a Data Science Initiative and Grow Your Team
How to Start a Data Science Initiative and Grow Your TeamHow to Start a Data Science Initiative and Grow Your Team
How to Start a Data Science Initiative and Grow Your Team
 
User Motivation: Refining Customer Segments with Location
User Motivation: Refining Customer Segments with LocationUser Motivation: Refining Customer Segments with Location
User Motivation: Refining Customer Segments with Location
 
Location Intelligence Goes Big in Digital Marketing
Location Intelligence Goes Big in Digital MarketingLocation Intelligence Goes Big in Digital Marketing
Location Intelligence Goes Big in Digital Marketing
 
Techniques to generate training data
Techniques to generate training dataTechniques to generate training data
Techniques to generate training data
 
Predict YouTube Video Views
Predict YouTube Video ViewsPredict YouTube Video Views
Predict YouTube Video Views
 
Use NLP to Solve Business Problems
Use NLP to Solve Business ProblemsUse NLP to Solve Business Problems
Use NLP to Solve Business Problems
 

Dernier

Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
gajnagarg
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
HyderabadDolls
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
gajnagarg
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
nirzagarg
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
vexqp
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
SayantanBiswas37
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
HyderabadDolls
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 

Dernier (20)

Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
Top profile Call Girls In dimapur [ 7014168258 ] Call Me For Genuine Models W...
 
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
Sealdah % High Class Call Girls Kolkata - 450+ Call Girl Cash Payment 8005736...
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
Top profile Call Girls In Bihar Sharif [ 7014168258 ] Call Me For Genuine Mod...
 
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
怎样办理圣地亚哥州立大学毕业证(SDSU毕业证书)成绩单学校原版复制
 
Computer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdfComputer science Sql cheat sheet.pdf.pdf
Computer science Sql cheat sheet.pdf.pdf
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
Jodhpur Park | Call Girls in Kolkata Phone No 8005736733 Elite Escort Service...
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With OrangePredicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
Predicting HDB Resale Prices - Conducting Linear Regression Analysis With Orange
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 

What Managers Need to Know about Data Science

  • 1. What  Managers   Need  to  Know   about  Data  Science   Annie  Flippo  
  • 2. Outline   •  What is data science •  Industry trends •  What is data •  The Optimal Data Scientist •  The Optimal Manager •  Topics in Data Science •  Topics in Cloud Computing
  • 3. Who  am  I?   Annie Flippo Data Scientist Software Engineer Product Manager Database Developer
  • 4. What  is  Data  Science?  
  • 5. Usage  of  Data  Science   Finance:  fraud  detecAon,  score   buying  habits,  calculate  risks   Insurance:  inspect  driving  habits,   assess  risks,  determine  premiums  
  • 6. Usage  of  Data  Science   Biometrics:  wearable  devices  to   monitor  and  improve  health   Digital  MarkeAng:  recommender   systems,  audience  segmentaAon,   retargeAng,  churn  predicAon  
  • 7. Usage  of  Data  Science   Retail:  Walmart  launches   compeAAon  to  solve  business   problems  and  to  recruit  talent   Online:  NeHlix  launched  $1   million  prize  to  improve   recommendaAon  system  
  • 8. Usage  of  Data  Science   Healthcare:  Heritage  Network     launched  a  compeAAon  to   predict  the  probability  of   hospitalizaAon  of  paAents.   ScienAfic:  NaAonal  Data  Science   Bowl  to  predict  ocean  health:   one  plankton  at  a  Ame  
  • 9. Why  Should  YOU  Care?   According  to  McKinsey1  (2011),  Big  Data:   The  next  fron5er  for  innova5on,   compe55on,  and  produc5vity.   “By  2018,  the  United  States  alone  could   face  a  shortage  of  140,000  to  190,000   people  with  deep  analyAcal  skills  as  well   as  1.5  million  managers  and  analysts  with   the  know-­‐how  to  use  the  analysis  of  big   data  to  make  effecAve  decisions”  
  • 10. Why  Should  YOU  Care?   According  to  Forbes2  (Oct  2015),  The  Hunt   For  Unicorn  Data  Scien5sts  LiCs  Salaries  For   All  Data  Analy5cs  Professionals   •  Experienced  data  scienAsts  are  paid  more  than  $200k   per  year   •  Median  salary  for  data  scienAst  increased  from   $115,250  to  $125,000  in  one  year   •  Managers  managing  large  teams  can  expect  a  median   salary  of  $235,000  
  • 11. Because  it’s  a   growing  and   exciAng  field   with  high   compensaAon!  
  • 12. Explosion  of  Data  Science Why  now?   •  Storage  cost  has  decreased  dramaAcally   •  CompuAng  power  has  increased  exponenAally   •  People  are  carrying  smartphones,  mini   supercomputers  in  their  pockets   •  Perfect  intersecAon  of  data  availability  and   compuAng  power  for  analyAcs
  • 13. Massive  amount  of  data Streaming  into  your  company  …  
  • 14. What  is  data? It  can  be  raw  web  traffic  logs  …  
  • 15. What  is  data? Semi-­‐structured  data  from  APIs  …  
  • 16. What  is  data? Or,  structured  data  from  databases…   …  what  to  do  with  all  this  data?  
  • 17. The  Data  ScienAst Can  wrangle   data  from   many   sources  or   formats  
  • 18. The  Data  ScienAst do  deep  data   exploraAons  …   and  perform   thorough  analyses    
  • 19. DS  Skills  Inferred  by  Job  Openings •  Ph.D.  in  math,  staAsAcs,  engineering  or   physical  science  (Is  it  really  required?)   •  Has  5+  years  in  programming  experience  in   Java,  Scala,  Python,  R,  SQL,  MapReduce,  etc.   •  Has  5+  years  experience  in  most  of  the   Apache  Open  Source  Technologies  (e.g.   Hadoop,  Spark,  Hive,  Pig,  Kaka,  etc)*   •  Tell  a  story  like  a  novelist  (coherently  and   beauAfully)   *  By  the  Ame  you  read  this  footnote,  the  Apache  stack  has  already  grown.  
  • 20. The  OpAmal  Data  ScienAst Is  a  person  with  deep  staAsAcal  and  machine   learning  knowledge,  extensive  somware   engineering  skills  and  well-­‐versed  in  business   strategy!  
  • 21. The  OpAmal  Data  ScienAst  –  Take  2 Personality  Traits3   •  Compulsive   •  Propulsive  laziness   •  Drive  to  create  and  learn   •  Irritable  determinaAon   •  InsensiAvity  to  pain  (hmm…)   •  Integrity   •  Humility  
  • 22. The  OpAmal  DS  Manager •  Former  data  scienAst  (good  to  have  but  not   necessary;  that’s  just  asking  for  another   unicorn!)   •  Actually  interested  in  managing  people   •  Thirst  to  learn     •  Apt  in  managing  different  projects   •  PaAent  and  diplomaAc  to  manage  a  diverse   group  of  data  scienAsts  and  business  owners   •  Understand  when  to  go  with  an  80/20   approach    
  • 23. Data  ScienAsts:  The  Challenge  of  Managing   Stubbornly  Autonomous  Experts4     “I  no5ced  …  that  data   scien5sts,  but  also  sta5s5cians   and  top  coders,  oCen  have   difficul5es  accep5ng  orders   from  managers  who  don’t   have  technical  skills   themselves.”  -­‐  Istvan  Hajnal  
  • 24. Journey  to  become  a  DS  Manager     Nate  Silver  on  Finding  a  Mentor,  Teaching   Yourself  StaAsAcs,  and  Not  Sesling  in  Your   Career5   •  Find  a  Mentor  (Yes,  even  if  you’re  already  a   senior  manager)   •  Teach  Yourself  (online  resources,  MOOCs)   •  Understand  the  life-­‐cycle  of  a  data-­‐driven   project   •  Just  do  it!  
  • 25. Why  Just  Do  It?     Why  do  I  need  to  learn  about  data  science   and  manage  data  projects?     “I  have  [insert  #  of  years]  years  of   experience  in  [insert  my  industry].     I’m  comfortable  and  successful   being  a  [insert  your  Atle  here].”  
  • 27. Data  Sources Data  projects  are  lurking  everywhere  …  
  • 31. Machine  Learning Google  X  laboratory5  
  • 33. Data  Science  Concepts PredicAve  AnalyAcs   ClassificaAon   RecommendaAon  Systems  
  • 35. Topics  in  Cloud  CompuAng New  services  added:  
  • 36. Your  Job:  Provide  Guidance Tell  us  a  data  story     …  about  your  business   Do  you  understand   the  outcome?     What  is  your   recommendaAon  to   the  business?  
  • 37. Gezng  Started:  Locally Meetups   •  LA  R  users  group   •  LA  Machine  Learning   •  LA  Data  Warehouse,  BI  &  AnalyAcs   •  LA  Big  Data  Users  Group   Conferences:   •  datascience.la   •  bigdatadayla.org  
  • 38. Gezng  Started:  Podcasts dataskepAc.com   thetalkingmachines.com  
  • 40. Good  Places  to  Start Data  Science  for  Business     by  Foster  Provost     &  Tom  Fawces  
  • 41. Good  Places  to  Start Doing  Data  Science     by  Rachel  Schus  &  Cathy   O’Neil  (mathbabe.org)     Free  at   www.columbiadatascience.com  
  • 42. Good  Places  to  Start The  Art  of  Data  Science       by  Roger  Peng  &  Elizabeth   Matsui     hsps://leanpub.com/artofdatascience  
  • 44. Thank  You! Annie  Flippo   @ACflippo   Slides  are  available  at  goo.gl/1X2NMH  
  • 45. References    1.  hsp://www.mckinsey.com/insights/business_technology/ big_data_the_next_fronAer_for_innovaAon   2.  hsp://www.forbes.com/sites/gilpress/2015/10/09/the-­‐hunt-­‐for-­‐unicorn-­‐ data-­‐scienAsts-­‐lims-­‐salaries-­‐for-­‐all-­‐data-­‐analyAcs-­‐professionals/   3.  hsp://cdn.oreillystaAc.com/en/assets/1/event/119/Data%20Science %20Bootcamp%20PresentaAon.pdf   4.  hsp://www.ibmbigdatahub.com/blog/data-­‐scienAsts-­‐challenge-­‐managing-­‐ stubbornly-­‐autonomous-­‐experts   5.  hsps://hbr.org/2013/09/nate-­‐silver-­‐on-­‐finding-­‐a-­‐mentor-­‐teaching-­‐yourself-­‐ staAsAcs-­‐and-­‐not-­‐sesling-­‐in-­‐your-­‐career/   6.  hsp://www.nyAmes.com/2012/06/26/technology/in-­‐a-­‐big-­‐network-­‐of-­‐ computers-­‐evidence-­‐of-­‐machine-­‐learning.html   7.  hsp://research.google.com/archive/unsupervised_icml2012.html