SlideShare une entreprise Scribd logo
1  sur  24
Data  Scientist  Profiles
Tuesday  May  19th  2015
By  Dries  Van  Nieuwenhuyse
Prof.  EHSAL  Management  School
Researcher  at  BICC  Thomas  More
Lecturer  Strategic  Management  HoGent
Agenda
• Do  analytics  matter?  Of  course!
• What’s  in  a  name:  do  we  really  need  a  new  term  ”data  scientist”  for  this?
• Characteristics  of  data  scientists
• How  do  we  recognize  them?
• How  do  we  build  up  data  scientist  teams?
• Feasible?
Do analytics matters? Of course!
Who’s on Facebook?
Who’s on LinkedIn?
How much overlap is there in the PYMK suggestions? (20 – 40 – 60 – 80 – 100)
Do analytics matters? Of course!
« Some companies have built their very businesses on their
ability to collect, analyze, and act on data. Every company
can learn from what these firms do. »
Thomas H. Davenport, 2006. Competing on analytics. Harvard Business Review, January 2006 p99-107.
What’s in a name?
Whether  employers  know  or  don’t  know  what  data  
scientists  do,  they  have  been  using  -­‐ in  rapidly  
growing  numbers  -­‐ the  term  “data  scientist”  in  job  
descriptions  in  recent  years...
What’s in a name?
What then are the data scientists, these new men and women of industry?
o Are  they  scientists?  
o Engineers?  
o Programmers?  
o Business  Controllers?  
o Financial  Controllers?  
o A  new  breed  of  business  decision-­‐makers  and  innovators?
What’s in a name?
• Google‘s  chief  economist  Hal  Varian  commented  in  January  [2009]  that  the  next  sexy  job  in  the  
next  10  years  would  be  statisticians.   By  statisticians,  he  actually  meant  it  as  a  general  title  for  
someone  who  is  able  to  extract  information  from  large  datasets  and  then  present  something  of  
use  to  non-­‐data  experts…
• In  June  2009  in  a  blog  post  titled  “Rise  of  the  Data  Scientist”  by  Natahn  Yau,  a  PhD  candidate  in  
statistics,  the  term  was  first  really  used.
What’s in a name?
“What data scientists do is make discoveries while
swimming in data… [their] dominant trait is intense
curiosity - a desire to go beneath the surface of a problem,
find the questions at its heart, and distill them into a very
clear set of hypotheses that can be tested. This often entails
the associative thinking that characterizes the most
creative scientists in any field….”
Thomas H. Davenport & D.J. Patil (October 2012). Data Scientist: The Sexiest Job of the 21st Century.
What’s in a name?
A  data  scientist  is  an  engineer  who  employs  the  scientific  method  and  applies  data-­‐discovery  
tools  to  find  new  insights  in  data.  The  scientific  method—the  formulation  of  a  hypothesis,  the  
testing,  the  careful  design  of  experiments,  the  verification  by  others—is  something  they  take  
from  their  knowledge  of  statistics  and  their  training  in  scientific  disciplines.  The  application  (and  
tweaking)  of  tools  comes  from  their  engineering,  or  more  specifically,  computer  science  and  
programming  background.  The  best  data  scientists  are  product  and  process  innovators  and  
sometimes,  developers  of  new  data-­‐discovery  tools.
Data Scientists: The Definition of Sexy (Gil Press)
http://www.forbes.com/sites/gilpress/2012/09/27/data-scientists-the-definition-of-sexy/
What’s in a name?
A  data  scientist  is  a  job  title  for  an  employee  or  business  intelligence  (BI)  consultant  who  excels  at  
analyzing  data,  particularly  large  amounts  of  data,  to  help  a  business  gain  a  competitive  edge.  
Margaret Rouse, WhatIs.com
How can we recognize a data scientist?
• Very  different  backgrounds
• Curiosity    beyond  day-­‐to-­‐day  activities
• Bricolage  versus  engineering…
• Questions  are  more  important  than  answers
D.J. Patil (2011) Building Data Science Teams. O’Reilly Media.
Skills of a data scientist
The significant problems we face
cannot be solvedby the same level
of thinking that created them.
If  I  had  an  hour  to  solve  a  problem  and  my  life  depended  
on  the  solution,  I  would  spend  the  first  55  minutes  
determining  the  proper  question  to  ask,  for  once  I  know  
the  proper  question,  I  could  solve  the  problem  in  less  
than  five  minutes.
A. Einstein
Skills of a data scientist
• Finding  rich  data  sources
• Working  with  large  volumes  of  data  despite  hardware,  software,  and  bandwidth  constraint
• Cleaning  the  data  and  making  sure  that  data  is  consistent
• Merging  multiple  datasets  together
• Visualizing  that  data
• Building  rich  tooling  that  enables  others  to  work  with  data  effectively
D.J. Patil (2011) Building Data Science Teams. O’Reilly Media
Data scientist = BI professional?
EMC (2011). Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field.
Data scientist = BI professional?
EMC (2011). Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field.
• BI  professionals  focus  on  qualitative  visualization  of  existing  business  data
• Data  scientists  apply  advanced  analytical  tools  to  generate  predictive  insights
• More  communication
• More  scientifically  trained  
• Introvert  – Extravert
• Bricolage  -­‐ Engineering
Do data scientists need a PHD?
Changing role of controllers
• Gradient  between  Flexibility  and  
Control
• Gradient  between  Internal  and  
External  focus
• Finance  is  moving  in
Cornel, Renes & Vervuurt (2013). Controllers - Fit for the future. MCA
Can BI professionals become data
scientists?
• Nowadays  everyone  wants  to  become  proactive,  analytical  
and  strategically  aligned
• Will  they  all  succeed  in  this  mental  shift?
• Of  course  they  won’t  all  succeed
• So  there  is  plenty  of  room  for  talent
• Obviously  a  good  understanding  of  the  domain  of  
Performance  MANAGEMENT  in  compleness  will  play  a  
pivotal  role  in  this  change
Can BI professionals become data
scientists?
• Sure  they  can!!!
• Own  research  shows  that  personal  traits  are  in  the  end  prevailing
• Not  everyone  needs  to  kick  off  the  party
• Soft  skills  are  more  important  than  hard  skills
• Hard  skills  are  more  obvious  to  learn
• Diversity  of  multi-­‐disciplinary  teams  is  more  important  than  individual  skills
• Check  for  proactive  and  creative  thinking
How to recruit?
• Focus  recruiting  the  ‘usual  suspects’  – (commercial)  engineers,  hard  scientists,  big  data  guru’s,  
bricoleurs
• Scan  memberships  and  active  people  in  analytical  and  decision-­‐making  communities
• Steal  talent  from  Finance,  IT,  Marketing
• Actively  scan  LinkedIn
• Organize  a  contest  for  data  science  open  to  all  profiles
• Did  the  candidate  publish  in  magazines,  books?
How to recruit?
• Test  for  creativity  – ask  for  a  possible  research  agenda  for  your  company  testing  whether  they  are  
actually  prepared
• Organize  for  continuous  job  satisfaction  and  spontaneous  evolution  through  the  organization
• Let  candidate  work  on  a  data  set  for  a  day,  come  up  with  proper  questions  and  answers  and  let  the  
candidate  present  and  convince  an  audience  of  decision-­‐makers
• Avoid  overskilled  and  overtechnical  PhD’s  that  can’t  communicate
Doesn’t matter who takes the lead... just get
started
“Don't wait until everything is just right. It will never
be perfect. There will always be challenges, obstacles
and less than perfect conditions. So what. Get started
now. With each step you take, you will grow stronger
and stronger, more and more skilled, more and more
self-confident and more and more successful.”
Mark Victor Hansen
What's the profile of a data scientist?

Contenu connexe

Tendances

Data-Ed Online Webinar: Data-centric Strategy & Roadmap
Data-Ed Online Webinar: Data-centric Strategy & RoadmapData-Ed Online Webinar: Data-centric Strategy & Roadmap
Data-Ed Online Webinar: Data-centric Strategy & RoadmapDATAVERSITY
 
Dcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrisonDcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrisonAlan Morrison
 
Moving From Insight to Action
Moving From Insight to ActionMoving From Insight to Action
Moving From Insight to ActionInvoca
 
How to use your data science team: Becoming a data-driven organization
How to use your data science team: Becoming a data-driven organizationHow to use your data science team: Becoming a data-driven organization
How to use your data science team: Becoming a data-driven organizationYael Garten
 
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...Sandra Fernandes
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyInside Analysis
 
Data-Ed: A Framework for no sql and Hadoop
Data-Ed: A Framework for no sql and HadoopData-Ed: A Framework for no sql and Hadoop
Data-Ed: A Framework for no sql and HadoopData Blueprint
 
Dashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIs
Dashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIsDashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIs
Dashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIsLuciano Pesci, PhD
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)Arvind Sathi
 
The Chief Data Officer's Agenda: The Power of Data Strategy
The Chief Data Officer's Agenda: The Power of Data StrategyThe Chief Data Officer's Agenda: The Power of Data Strategy
The Chief Data Officer's Agenda: The Power of Data StrategyDATAVERSITY
 
Industry Focus Camp SCB17 "How to build a data driven organization"
Industry Focus Camp SCB17 "How to build a data driven organization"Industry Focus Camp SCB17 "How to build a data driven organization"
Industry Focus Camp SCB17 "How to build a data driven organization"Bundesverband Deutsche Startups e.V.
 
How to turn traditional Industrial Companies into Modern, Digital-Data Driven...
How to turn traditional Industrial Companies into Modern, Digital-Data Driven...How to turn traditional Industrial Companies into Modern, Digital-Data Driven...
How to turn traditional Industrial Companies into Modern, Digital-Data Driven...Data Con LA
 
Blockchain demystified
Blockchain demystifiedBlockchain demystified
Blockchain demystifiedAlan Morrison
 
ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...
ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...
ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...DATAVERSITY
 
Bde presentation dv
Bde presentation dvBde presentation dv
Bde presentation dvBigDataExpo
 
Cognitive computing in Insurance
Cognitive computing in InsuranceCognitive computing in Insurance
Cognitive computing in InsuranceAnders Quitzau
 
Information 3.0 - Data + Technology + People
Information 3.0 - Data + Technology + PeopleInformation 3.0 - Data + Technology + People
Information 3.0 - Data + Technology + PeopleHubbard One
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCarl Anderson
 
Whitebook on Big Data
Whitebook on Big DataWhitebook on Big Data
Whitebook on Big DataViren Aul
 

Tendances (20)

Data-Ed Online Webinar: Data-centric Strategy & Roadmap
Data-Ed Online Webinar: Data-centric Strategy & RoadmapData-Ed Online Webinar: Data-centric Strategy & Roadmap
Data-Ed Online Webinar: Data-centric Strategy & Roadmap
 
Dcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrisonDcaf transformation & kg adoption 2022 -alan morrison
Dcaf transformation & kg adoption 2022 -alan morrison
 
Moving From Insight to Action
Moving From Insight to ActionMoving From Insight to Action
Moving From Insight to Action
 
How to use your data science team: Becoming a data-driven organization
How to use your data science team: Becoming a data-driven organizationHow to use your data science team: Becoming a data-driven organization
How to use your data science team: Becoming a data-driven organization
 
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
Nasscomilf2014 thedigitalenterprise-bigdataandanalyticsleadtheway-thomashdave...
 
Understanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data QuicklyUnderstanding What’s Possible: Getting Business Value from Big Data Quickly
Understanding What’s Possible: Getting Business Value from Big Data Quickly
 
Data-Ed: A Framework for no sql and Hadoop
Data-Ed: A Framework for no sql and HadoopData-Ed: A Framework for no sql and Hadoop
Data-Ed: A Framework for no sql and Hadoop
 
Dashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIs
Dashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIsDashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIs
Dashboards are Dumb Data - Why Smart Analytics Will Kill Your KPIs
 
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
5733   a deep dive into IBM Watson Foundation for CSP (WFC)5733   a deep dive into IBM Watson Foundation for CSP (WFC)
5733 a deep dive into IBM Watson Foundation for CSP (WFC)
 
The Chief Data Officer's Agenda: The Power of Data Strategy
The Chief Data Officer's Agenda: The Power of Data StrategyThe Chief Data Officer's Agenda: The Power of Data Strategy
The Chief Data Officer's Agenda: The Power of Data Strategy
 
Industry Focus Camp SCB17 "How to build a data driven organization"
Industry Focus Camp SCB17 "How to build a data driven organization"Industry Focus Camp SCB17 "How to build a data driven organization"
Industry Focus Camp SCB17 "How to build a data driven organization"
 
How to turn traditional Industrial Companies into Modern, Digital-Data Driven...
How to turn traditional Industrial Companies into Modern, Digital-Data Driven...How to turn traditional Industrial Companies into Modern, Digital-Data Driven...
How to turn traditional Industrial Companies into Modern, Digital-Data Driven...
 
NBSintro2013
NBSintro2013NBSintro2013
NBSintro2013
 
Blockchain demystified
Blockchain demystifiedBlockchain demystified
Blockchain demystified
 
ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...
ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...
ADV Slides: Organizational Change Management in Becoming an Analytic Organiza...
 
Bde presentation dv
Bde presentation dvBde presentation dv
Bde presentation dv
 
Cognitive computing in Insurance
Cognitive computing in InsuranceCognitive computing in Insurance
Cognitive computing in Insurance
 
Information 3.0 - Data + Technology + People
Information 3.0 - Data + Technology + PeopleInformation 3.0 - Data + Technology + People
Information 3.0 - Data + Technology + People
 
Creating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetupCreating a Data-Driven Organization -- thisismetis meetup
Creating a Data-Driven Organization -- thisismetis meetup
 
Whitebook on Big Data
Whitebook on Big DataWhitebook on Big Data
Whitebook on Big Data
 

En vedette

BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BICC Thomas More
 
BI congres 2016: programma
BI congres 2016: programmaBI congres 2016: programma
BI congres 2016: programmaBICC Thomas More
 
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...BICC Thomas More
 
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...BICC Thomas More
 
BI congres 2014-4: thinking out of the box - Jos Cools - Crosspoint
BI congres 2014-4: thinking out of the box - Jos Cools - CrosspointBI congres 2014-4: thinking out of the box - Jos Cools - Crosspoint
BI congres 2014-4: thinking out of the box - Jos Cools - CrosspointBICC Thomas More
 
BI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas More
BI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas MoreBI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas More
BI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas MoreBICC Thomas More
 
Performance MANAGEMENT 3.0 - De evidentie zelf
Performance MANAGEMENT 3.0 - De evidentie zelfPerformance MANAGEMENT 3.0 - De evidentie zelf
Performance MANAGEMENT 3.0 - De evidentie zelfBICC Thomas More
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBICC Thomas More
 
BI congres 2014-3: facts not opinions - Tobias Temmink - Teradata
BI congres 2014-3: facts not opinions - Tobias Temmink - TeradataBI congres 2014-3: facts not opinions - Tobias Temmink - Teradata
BI congres 2014-3: facts not opinions - Tobias Temmink - TeradataBICC Thomas More
 
데이터 사이언스 소개 - 정준호
데이터 사이언스 소개 -  정준호데이터 사이언스 소개 -  정준호
데이터 사이언스 소개 - 정준호준호 정
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on HadoopCaserta
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data ScienceEdureka!
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientistryanorban
 

En vedette (13)

BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
 
BI congres 2016: programma
BI congres 2016: programmaBI congres 2016: programma
BI congres 2016: programma
 
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...BI congres 2016-2: Diving into weblog data with SAS on Hadoop -  Lisa Truyers...
BI congres 2016-2: Diving into weblog data with SAS on Hadoop - Lisa Truyers...
 
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
BI congres 2016-3: Insurance comparison engine - Miloud Belkacem - Business &...
 
BI congres 2014-4: thinking out of the box - Jos Cools - Crosspoint
BI congres 2014-4: thinking out of the box - Jos Cools - CrosspointBI congres 2014-4: thinking out of the box - Jos Cools - Crosspoint
BI congres 2014-4: thinking out of the box - Jos Cools - Crosspoint
 
BI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas More
BI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas MoreBI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas More
BI congres 2014-6: Opleiding Informatiemanagement - Hans Tubbax - Thomas More
 
Performance MANAGEMENT 3.0 - De evidentie zelf
Performance MANAGEMENT 3.0 - De evidentie zelfPerformance MANAGEMENT 3.0 - De evidentie zelf
Performance MANAGEMENT 3.0 - De evidentie zelf
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - PentahoBI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
 
BI congres 2014-3: facts not opinions - Tobias Temmink - Teradata
BI congres 2014-3: facts not opinions - Tobias Temmink - TeradataBI congres 2014-3: facts not opinions - Tobias Temmink - Teradata
BI congres 2014-3: facts not opinions - Tobias Temmink - Teradata
 
데이터 사이언스 소개 - 정준호
데이터 사이언스 소개 -  정준호데이터 사이언스 소개 -  정준호
데이터 사이언스 소개 - 정준호
 
Intro to Data Science on Hadoop
Intro to Data Science on HadoopIntro to Data Science on Hadoop
Intro to Data Science on Hadoop
 
Introduction on Data Science
Introduction on Data ScienceIntroduction on Data Science
Introduction on Data Science
 
How to Become a Data Scientist
How to Become a Data ScientistHow to Become a Data Scientist
How to Become a Data Scientist
 

Similaire à What's the profile of a data scientist?

intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...jybufgofasfbkpoovh
 
Starting a career in data science
Starting a career in data scienceStarting a career in data science
Starting a career in data scienceBrian Spiering
 
How to start thinking like a data scientist
How to start thinking like a data scientistHow to start thinking like a data scientist
How to start thinking like a data scientistNishant Kumar
 
Jordan Engbers - Making an Effective Data Scientist
Jordan Engbers - Making an Effective Data ScientistJordan Engbers - Making an Effective Data Scientist
Jordan Engbers - Making an Effective Data ScientistCybera Inc.
 
iTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun SukhaniiTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun SukhaniiTrain
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargShiv Shakti Ghosh
 
Data fluency for the 21st century
Data fluency for the 21st centuryData fluency for the 21st century
Data fluency for the 21st centuryMartinFrigaard
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Thinkful
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights Joe Lamantia
 
Fundamentals of Data Analytics Outline
Fundamentals of Data Analytics OutlineFundamentals of Data Analytics Outline
Fundamentals of Data Analytics OutlineDan Meyer
 
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...Pistoia Alliance
 
What is Data Science?
What is Data Science?What is Data Science?
What is Data Science?Ahmed Banafa
 
Grow Your Own - How to Create a Data Culture at Your Organization
Grow Your Own - How to Create a Data Culture at Your OrganizationGrow Your Own - How to Create a Data Culture at Your Organization
Grow Your Own - How to Create a Data Culture at Your OrganizationLuciano Pesci, PhD
 
Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015 Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015 Taneli Heinonen
 
Insight white paper_2014
Insight white paper_2014Insight white paper_2014
Insight white paper_2014Lin Todd
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data ScienceThinkful
 
Analytics (as if learning mattered) - RIDE Symposium, University of London 10...
Analytics (as if learning mattered) - RIDE Symposium, University of London 10...Analytics (as if learning mattered) - RIDE Symposium, University of London 10...
Analytics (as if learning mattered) - RIDE Symposium, University of London 10...Adam Cooper
 

Similaire à What's the profile of a data scientist? (20)

intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...intro to data science Clustering and visualization of data science subfields ...
intro to data science Clustering and visualization of data science subfields ...
 
Starting a career in data science
Starting a career in data scienceStarting a career in data science
Starting a career in data science
 
How to start thinking like a data scientist
How to start thinking like a data scientistHow to start thinking like a data scientist
How to start thinking like a data scientist
 
Jordan Engbers - Making an Effective Data Scientist
Jordan Engbers - Making an Effective Data ScientistJordan Engbers - Making an Effective Data Scientist
Jordan Engbers - Making an Effective Data Scientist
 
iTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun SukhaniiTrain Malaysia: Data Science by Tarun Sukhani
iTrain Malaysia: Data Science by Tarun Sukhani
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGargColloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
 
Data fluency for the 21st century
Data fluency for the 21st centuryData fluency for the 21st century
Data fluency for the 21st century
 
Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)Career in Data Science (July 2017, DTLA)
Career in Data Science (July 2017, DTLA)
 
Data Science Highlights
Data Science Highlights Data Science Highlights
Data Science Highlights
 
Data Scientist
Data ScientistData Scientist
Data Scientist
 
Fundamentals of Data Analytics Outline
Fundamentals of Data Analytics OutlineFundamentals of Data Analytics Outline
Fundamentals of Data Analytics Outline
 
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI  Webina...
Pistoia Alliance Webinar Demystifying AI: Centre of Excellence for AI Webina...
 
What is Data Science?
What is Data Science?What is Data Science?
What is Data Science?
 
Grow Your Own - How to Create a Data Culture at Your Organization
Grow Your Own - How to Create a Data Culture at Your OrganizationGrow Your Own - How to Create a Data Culture at Your Organization
Grow Your Own - How to Create a Data Culture at Your Organization
 
Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015 Deep Customer Insights, Laurea, October 2015
Deep Customer Insights, Laurea, October 2015
 
Insight white paper_2014
Insight white paper_2014Insight white paper_2014
Insight white paper_2014
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Data science
Data scienceData science
Data science
 
Getting Started in Data Science
Getting Started in Data ScienceGetting Started in Data Science
Getting Started in Data Science
 
Analytics (as if learning mattered) - RIDE Symposium, University of London 10...
Analytics (as if learning mattered) - RIDE Symposium, University of London 10...Analytics (as if learning mattered) - RIDE Symposium, University of London 10...
Analytics (as if learning mattered) - RIDE Symposium, University of London 10...
 

Dernier

English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 

Dernier (20)

Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data Story
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 

What's the profile of a data scientist?

  • 1. Data  Scientist  Profiles Tuesday  May  19th  2015 By  Dries  Van  Nieuwenhuyse Prof.  EHSAL  Management  School Researcher  at  BICC  Thomas  More Lecturer  Strategic  Management  HoGent
  • 2. Agenda • Do  analytics  matter?  Of  course! • What’s  in  a  name:  do  we  really  need  a  new  term  ”data  scientist”  for  this? • Characteristics  of  data  scientists • How  do  we  recognize  them? • How  do  we  build  up  data  scientist  teams? • Feasible?
  • 3. Do analytics matters? Of course! Who’s on Facebook? Who’s on LinkedIn? How much overlap is there in the PYMK suggestions? (20 – 40 – 60 – 80 – 100)
  • 4. Do analytics matters? Of course! « Some companies have built their very businesses on their ability to collect, analyze, and act on data. Every company can learn from what these firms do. » Thomas H. Davenport, 2006. Competing on analytics. Harvard Business Review, January 2006 p99-107.
  • 5. What’s in a name? Whether  employers  know  or  don’t  know  what  data   scientists  do,  they  have  been  using  -­‐ in  rapidly   growing  numbers  -­‐ the  term  “data  scientist”  in  job   descriptions  in  recent  years...
  • 6. What’s in a name? What then are the data scientists, these new men and women of industry? o Are  they  scientists?   o Engineers?   o Programmers?   o Business  Controllers?   o Financial  Controllers?   o A  new  breed  of  business  decision-­‐makers  and  innovators?
  • 7. What’s in a name? • Google‘s  chief  economist  Hal  Varian  commented  in  January  [2009]  that  the  next  sexy  job  in  the   next  10  years  would  be  statisticians.   By  statisticians,  he  actually  meant  it  as  a  general  title  for   someone  who  is  able  to  extract  information  from  large  datasets  and  then  present  something  of   use  to  non-­‐data  experts… • In  June  2009  in  a  blog  post  titled  “Rise  of  the  Data  Scientist”  by  Natahn  Yau,  a  PhD  candidate  in   statistics,  the  term  was  first  really  used.
  • 8. What’s in a name? “What data scientists do is make discoveries while swimming in data… [their] dominant trait is intense curiosity - a desire to go beneath the surface of a problem, find the questions at its heart, and distill them into a very clear set of hypotheses that can be tested. This often entails the associative thinking that characterizes the most creative scientists in any field….” Thomas H. Davenport & D.J. Patil (October 2012). Data Scientist: The Sexiest Job of the 21st Century.
  • 9. What’s in a name? A  data  scientist  is  an  engineer  who  employs  the  scientific  method  and  applies  data-­‐discovery   tools  to  find  new  insights  in  data.  The  scientific  method—the  formulation  of  a  hypothesis,  the   testing,  the  careful  design  of  experiments,  the  verification  by  others—is  something  they  take   from  their  knowledge  of  statistics  and  their  training  in  scientific  disciplines.  The  application  (and   tweaking)  of  tools  comes  from  their  engineering,  or  more  specifically,  computer  science  and   programming  background.  The  best  data  scientists  are  product  and  process  innovators  and   sometimes,  developers  of  new  data-­‐discovery  tools. Data Scientists: The Definition of Sexy (Gil Press) http://www.forbes.com/sites/gilpress/2012/09/27/data-scientists-the-definition-of-sexy/
  • 10. What’s in a name? A  data  scientist  is  a  job  title  for  an  employee  or  business  intelligence  (BI)  consultant  who  excels  at   analyzing  data,  particularly  large  amounts  of  data,  to  help  a  business  gain  a  competitive  edge.   Margaret Rouse, WhatIs.com
  • 11.
  • 12. How can we recognize a data scientist? • Very  different  backgrounds • Curiosity    beyond  day-­‐to-­‐day  activities • Bricolage  versus  engineering… • Questions  are  more  important  than  answers D.J. Patil (2011) Building Data Science Teams. O’Reilly Media.
  • 13. Skills of a data scientist The significant problems we face cannot be solvedby the same level of thinking that created them. If  I  had  an  hour  to  solve  a  problem  and  my  life  depended   on  the  solution,  I  would  spend  the  first  55  minutes   determining  the  proper  question  to  ask,  for  once  I  know   the  proper  question,  I  could  solve  the  problem  in  less   than  five  minutes. A. Einstein
  • 14. Skills of a data scientist • Finding  rich  data  sources • Working  with  large  volumes  of  data  despite  hardware,  software,  and  bandwidth  constraint • Cleaning  the  data  and  making  sure  that  data  is  consistent • Merging  multiple  datasets  together • Visualizing  that  data • Building  rich  tooling  that  enables  others  to  work  with  data  effectively D.J. Patil (2011) Building Data Science Teams. O’Reilly Media
  • 15. Data scientist = BI professional? EMC (2011). Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field.
  • 16. Data scientist = BI professional? EMC (2011). Data Science Revealed: A Data-Driven Glimpse into the Burgeoning New Field. • BI  professionals  focus  on  qualitative  visualization  of  existing  business  data • Data  scientists  apply  advanced  analytical  tools  to  generate  predictive  insights • More  communication • More  scientifically  trained   • Introvert  – Extravert • Bricolage  -­‐ Engineering
  • 17. Do data scientists need a PHD?
  • 18. Changing role of controllers • Gradient  between  Flexibility  and   Control • Gradient  between  Internal  and   External  focus • Finance  is  moving  in Cornel, Renes & Vervuurt (2013). Controllers - Fit for the future. MCA
  • 19. Can BI professionals become data scientists? • Nowadays  everyone  wants  to  become  proactive,  analytical   and  strategically  aligned • Will  they  all  succeed  in  this  mental  shift? • Of  course  they  won’t  all  succeed • So  there  is  plenty  of  room  for  talent • Obviously  a  good  understanding  of  the  domain  of   Performance  MANAGEMENT  in  compleness  will  play  a   pivotal  role  in  this  change
  • 20. Can BI professionals become data scientists? • Sure  they  can!!! • Own  research  shows  that  personal  traits  are  in  the  end  prevailing • Not  everyone  needs  to  kick  off  the  party • Soft  skills  are  more  important  than  hard  skills • Hard  skills  are  more  obvious  to  learn • Diversity  of  multi-­‐disciplinary  teams  is  more  important  than  individual  skills • Check  for  proactive  and  creative  thinking
  • 21. How to recruit? • Focus  recruiting  the  ‘usual  suspects’  – (commercial)  engineers,  hard  scientists,  big  data  guru’s,   bricoleurs • Scan  memberships  and  active  people  in  analytical  and  decision-­‐making  communities • Steal  talent  from  Finance,  IT,  Marketing • Actively  scan  LinkedIn • Organize  a  contest  for  data  science  open  to  all  profiles • Did  the  candidate  publish  in  magazines,  books?
  • 22. How to recruit? • Test  for  creativity  – ask  for  a  possible  research  agenda  for  your  company  testing  whether  they  are   actually  prepared • Organize  for  continuous  job  satisfaction  and  spontaneous  evolution  through  the  organization • Let  candidate  work  on  a  data  set  for  a  day,  come  up  with  proper  questions  and  answers  and  let  the   candidate  present  and  convince  an  audience  of  decision-­‐makers • Avoid  overskilled  and  overtechnical  PhD’s  that  can’t  communicate
  • 23. Doesn’t matter who takes the lead... just get started “Don't wait until everything is just right. It will never be perfect. There will always be challenges, obstacles and less than perfect conditions. So what. Get started now. With each step you take, you will grow stronger and stronger, more and more skilled, more and more self-confident and more and more successful.” Mark Victor Hansen