SlideShare a Scribd company logo
1 of 11
Thinking Differently in
Data Science: The
Interdisciplinary Approach
Colleen M. Farrelly
About Me
 Former MD/PhD student
 Background outside of math/stat including sociology, biochemistry, molecular
biology, psychology, genomics, epidemiology.
 Work history including academic medical studies, government, military,
biotech, and education.
 Areas of expertise include topological data analysis, measurement models,
Bayesian designs, geometry in machine learning.
Overview
 Multidisciplinary approaches often needed to solve data science problems
effectively.
 Can draw on many different areas depending on the problem:
 Sociology
 Industrial psychology
 Marketing
 Genomics
 Finance
 Medicine
 Individuals with a broad knowledge base are well-equipped for a career in
data science.
Example Problems
Problem 1: Health Risk Modeling
 Problem: Obesity and related problems are costing a healthcare system a lot
of money. How do we flag patients at risk and try to intervene on patients
who are sick?
 What disciplines might be needed? What causes might we consider? Anything
to take into account when designing the data mining and a possible trial?
What sorts of expertise might we need on this project and the
implementation of results?
Problem 1: Health Risk Modeling
 Food deserts
 Jobs with little opportunity to be active
 Genetic component
 Lack of understanding around nutrition
 Stress
Problem 2: Market Forecasting
 Problem: How can we get a better model of future valuation of a company or
sector to find good investment opportunities?
 What disciplines might be needed? What outside influences might we need to
account for? How might we set up the analysis?
Problem 2: Market Forecasting
 Election results
 Natural disasters (like Irma)
 Other market fluctuations
 Breakthrough inventions
 Employment, GDP… fluctuations
Problem 3: Predicting Disease from
Genetic Data
 Problem: Given a sequence of genetic data and patient case history
information, provide a short list of differential diagnoses with a high
probability of matching the underlying disease.
 What might complicate this analysis? Could the patient have more than one
underlying disease? Do you think the data is structured or unstructured? What
might be some technical challenges? Which disciplines could be helpful on this
project?
Problem 3: Predicting Disease from
Genetic Data
 Epigenetic factors (environment)
 Comorbidity
 Doctor error
 Incorrect spelling or unreadable shorthand
 Computational challenges of data storage and analysis requirements
 Statistical test problems (p>>n)
Conclusions
 Domain knowledge is important in data science.
 Interdisciplinary backgrounds or team compositions can help understand a
given project from multiple angles.
 This avoids potential bias or unreasonable assumptions.
 Technical expertise + domain knowledge creates value in data science
projects.

More Related Content

What's hot

2021 American Mathematical Society Data Science Talk
2021 American Mathematical Society Data Science Talk2021 American Mathematical Society Data Science Talk
2021 American Mathematical Society Data Science TalkColleen Farrelly
 
Propensity Score Matching Using SAS Enterprise Guide
Propensity Score Matching Using SAS Enterprise GuidePropensity Score Matching Using SAS Enterprise Guide
Propensity Score Matching Using SAS Enterprise GuideIan Morton
 
Data, Classifications and Sources.
Data, Classifications and Sources.Data, Classifications and Sources.
Data, Classifications and Sources.RajaKrishnan M
 
Information literacy proficiency
Information literacy proficiencyInformation literacy proficiency
Information literacy proficiencyvsabine
 
La & edm in practice
La & edm in practiceLa & edm in practice
La & edm in practicebharati k
 
The bayesian revolution in genetics
The bayesian revolution in geneticsThe bayesian revolution in genetics
The bayesian revolution in geneticsBeat Winehouse
 
Data Mining Techniques for School Failure and Dropout System
Data Mining Techniques for School Failure and Dropout SystemData Mining Techniques for School Failure and Dropout System
Data Mining Techniques for School Failure and Dropout SystemKumar Goud
 
Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.docbutest
 
A Review of Intelligent Agent Systems in Animal Health Care
A Review of Intelligent Agent Systems in Animal Health CareA Review of Intelligent Agent Systems in Animal Health Care
A Review of Intelligent Agent Systems in Animal Health CareIJCSIS Research Publications
 
Basics of Data Analysis
Basics of Data AnalysisBasics of Data Analysis
Basics of Data Analysisankurjain1909
 
The Influence of Participant Personality in Usability Tests
The Influence of Participant Personality in Usability TestsThe Influence of Participant Personality in Usability Tests
The Influence of Participant Personality in Usability TestsCSCJournals
 
Students academic performance using clustering technique
Students academic performance using clustering techniqueStudents academic performance using clustering technique
Students academic performance using clustering techniquesaniacorreya
 
Predicting students performance in final examination
Predicting students performance in final examinationPredicting students performance in final examination
Predicting students performance in final examinationRashid Ansari
 

What's hot (17)

2021 American Mathematical Society Data Science Talk
2021 American Mathematical Society Data Science Talk2021 American Mathematical Society Data Science Talk
2021 American Mathematical Society Data Science Talk
 
Propensity Score Matching Using SAS Enterprise Guide
Propensity Score Matching Using SAS Enterprise GuidePropensity Score Matching Using SAS Enterprise Guide
Propensity Score Matching Using SAS Enterprise Guide
 
Data, Classifications and Sources.
Data, Classifications and Sources.Data, Classifications and Sources.
Data, Classifications and Sources.
 
Information literacy proficiency
Information literacy proficiencyInformation literacy proficiency
Information literacy proficiency
 
La & edm in practice
La & edm in practiceLa & edm in practice
La & edm in practice
 
The bayesian revolution in genetics
The bayesian revolution in geneticsThe bayesian revolution in genetics
The bayesian revolution in genetics
 
Data Mining Techniques for School Failure and Dropout System
Data Mining Techniques for School Failure and Dropout SystemData Mining Techniques for School Failure and Dropout System
Data Mining Techniques for School Failure and Dropout System
 
Twala2007.doc
Twala2007.docTwala2007.doc
Twala2007.doc
 
A Review of Intelligent Agent Systems in Animal Health Care
A Review of Intelligent Agent Systems in Animal Health CareA Review of Intelligent Agent Systems in Animal Health Care
A Review of Intelligent Agent Systems in Animal Health Care
 
Basics of Data Analysis
Basics of Data AnalysisBasics of Data Analysis
Basics of Data Analysis
 
The Influence of Participant Personality in Usability Tests
The Influence of Participant Personality in Usability TestsThe Influence of Participant Personality in Usability Tests
The Influence of Participant Personality in Usability Tests
 
Chapter 1
Chapter 1Chapter 1
Chapter 1
 
Lecture notes on STS 202
Lecture notes on STS 202Lecture notes on STS 202
Lecture notes on STS 202
 
Data analysis
Data analysisData analysis
Data analysis
 
Data Analysis
Data AnalysisData Analysis
Data Analysis
 
Students academic performance using clustering technique
Students academic performance using clustering techniqueStudents academic performance using clustering technique
Students academic performance using clustering technique
 
Predicting students performance in final examination
Predicting students performance in final examinationPredicting students performance in final examination
Predicting students performance in final examination
 

Similar to Thinking Differently in Data Science

Big Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdfBig Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdfbkbk37
 
Machine learning applied in health
Machine learning applied in healthMachine learning applied in health
Machine learning applied in healthBig Data Colombia
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health careAravindharamanan S
 
Information Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical EducationInformation Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical EducationBen Williams
 
AI in medicine A comprehensive overview.pdf
AI in medicine A comprehensive overview.pdfAI in medicine A comprehensive overview.pdf
AI in medicine A comprehensive overview.pdfChristopherTHyatt
 
Biomedical Informatics
Biomedical InformaticsBiomedical Informatics
Biomedical Informaticsimprovemed
 
ppt for data science slideshare.pptx
ppt for data science slideshare.pptxppt for data science slideshare.pptx
ppt for data science slideshare.pptxMangeshPatil358834
 
Data science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptxData science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptxArpitaDebnath20
 
Data discussion.docx
Data discussion.docxData discussion.docx
Data discussion.docxstudywriters
 
Week 8 Quantitative Research DesignPrevious Next Instructio.docx
Week 8 Quantitative Research DesignPrevious Next Instructio.docxWeek 8 Quantitative Research DesignPrevious Next Instructio.docx
Week 8 Quantitative Research DesignPrevious Next Instructio.docxphilipnelson29183
 
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdfLeveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdfSoumodeep Nanee Kundu
 
Challenges in deep learning methods for medical imaging - Pubrica
Challenges in deep learning methods for medical imaging - PubricaChallenges in deep learning methods for medical imaging - Pubrica
Challenges in deep learning methods for medical imaging - PubricaPubrica
 
improving-medication-adherence
improving-medication-adherenceimproving-medication-adherence
improving-medication-adherenceSergio Malorni
 
Theory of Human Caring on APN Role Student PresentationWeb Page
 Theory of Human Caring on APN Role Student PresentationWeb Page Theory of Human Caring on APN Role Student PresentationWeb Page
Theory of Human Caring on APN Role Student PresentationWeb PageMikeEly930
 
From personal health data to a personalized advice
From personal health data to a personalized adviceFrom personal health data to a personalized advice
From personal health data to a personalized adviceWessel Kraaij
 
Unlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media ListeningUnlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media ListeningRNayak3
 
Principles of data_science
Principles of data_sciencePrinciples of data_science
Principles of data_sciencetvk66866
 

Similar to Thinking Differently in Data Science (20)

Big Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdfBig Data Means Big Potential Challenges for Nurse Execs Response.pdf
Big Data Means Big Potential Challenges for Nurse Execs Response.pdf
 
Machine learning applied in health
Machine learning applied in healthMachine learning applied in health
Machine learning applied in health
 
Sun==big data analytics for health care
Sun==big data analytics for health careSun==big data analytics for health care
Sun==big data analytics for health care
 
Information Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical EducationInformation Technology: The Third Pillar of Medical Education
Information Technology: The Third Pillar of Medical Education
 
AI in medicine A comprehensive overview.pdf
AI in medicine A comprehensive overview.pdfAI in medicine A comprehensive overview.pdf
AI in medicine A comprehensive overview.pdf
 
Biomedical Informatics
Biomedical InformaticsBiomedical Informatics
Biomedical Informatics
 
ppt for data science slideshare.pptx
ppt for data science slideshare.pptxppt for data science slideshare.pptx
ppt for data science slideshare.pptx
 
Data science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptxData science in healthcare-Assignment 2.pptx
Data science in healthcare-Assignment 2.pptx
 
Data discussion.docx
Data discussion.docxData discussion.docx
Data discussion.docx
 
Week 8 Quantitative Research DesignPrevious Next Instructio.docx
Week 8 Quantitative Research DesignPrevious Next Instructio.docxWeek 8 Quantitative Research DesignPrevious Next Instructio.docx
Week 8 Quantitative Research DesignPrevious Next Instructio.docx
 
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdfLeveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
 
Data Walden.docx
Data Walden.docxData Walden.docx
Data Walden.docx
 
Challenges in deep learning methods for medical imaging - Pubrica
Challenges in deep learning methods for medical imaging - PubricaChallenges in deep learning methods for medical imaging - Pubrica
Challenges in deep learning methods for medical imaging - Pubrica
 
improving-medication-adherence
improving-medication-adherenceimproving-medication-adherence
improving-medication-adherence
 
Theory of Human Caring on APN Role Student PresentationWeb Page
 Theory of Human Caring on APN Role Student PresentationWeb Page Theory of Human Caring on APN Role Student PresentationWeb Page
Theory of Human Caring on APN Role Student PresentationWeb Page
 
From personal health data to a personalized advice
From personal health data to a personalized adviceFrom personal health data to a personalized advice
From personal health data to a personalized advice
 
Unlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media ListeningUnlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media Listening
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Principles of data_science
Principles of data_sciencePrinciples of data_science
Principles of data_science
 
Nurses and Data Science
Nurses and Data ScienceNurses and Data Science
Nurses and Data Science
 

More from Colleen Farrelly

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Hands-On Network Science, PyData Global 2023
Hands-On Network Science, PyData Global 2023Hands-On Network Science, PyData Global 2023
Hands-On Network Science, PyData Global 2023Colleen Farrelly
 
Modeling Climate Change.pptx
Modeling Climate Change.pptxModeling Climate Change.pptx
Modeling Climate Change.pptxColleen Farrelly
 
Natural Language Processing for Beginners.pptx
Natural Language Processing for Beginners.pptxNatural Language Processing for Beginners.pptx
Natural Language Processing for Beginners.pptxColleen Farrelly
 
The Shape of Data--ODSC.pptx
The Shape of Data--ODSC.pptxThe Shape of Data--ODSC.pptx
The Shape of Data--ODSC.pptxColleen Farrelly
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxColleen Farrelly
 
Emerging Technologies for Public Health in Remote Locations.pptx
Emerging Technologies for Public Health in Remote Locations.pptxEmerging Technologies for Public Health in Remote Locations.pptx
Emerging Technologies for Public Health in Remote Locations.pptxColleen Farrelly
 
Applications of Forman-Ricci Curvature.pptx
Applications of Forman-Ricci Curvature.pptxApplications of Forman-Ricci Curvature.pptx
Applications of Forman-Ricci Curvature.pptxColleen Farrelly
 
Geometry for Social Good.pptx
Geometry for Social Good.pptxGeometry for Social Good.pptx
Geometry for Social Good.pptxColleen Farrelly
 
Topology for Time Series.pptx
Topology for Time Series.pptxTopology for Time Series.pptx
Topology for Time Series.pptxColleen Farrelly
 
Time Series Applications AMLD.pptx
Time Series Applications AMLD.pptxTime Series Applications AMLD.pptx
Time Series Applications AMLD.pptxColleen Farrelly
 
An introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptxAn introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptxColleen Farrelly
 
An introduction to time series data with R.pptx
An introduction to time series data with R.pptxAn introduction to time series data with R.pptx
An introduction to time series data with R.pptxColleen Farrelly
 
NLP: Challenges and Opportunities in Underserved Areas
NLP: Challenges and Opportunities in Underserved AreasNLP: Challenges and Opportunities in Underserved Areas
NLP: Challenges and Opportunities in Underserved AreasColleen Farrelly
 
Geometry, Data, and One Path Into Data Science.pptx
Geometry, Data, and One Path Into Data Science.pptxGeometry, Data, and One Path Into Data Science.pptx
Geometry, Data, and One Path Into Data Science.pptxColleen Farrelly
 
Topological Data Analysis.pptx
Topological Data Analysis.pptxTopological Data Analysis.pptx
Topological Data Analysis.pptxColleen Farrelly
 
Transforming Text Data to Matrix Data via Embeddings.pptx
Transforming Text Data to Matrix Data via Embeddings.pptxTransforming Text Data to Matrix Data via Embeddings.pptx
Transforming Text Data to Matrix Data via Embeddings.pptxColleen Farrelly
 
Natural Language Processing in the Wild.pptx
Natural Language Processing in the Wild.pptxNatural Language Processing in the Wild.pptx
Natural Language Processing in the Wild.pptxColleen Farrelly
 
SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing Colleen Farrelly
 
WIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network ScienceWIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network ScienceColleen Farrelly
 

More from Colleen Farrelly (20)

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
Hands-On Network Science, PyData Global 2023
Hands-On Network Science, PyData Global 2023Hands-On Network Science, PyData Global 2023
Hands-On Network Science, PyData Global 2023
 
Modeling Climate Change.pptx
Modeling Climate Change.pptxModeling Climate Change.pptx
Modeling Climate Change.pptx
 
Natural Language Processing for Beginners.pptx
Natural Language Processing for Beginners.pptxNatural Language Processing for Beginners.pptx
Natural Language Processing for Beginners.pptx
 
The Shape of Data--ODSC.pptx
The Shape of Data--ODSC.pptxThe Shape of Data--ODSC.pptx
The Shape of Data--ODSC.pptx
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptx
 
Emerging Technologies for Public Health in Remote Locations.pptx
Emerging Technologies for Public Health in Remote Locations.pptxEmerging Technologies for Public Health in Remote Locations.pptx
Emerging Technologies for Public Health in Remote Locations.pptx
 
Applications of Forman-Ricci Curvature.pptx
Applications of Forman-Ricci Curvature.pptxApplications of Forman-Ricci Curvature.pptx
Applications of Forman-Ricci Curvature.pptx
 
Geometry for Social Good.pptx
Geometry for Social Good.pptxGeometry for Social Good.pptx
Geometry for Social Good.pptx
 
Topology for Time Series.pptx
Topology for Time Series.pptxTopology for Time Series.pptx
Topology for Time Series.pptx
 
Time Series Applications AMLD.pptx
Time Series Applications AMLD.pptxTime Series Applications AMLD.pptx
Time Series Applications AMLD.pptx
 
An introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptxAn introduction to quantum machine learning.pptx
An introduction to quantum machine learning.pptx
 
An introduction to time series data with R.pptx
An introduction to time series data with R.pptxAn introduction to time series data with R.pptx
An introduction to time series data with R.pptx
 
NLP: Challenges and Opportunities in Underserved Areas
NLP: Challenges and Opportunities in Underserved AreasNLP: Challenges and Opportunities in Underserved Areas
NLP: Challenges and Opportunities in Underserved Areas
 
Geometry, Data, and One Path Into Data Science.pptx
Geometry, Data, and One Path Into Data Science.pptxGeometry, Data, and One Path Into Data Science.pptx
Geometry, Data, and One Path Into Data Science.pptx
 
Topological Data Analysis.pptx
Topological Data Analysis.pptxTopological Data Analysis.pptx
Topological Data Analysis.pptx
 
Transforming Text Data to Matrix Data via Embeddings.pptx
Transforming Text Data to Matrix Data via Embeddings.pptxTransforming Text Data to Matrix Data via Embeddings.pptx
Transforming Text Data to Matrix Data via Embeddings.pptx
 
Natural Language Processing in the Wild.pptx
Natural Language Processing in the Wild.pptxNatural Language Processing in the Wild.pptx
Natural Language Processing in the Wild.pptx
 
SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing SAS Global 2021 Introduction to Natural Language Processing
SAS Global 2021 Introduction to Natural Language Processing
 
WIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network ScienceWIDS 2021--An Introduction to Network Science
WIDS 2021--An Introduction to Network Science
 

Recently uploaded

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023ymrp368
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...SUHANI PANDEY
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 

Recently uploaded (20)

Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
Data-Analysis for Chicago Crime Data 2023
Data-Analysis for Chicago Crime Data  2023Data-Analysis for Chicago Crime Data  2023
Data-Analysis for Chicago Crime Data 2023
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 

Thinking Differently in Data Science

  • 1. Thinking Differently in Data Science: The Interdisciplinary Approach Colleen M. Farrelly
  • 2. About Me  Former MD/PhD student  Background outside of math/stat including sociology, biochemistry, molecular biology, psychology, genomics, epidemiology.  Work history including academic medical studies, government, military, biotech, and education.  Areas of expertise include topological data analysis, measurement models, Bayesian designs, geometry in machine learning.
  • 3. Overview  Multidisciplinary approaches often needed to solve data science problems effectively.  Can draw on many different areas depending on the problem:  Sociology  Industrial psychology  Marketing  Genomics  Finance  Medicine  Individuals with a broad knowledge base are well-equipped for a career in data science.
  • 5. Problem 1: Health Risk Modeling  Problem: Obesity and related problems are costing a healthcare system a lot of money. How do we flag patients at risk and try to intervene on patients who are sick?  What disciplines might be needed? What causes might we consider? Anything to take into account when designing the data mining and a possible trial? What sorts of expertise might we need on this project and the implementation of results?
  • 6. Problem 1: Health Risk Modeling  Food deserts  Jobs with little opportunity to be active  Genetic component  Lack of understanding around nutrition  Stress
  • 7. Problem 2: Market Forecasting  Problem: How can we get a better model of future valuation of a company or sector to find good investment opportunities?  What disciplines might be needed? What outside influences might we need to account for? How might we set up the analysis?
  • 8. Problem 2: Market Forecasting  Election results  Natural disasters (like Irma)  Other market fluctuations  Breakthrough inventions  Employment, GDP… fluctuations
  • 9. Problem 3: Predicting Disease from Genetic Data  Problem: Given a sequence of genetic data and patient case history information, provide a short list of differential diagnoses with a high probability of matching the underlying disease.  What might complicate this analysis? Could the patient have more than one underlying disease? Do you think the data is structured or unstructured? What might be some technical challenges? Which disciplines could be helpful on this project?
  • 10. Problem 3: Predicting Disease from Genetic Data  Epigenetic factors (environment)  Comorbidity  Doctor error  Incorrect spelling or unreadable shorthand  Computational challenges of data storage and analysis requirements  Statistical test problems (p>>n)
  • 11. Conclusions  Domain knowledge is important in data science.  Interdisciplinary backgrounds or team compositions can help understand a given project from multiple angles.  This avoids potential bias or unreasonable assumptions.  Technical expertise + domain knowledge creates value in data science projects.