SlideShare une entreprise Scribd logo
1  sur  46
The Future of Frequentist
Hypothesis Testing
Denise Esserman, PhD
Yale Center for Analytical Sciences (YCAS)
Yale School of Public Health
October 19-25 | 2015 New York
www.medicres.org
Outline
• Definition of Big Data
• Expanding into Medicine
• Statistical Concerns
• The Future
October 19-25 | 2015 New York
www.medicres.org
“Big data is like teenage sex: everyone talks
about it, nobody really knows how to do it,
everyone thinks that everyone else is doing it,
so everyone claims they are doing it…”
Dan Ariely 2013
[Terry Speed talk, 2014]
October 19-25 | 2015 New York
www.medicres.org
What are/is “Big Data”?
October 19-25 | 2015 New York
www.medicres.org
• “Buzzword”
• Multiple Definitions
– The V’s: volume, velocity, variability, variety,
veracity, value (and complexity)
• Variability in the quantity and quality of the
data
Origin
• Believed to have originated with Web search
companies
– Querying very large distributed aggregations of
loosely-structured data1
• Does not always reference the volume of
data
October 19-25 | 2015 New York
www.medicres.org
Big Data Trends
• Google trends
October 19-25 | 2015 New York
www.medicres.org
“The hopeful vision of big data is that
organizations will be able to harvest and
harness every byte of relevant data and use it
to make the best decisions. Big data
technologies not only support the ability to
collect large amounts, but more importantly,
the ability to understand and take advantage
of its full value.”
Mark Troester14
October 19-25 | 2015 New York
www.medicres.org
Big Data in Medicine
• “Potential lies in innovative ways it can be
linked, related, and integrated to provide
more detailed and personalized
information than is possible with data from
a single source”7
• For health care providers to offer
personalized medicine
October 19-25 | 2015 New York
www.medicres.org
NIH – Big Data to Knowledge6
“The ability to harvest the wealth of information
contained in biomedical Big Data will advance our
understanding of human health and disease;
however, lack of appropriate tools, poor data
accessibility, and insufficient training, are major
impediments to rapid translational impact. To meet
this challenge, the National Institutes of Health
(NIH) launched the Big Data to Knowledge (BD2K)
initiative in 2012.”
October 19-25 | 2015 New York
www.medicres.org
BD2K Mission Statement
“BD2K is a trans-NIH initiative established
to enable biomedical research as a digital
research enterprise, to facilitate discovery
and support new knowledge, and to
maximize community engagement.”
October 19-25 | 2015 New York
www.medicres.org
BD2K Four Major Aims
• To facilitate broad use of biomedical digital assets by making them
discoverable, accessible, and citable.
• To conduct research and develop the methods, software, and tools
needed to analyze biomedical Big Data.
• To enhance training in the development and use of methods and
tools necessary for biomedical Big Data science.
• To support a data ecosystem that accelerates discovery as part of a
digital enterprise.
October 19-25 | 2015 New York
www.medicres.org
Biomedical Big Data6
• More than just very large data or a large number of data
sources.
– Complexity, challenges, and new opportunities
presented by the combined analysis of data.
• Diverse and complex.
– Imaging, phenotypic, molecular, exposure, health,
behavioral, and many other types of data.
October 19-25 | 2015 New York
www.medicres.org
• Faces many challenges.
– Unwieldy amount of information
– Lack of organization and access to data and tools
– Insufficient training in data science methods
• Spectacular opportunities.
– Maximize the potential of existing data and enable new
directions for research.
October 19-25 | 2015 New York
www.medicres.org
Quantity of data does not mean one can
ignore foundational issues of measurement
and construct validity and reliability and
dependencies among data5
October 19-25 | 2015 New York
www.medicres.org
Google Flu Trends2,3
• Machine Learning Algorithm
– Predict number of flu cases based on Google
Search Terms
– Theory Free
– Misunderstanding about uncertainties in data
collection and modeling process
• Inaccurate results over time
– Lack of statistics: did not know what linked
search terms to spread of flu
October 19-25 | 2015 New York
www.medicres.org
Google Flu Trends (cont)
• Correlation rather than causation
– Cheaper and easier
• Theory-free analysis is fragile
• Intended as a “complementary signal”
rather than stand alone forecasting tool4
October 19-25 | 2015 New York
www.medicres.org
“There are a lot of small data problems that
occur in big data. They don’t disappear
because you’ve got lots of the stuff. They get
worse.”
David Spielgelhalter
“Big data is like a big trash dump. You have to
know how to find the nuggets so it’s
profitable.”
Vin Gupta
October 19-25 | 2015 New York
www.medicres.org
Indexing vs. Analyzing Big Data
• Search companies index it
– Make relevant data easy to use
• Statisticians analyze it
– Find structure within the data
October 19-25 | 2015 New York
www.medicres.org
Data Scientists7
• People who draw insights from large quantities of
data
– Innovative problem solvers
– Expertise in statistical modeling and machine learning
– Specialized programming skills
– Solid grasp of problem domain
• Data science is blend of statistical, mathematical,
and computational sciences
October 19-25 | 2015 New York
www.medicres.org
Statistical Disconnect
• Statisticians should be leaders of Big Data
and data science movement8
– Scope goes beyond traditional activities
• Statisticians need to be more engaged
– Need to develop the skills to handle the sheer
volume of data
• Data scientists need to engage more
statisticians (or more statisticians need to
become data scientists)
October 19-25 | 2015 New York
www.medicres.org
“Better data matters because simply having
Big Data does not guarantee reliable answers
for Big Questions.”
Robert Rodriguez
October 19-25 | 2015 New York
www.medicres.org
Some of the Statistical Concerns
• Sampling populations
– Sampling error – not representative
• Confounders
• Multiple Testing
• Bias
– Sampling bias – not randomly chosen
• Overfitting
October 19-25 | 2015 New York
www.medicres.org
Big n problem
• Myth that problem is only computational in
nature, not statistical because of large n
• Standard errors can be large even with Big
Data
– Issue with large p
October 19-25 | 2015 New York
www.medicres.org
• Scale of data requires spreading across
cluster or grid of computers
• Computational work to be distributed with
the data
– Google MapReduce Model for parallel
programming (Apache Hadoop)
October 19-25 | 2015 New York
www.medicres.org
Software Alchemy (SA)
• Simple, powerful method to reduce computation
• Partition the data in r groups and calculate
average of estimator across groups
– Requires partitions to be distributed similarly
• May need initial “shuffle” step
• Works well for any asymptotically normal
estimator
– Also works for p growing
October 19-25 | 2015 New York
www.medicres.org
Big p problem
• High-dimensional data
• Dimension reduction
– e.g. Principal components analysis, variable
selection
• Issues:
– Multiple comparisons and simultaneous
inference
– Sparsity of data
October 19-25 | 2015 New York
www.medicres.org
In the future…
• Can we find methods that allow larger
values of p than “safe” o(√n)?
– Dimension reduction may distort results
• Can we more easily verify technical
assumptions?
– e.g. lack of potential consistency of the LASSO
• Can we find more general methods?
October 19-25 | 2015 New York
www.medicres.org
Theoretical Null Distributions
• Null distribution is most often not estimated, but
hypothesized in classic hypothesis testing
• Incorrect null might lead to false inference
• Influences
– Correlation
– Incorrect assumptions
– Unobserved covariates
October 19-25 | 2015 New York
www.medicres.org
Empirical Null Distribution
• Empirical null estimated from study’s data
• Does not assume “nice” normal with variance
going to 0
• Need independence assumption, but do not need
identical assumption
– Can have heterogeneous groups
October 19-25 | 2015 New York
www.medicres.org
Big-data Clinical Trials (BCT)10
• Neglected problem in RCT – analysis
typically based on different effectiveness of
different interventions provided at
baseline
• Want to be able to analyze association of
baseline treatment and its subsequent
dynamic processes
October 19-25 | 2015 New York
www.medicres.org
Example: Blood Pressure
• RCT:
– Effectiveness of Antihypertensive on BP control
– BP measured at specified outcomes
– Long term outcomes (e.g. stroke)
• BCT:
– Maintenance stable BP every day, hour, minute
– “Dosing” equipment
October 19-25 | 2015 New York
www.medicres.org
Epidemiologic Perspective
• Big data is useful to detect rare drug-related
side effects, not likely to be observed in RCT
• Chance to look at rare diseases
• Can contribute to an understanding of the
strengths and limitations of new population
sources12
October 19-25 | 2015 New York
www.medicres.org
Future of BCT10
• How will this be defined?
• What is the “right” data to collect?
• Who is in a position to design this trial?
• How do we handle threats of big data?
• How do we incorporate the non-static
populations?
October 19-25 | 2015 New York
www.medicres.org
Moving Beyond Rectangular Data
• Structure of the data may be irregular, non-
structured
– Knowledge will change over time11
• Varying types of data
– Pictures, videos, images
– Unstructured and Sei-structured from Social
Media
October 19-25 | 2015 New York
www.medicres.org
Machine Learning
“…a subfield of computer science that evolved from
the study of pattern recognition and computational
learning theory in artificial intelligence. Machine
learning explores the study and construction of
algorithms that can learn from and make predictions
on data. Such algorithms operate by building a
model from example inputs in order to make data-
driven predictions or decisions, rather than
following strictly statistic program instructions.”9
October 19-25 | 2015 New York
www.medicres.org
Machine Learning Tasks
• Supervised learning
– Example inputs and output – learn a rule that maps inputs to
outputs
• Semi-supervised learning
– Incomplete training signal (i.e. target outputs missing)
• Unsupervised learning
– Leave on own to find structure in input
• Reinforcement learning
– Interaction with dynamic environment
October 19-25 | 2015 New York
www.medicres.org
Example: Support Vector Machines
• Supervised learning method
• Used for classification and regression
analysis
• Can perform non-linear classification
October 19-25 | 2015 New York
www.medicres.org
Challenges with Machine Learning
• Need to choose the appropriate algorithm
• Need to be able to define hyper-parameters
– One or model parameters
• Data needs to be in appropriate format
– This is not trivial
• Execution time grows with number of attributes
and data instances
October 19-25 | 2015 New York
www.medicres.org
Future of Machine Learning
• Automatic searches for optimal algorithm
and hyper-parameters
– Very time consuming, limited usefulness at
present
• More user friendly approaches
– e.g. allow healthcare researcher to efficiently
and independently build predictive model13
October 19-25 | 2015 New York
www.medicres.org
Example: Machine Learning for Big
Clinical Data (MLBCD)13
• Supports whole process of iterative machine learning on
big clinical data
– Clinical parameter extraction
– Feature construction
– Algorithm and hyper-parameter selection
– Model building
– Model evaluation
• Can use after once (1) defined study population and
research question, (2) obtained clinical data set, and (3)
prepped data, including cleaning and filling in missing
data
October 19-25 | 2015 New York
www.medicres.org
Data Set Preparation
• Tremendous amount of work goes into
getting a data set together
– Acquire, normalize, clean
• e.g. Pivoting entity-attribute value format EMR to
relational table formats13
October 19-25 | 2015 New York
www.medicres.org
ID (entity Test # (Attribute) Pulse (Value)
100100 Test 1 98
989021 Test 2 101
100100 Test 2 75
989021 Test 3 99
989021 Test 4 88
ID Test_1 Test_2 Test_3
100100 98 75 null
989021 null 101 99
• Even bigger challenge with large data sets
– Different formats
– Different locations
– Data quality and governance
– Security, privacy and regulatory challenges
• Surprisingly little work done here – and it
should be a priority!
October 19-25 | 2015 New York
www.medicres.org
Example: Fitbit
• Study of Cardiac Risk
• Use Fitbit to measure daily step counts for 3
years (4000 participants)
• Participants need to upload to
laptop/phone and then sync to Fitbit server
• Devise holds one month of data
• Need to then connect with other study data
October 19-25 | 2015 New York
www.medicres.org
Where should the field head?
• Need new techniques for data management
• New tools for data analysis
• New tools for data visualization
• Ways to acquire and analyze unstructured
text data
October 19-25 | 2015 New York
www.medicres.org
“Big data is not about the technologies to
store massive amounts of data, it is about
creating a flexible infrastructure with high-
performance computing, high-performance
analytics and governance – in a deployment
model that makes sense for the
organization.”14
Mark Troester
October 19-25 | 2015 New York
www.medicres.org
References
1. http://www.webopedia.com/TERM/B/big_data.html (accessed October 15, 2015)
2. http://simplystatistics.org/2014/05/07/why-big-data-is-in-trouble-they-forgot-about-applied-statistics/ (accessed October
15,2015)
3. http://www.ft.com/intl/cms/s/2/21a6e7d8-b479-11e3-a09a-00144feabdc0.html (accessed October 15, 2015)
4. http://bits.blogs.nytimes.com/2014/03/28/google-flu-trends-the-limits-of-big-data/?_r=0 (accessed October 16, 2015)
5. Lazer D, Kennedy R, King G, Vespignani. The Parable of Google Flu: Traps in Big Data Analysis. Science, 343: 1203-05, (2014).
6. https://datascience.nih.gov/bd2k/about/what (accessed October 15, 2015)
7. http://magazine.amstat.org/blog/2012/06/01/prescorner/ (accessed October 17, 2015)
8. http://magazine.amstat.org/blog/2013/06/01/the-asa-and-big-data/ (accessed October 18, 2015)
9. https://en.wikipedia.org/wiki/Machine_learning (accessed October 18, 2015)
10. Wang SD. Opportunities and challenges of clinical research in the big-data era: from RCT to BCT. J Thorac Dis, 5(6): 721-
723, (2013)
11. Wang SD, Shen Y. Redefining big-data clinical trial (BCT). Annals of Translational Medicine. 2(10): 96, (2014)
12. Gange SJ, Golub ET. From smallpox to big data: The Next 100 years of epidemiologic methods. American Journal of
Epidemiology. DOI: 10.1093/aje/kwv150
13. Luo G. MLBCD: a machine learning tool for big clinical data. Inf Sci Syst. 3:3, 2015.
14. Big Data Meets Big Data Analytics. SAS White Paper
October 19-25 | 2015 New York
www.medicres.org

Contenu connexe

Tendances

Leveraging Medical Health Record Data for Identifying Research Study Particip...
Leveraging Medical Health Record Data for Identifying Research Study Particip...Leveraging Medical Health Record Data for Identifying Research Study Particip...
Leveraging Medical Health Record Data for Identifying Research Study Particip...SC CTSI at USC and CHLA
 
Big Data: Big Opportunities or Big Trouble?
Big Data: Big Opportunities or Big Trouble?Big Data: Big Opportunities or Big Trouble?
Big Data: Big Opportunities or Big Trouble?Shea Swauger
 
TIARA Module 4 Geoff Barnes 10302019
TIARA Module 4 Geoff Barnes 10302019TIARA Module 4 Geoff Barnes 10302019
TIARA Module 4 Geoff Barnes 10302019Stacy Farr, PhD, MPH
 
Weighing evidence and assessing uncertainties
Weighing evidence and assessing uncertaintiesWeighing evidence and assessing uncertainties
Weighing evidence and assessing uncertaintiesEFSA EU
 
TIARA Module 4 Anne Sales Practical Considerations 102019
TIARA Module 4 Anne Sales Practical Considerations 102019TIARA Module 4 Anne Sales Practical Considerations 102019
TIARA Module 4 Anne Sales Practical Considerations 102019Stacy Farr, PhD, MPH
 
Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015Paul Courtney
 
Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20Ewout Steyerberg
 
David Gough - Evidence informed policy making - 27 June 2017
David Gough - Evidence informed policy making - 27 June 2017David Gough - Evidence informed policy making - 27 June 2017
David Gough - Evidence informed policy making - 27 June 2017OECD Governance
 
Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...
Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...
Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...NAP Global Network
 
TIARA Module 3 Design and Analysis Dr. Anne Sales 082019
TIARA Module 3 Design and Analysis  Dr. Anne Sales 082019TIARA Module 3 Design and Analysis  Dr. Anne Sales 082019
TIARA Module 3 Design and Analysis Dr. Anne Sales 082019Stacy Farr, PhD, MPH
 
Digital Scholar Webinar: Clinicaltrials.gov Registration and Reporting Documents
Digital Scholar Webinar: Clinicaltrials.gov Registration and Reporting DocumentsDigital Scholar Webinar: Clinicaltrials.gov Registration and Reporting Documents
Digital Scholar Webinar: Clinicaltrials.gov Registration and Reporting DocumentsSC CTSI at USC and CHLA
 
National archetype governance in Norway
National archetype governance in NorwayNational archetype governance in Norway
National archetype governance in NorwaySilje Ljosland Bakke
 
Full METL Research to Faster Miracles
Full METL Research to Faster MiraclesFull METL Research to Faster Miracles
Full METL Research to Faster MiraclesSean Manion PhD
 
TIARA Module 2 Anne Sales Theory & Approaches 06192019
TIARA Module 2 Anne Sales Theory & Approaches 06192019TIARA Module 2 Anne Sales Theory & Approaches 06192019
TIARA Module 2 Anne Sales Theory & Approaches 06192019Stacy Farr, PhD, MPH
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEwout Steyerberg
 
Journal club summary: Open Science save lives
Journal club summary: Open Science save livesJournal club summary: Open Science save lives
Journal club summary: Open Science save livesDorothy Bishop
 

Tendances (20)

297 vickers
297 vickers297 vickers
297 vickers
 
Leveraging Medical Health Record Data for Identifying Research Study Particip...
Leveraging Medical Health Record Data for Identifying Research Study Particip...Leveraging Medical Health Record Data for Identifying Research Study Particip...
Leveraging Medical Health Record Data for Identifying Research Study Particip...
 
Big Data: Big Opportunities or Big Trouble?
Big Data: Big Opportunities or Big Trouble?Big Data: Big Opportunities or Big Trouble?
Big Data: Big Opportunities or Big Trouble?
 
TIARA Module 4 Geoff Barnes 10302019
TIARA Module 4 Geoff Barnes 10302019TIARA Module 4 Geoff Barnes 10302019
TIARA Module 4 Geoff Barnes 10302019
 
An introduction to Statistical Analysis Plans
An introduction to Statistical Analysis PlansAn introduction to Statistical Analysis Plans
An introduction to Statistical Analysis Plans
 
Weighing evidence and assessing uncertainties
Weighing evidence and assessing uncertaintiesWeighing evidence and assessing uncertainties
Weighing evidence and assessing uncertainties
 
TIARA Module 4 Anne Sales Practical Considerations 102019
TIARA Module 4 Anne Sales Practical Considerations 102019TIARA Module 4 Anne Sales Practical Considerations 102019
TIARA Module 4 Anne Sales Practical Considerations 102019
 
Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015Almaden presentation 15-dec-2015
Almaden presentation 15-dec-2015
 
Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20Open science LMU session contribution E Steyerberg 2jul20
Open science LMU session contribution E Steyerberg 2jul20
 
Kochalko,"Why we should stop worrying about high impact journal indicators an...
Kochalko,"Why we should stop worrying about high impact journal indicators an...Kochalko,"Why we should stop worrying about high impact journal indicators an...
Kochalko,"Why we should stop worrying about high impact journal indicators an...
 
David Gough - Evidence informed policy making - 27 June 2017
David Gough - Evidence informed policy making - 27 June 2017David Gough - Evidence informed policy making - 27 June 2017
David Gough - Evidence informed policy making - 27 June 2017
 
Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...
Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...
Measuring Progress: Indicators, Data Sources and Assessment | Laszlo Pinter, ...
 
TIARA Module 3 Design and Analysis Dr. Anne Sales 082019
TIARA Module 3 Design and Analysis  Dr. Anne Sales 082019TIARA Module 3 Design and Analysis  Dr. Anne Sales 082019
TIARA Module 3 Design and Analysis Dr. Anne Sales 082019
 
Digital Scholar Webinar: Clinicaltrials.gov Registration and Reporting Documents
Digital Scholar Webinar: Clinicaltrials.gov Registration and Reporting DocumentsDigital Scholar Webinar: Clinicaltrials.gov Registration and Reporting Documents
Digital Scholar Webinar: Clinicaltrials.gov Registration and Reporting Documents
 
Adams, "Profiles, not Metrics; Why it is important to drill into the data tha...
Adams, "Profiles, not Metrics; Why it is important to drill into the data tha...Adams, "Profiles, not Metrics; Why it is important to drill into the data tha...
Adams, "Profiles, not Metrics; Why it is important to drill into the data tha...
 
National archetype governance in Norway
National archetype governance in NorwayNational archetype governance in Norway
National archetype governance in Norway
 
Full METL Research to Faster Miracles
Full METL Research to Faster MiraclesFull METL Research to Faster Miracles
Full METL Research to Faster Miracles
 
TIARA Module 2 Anne Sales Theory & Approaches 06192019
TIARA Module 2 Anne Sales Theory & Approaches 06192019TIARA Module 2 Anne Sales Theory & Approaches 06192019
TIARA Module 2 Anne Sales Theory & Approaches 06192019
 
Evaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk predictionEvaluation of the clinical value of biomarkers for risk prediction
Evaluation of the clinical value of biomarkers for risk prediction
 
Journal club summary: Open Science save lives
Journal club summary: Open Science save livesJournal club summary: Open Science save lives
Journal club summary: Open Science save lives
 

Similaire à Denise Esserman MedicReS World Congress 2015

Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...US-Ignite
 
Design and evaluation of an interactive proof-of-concept dashboard for genera...
Design and evaluation of an interactive proof-of-concept dashboard for genera...Design and evaluation of an interactive proof-of-concept dashboard for genera...
Design and evaluation of an interactive proof-of-concept dashboard for genera...Robin De Croon
 
1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptxRahulTr22
 
Open data meetup nyc 1 23-14
Open data meetup nyc 1 23-14Open data meetup nyc 1 23-14
Open data meetup nyc 1 23-14Vivian S. Zhang
 
Big Data Privacy - Society Issues + Big Data
Big Data Privacy - Society Issues + Big DataBig Data Privacy - Society Issues + Big Data
Big Data Privacy - Society Issues + Big DataSylvia Ogweng
 
DAMA Webinar - Big and Little Data Quality
DAMA Webinar - Big and Little Data QualityDAMA Webinar - Big and Little Data Quality
DAMA Webinar - Big and Little Data QualityDATAVERSITY
 
Dama webinar-little-data-in-a-big-data-world-160217211043
Dama webinar-little-data-in-a-big-data-world-160217211043Dama webinar-little-data-in-a-big-data-world-160217211043
Dama webinar-little-data-in-a-big-data-world-160217211043SeilaIglesiasDomngue
 
The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...
The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...
The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...Kristin Wolff
 
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analyticsBill Jacobs
 
Sharing and standards christopher hart - clinical innovation and partnering...
Sharing and standards   christopher hart - clinical innovation and partnering...Sharing and standards   christopher hart - clinical innovation and partnering...
Sharing and standards christopher hart - clinical innovation and partnering...Christopher Hart
 
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...Warren Kibbe
 
Sdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalSdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalkimlyman
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxwahiba ben abdessalem
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfvishal choudhary
 
Plum Analytics - Clinical Citations: Telling the Story of Impact
Plum Analytics - Clinical Citations: Telling the Story of ImpactPlum Analytics - Clinical Citations: Telling the Story of Impact
Plum Analytics - Clinical Citations: Telling the Story of ImpactMarianne Parkhill
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxssuser1a4f0f
 
The End of the Drug Development Casino?
The End of the Drug Development Casino?The End of the Drug Development Casino?
The End of the Drug Development Casino?Paul Agapow
 
Perspectives from the NIH Associate Director for Data Science (ADDS) Office
Perspectives from the NIH Associate Director for Data Science (ADDS) OfficePerspectives from the NIH Associate Director for Data Science (ADDS) Office
Perspectives from the NIH Associate Director for Data Science (ADDS) OfficeAmazon Web Services
 

Similaire à Denise Esserman MedicReS World Congress 2015 (20)

Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
 
Design and evaluation of an interactive proof-of-concept dashboard for genera...
Design and evaluation of an interactive proof-of-concept dashboard for genera...Design and evaluation of an interactive proof-of-concept dashboard for genera...
Design and evaluation of an interactive proof-of-concept dashboard for genera...
 
1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx1. Data Science overview - part1.pptx
1. Data Science overview - part1.pptx
 
Open data meetup nyc 1 23-14
Open data meetup nyc 1 23-14Open data meetup nyc 1 23-14
Open data meetup nyc 1 23-14
 
Big Data Privacy - Society Issues + Big Data
Big Data Privacy - Society Issues + Big DataBig Data Privacy - Society Issues + Big Data
Big Data Privacy - Society Issues + Big Data
 
Let’s All Agree on What We’re Counting and How
Let’s All Agree on What We’re Counting and HowLet’s All Agree on What We’re Counting and How
Let’s All Agree on What We’re Counting and How
 
DAMA Webinar - Big and Little Data Quality
DAMA Webinar - Big and Little Data QualityDAMA Webinar - Big and Little Data Quality
DAMA Webinar - Big and Little Data Quality
 
Dama webinar-little-data-in-a-big-data-world-160217211043
Dama webinar-little-data-in-a-big-data-world-160217211043Dama webinar-little-data-in-a-big-data-world-160217211043
Dama webinar-little-data-in-a-big-data-world-160217211043
 
The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...
The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...
The Emerging Workforce Data Ecosystem: New Strategies, Partners & Tools Helpi...
 
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
2013 10 cu leeds school big data conference - bill jacobs - revolution analytics
 
Carpenter - Lets remove "alt" from altmetrics - ER&L Presentation
Carpenter - Lets remove "alt" from altmetrics - ER&L PresentationCarpenter - Lets remove "alt" from altmetrics - ER&L Presentation
Carpenter - Lets remove "alt" from altmetrics - ER&L Presentation
 
Sharing and standards christopher hart - clinical innovation and partnering...
Sharing and standards   christopher hart - clinical innovation and partnering...Sharing and standards   christopher hart - clinical innovation and partnering...
Sharing and standards christopher hart - clinical innovation and partnering...
 
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...
 
Sdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) finalSdal air health and social development (jan. 27, 2014) final
Sdal air health and social development (jan. 27, 2014) final
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
Data_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdfData_Science_Applications_&_Use_Cases.pdf
Data_Science_Applications_&_Use_Cases.pdf
 
Plum Analytics - Clinical Citations: Telling the Story of Impact
Plum Analytics - Clinical Citations: Telling the Story of ImpactPlum Analytics - Clinical Citations: Telling the Story of Impact
Plum Analytics - Clinical Citations: Telling the Story of Impact
 
Data_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptxData_Science_Applications_&_Use_Cases.pptx
Data_Science_Applications_&_Use_Cases.pptx
 
The End of the Drug Development Casino?
The End of the Drug Development Casino?The End of the Drug Development Casino?
The End of the Drug Development Casino?
 
Perspectives from the NIH Associate Director for Data Science (ADDS) Office
Perspectives from the NIH Associate Director for Data Science (ADDS) OfficePerspectives from the NIH Associate Director for Data Science (ADDS) Office
Perspectives from the NIH Associate Director for Data Science (ADDS) Office
 

Plus de MedicReS

Possibilities of patient education on clinical research during recruitment in...
Possibilities of patient education on clinical research during recruitment in...Possibilities of patient education on clinical research during recruitment in...
Possibilities of patient education on clinical research during recruitment in...MedicReS
 
Decision Tree for Adverse Event Reporting – ALL STUDIES
Decision Tree for Adverse Event Reporting – ALL STUDIESDecision Tree for Adverse Event Reporting – ALL STUDIES
Decision Tree for Adverse Event Reporting – ALL STUDIESMedicReS
 
Randomization in Clinical Trials
Randomization in Clinical TrialsRandomization in Clinical Trials
Randomization in Clinical TrialsMedicReS
 
MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...
MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...
MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...MedicReS
 
MedicReS BeGMR TITCK IKU ARASTIRMA BROSURU
MedicReS BeGMR TITCK IKU ARASTIRMA BROSURUMedicReS BeGMR TITCK IKU ARASTIRMA BROSURU
MedicReS BeGMR TITCK IKU ARASTIRMA BROSURUMedicReS
 
MedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMU
MedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMUMedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMU
MedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMUMedicReS
 
MedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLU
MedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLUMedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLU
MedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLUMedicReS
 
MedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETI
MedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETIMedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETI
MedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETIMedicReS
 
MedicReS BeGMR TITCK IKU KALITE YONETIMI
MedicReS BeGMR TITCK IKU KALITE YONETIMIMedicReS BeGMR TITCK IKU KALITE YONETIMI
MedicReS BeGMR TITCK IKU KALITE YONETIMIMedicReS
 
MedicReS BeGMR TITCK IKU DESTEKLEYICI
MedicReS BeGMR TITCK IKU DESTEKLEYICIMedicReS BeGMR TITCK IKU DESTEKLEYICI
MedicReS BeGMR TITCK IKU DESTEKLEYICIMedicReS
 
MedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACI
MedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACIMedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACI
MedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACIMedicReS
 
MedicReS BeGMR TITCK IKU ETIK KURUL
MedicReS BeGMR TITCK IKU ETIK KURULMedicReS BeGMR TITCK IKU ETIK KURUL
MedicReS BeGMR TITCK IKU ETIK KURULMedicReS
 
MedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERI
MedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERIMedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERI
MedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERIMedicReS
 
MedicReS BeGMR TITCK IKU Tanimlar
MedicReS BeGMR TITCK IKU TanimlarMedicReS BeGMR TITCK IKU Tanimlar
MedicReS BeGMR TITCK IKU TanimlarMedicReS
 
MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...
MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...
MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...MedicReS
 
MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...
MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...
MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...MedicReS
 
MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...
MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...
MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...MedicReS
 
MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...
MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...
MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...MedicReS
 
MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...
MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...
MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...MedicReS
 
MedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. Shamoo
MedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. ShamooMedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. Shamoo
MedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. ShamooMedicReS
 

Plus de MedicReS (20)

Possibilities of patient education on clinical research during recruitment in...
Possibilities of patient education on clinical research during recruitment in...Possibilities of patient education on clinical research during recruitment in...
Possibilities of patient education on clinical research during recruitment in...
 
Decision Tree for Adverse Event Reporting – ALL STUDIES
Decision Tree for Adverse Event Reporting – ALL STUDIESDecision Tree for Adverse Event Reporting – ALL STUDIES
Decision Tree for Adverse Event Reporting – ALL STUDIES
 
Randomization in Clinical Trials
Randomization in Clinical TrialsRandomization in Clinical Trials
Randomization in Clinical Trials
 
MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...
MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...
MedicReS BeGMR TITCK IKU KLINIK ARASTIRMANIN YURUTULMESI ICIN GEREKLI TEMEL B...
 
MedicReS BeGMR TITCK IKU ARASTIRMA BROSURU
MedicReS BeGMR TITCK IKU ARASTIRMA BROSURUMedicReS BeGMR TITCK IKU ARASTIRMA BROSURU
MedicReS BeGMR TITCK IKU ARASTIRMA BROSURU
 
MedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMU
MedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMUMedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMU
MedicReS BeGMR TITCK IKU BILGILENDIRILMIS GONULLU FORMU
 
MedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLU
MedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLUMedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLU
MedicReS BeGMR TITCK IKU ARASTIRMA PROTOKOLU
 
MedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETI
MedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETIMedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETI
MedicReS BeGMR TITCK IKU IZLEYICI VE IZLEME FAALIYETI
 
MedicReS BeGMR TITCK IKU KALITE YONETIMI
MedicReS BeGMR TITCK IKU KALITE YONETIMIMedicReS BeGMR TITCK IKU KALITE YONETIMI
MedicReS BeGMR TITCK IKU KALITE YONETIMI
 
MedicReS BeGMR TITCK IKU DESTEKLEYICI
MedicReS BeGMR TITCK IKU DESTEKLEYICIMedicReS BeGMR TITCK IKU DESTEKLEYICI
MedicReS BeGMR TITCK IKU DESTEKLEYICI
 
MedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACI
MedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACIMedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACI
MedicReS BeGMR TITCK IKU SORUMLU ARASTIRMACI
 
MedicReS BeGMR TITCK IKU ETIK KURUL
MedicReS BeGMR TITCK IKU ETIK KURULMedicReS BeGMR TITCK IKU ETIK KURUL
MedicReS BeGMR TITCK IKU ETIK KURUL
 
MedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERI
MedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERIMedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERI
MedicReS BeGMR TITCK IKU IYI KLINIK UYGULAMALARIN TEMEL ILKELERI
 
MedicReS BeGMR TITCK IKU Tanimlar
MedicReS BeGMR TITCK IKU TanimlarMedicReS BeGMR TITCK IKU Tanimlar
MedicReS BeGMR TITCK IKU Tanimlar
 
MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...
MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...
MedicReS Conference 2017 Istanbul - RESEARCH TOPIC PRIORITIES RELEVANT TO DIF...
 
MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...
MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...
MedicReS Conference 2017 Istanbul - Fostering Responsible Conduct of Research...
 
MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...
MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...
MedicReS Conference 2017 Istanbul - Journal Metrics: The Impact Factor and Al...
 
MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...
MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...
MedicReS Conference 2017 Istanbul - Ethical issues of secondary analysis of a...
 
MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...
MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...
MedicReS Conference 2017 Istanbul - Integrity of Authorship in Research Publi...
 
MedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. Shamoo
MedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. ShamooMedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. Shamoo
MedicReS Winter School 2017 Vienna - Ethics of Cancer Trials - Adil E. Shamoo
 

Dernier

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.pptamreenkhanum0307
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 

Dernier (20)

PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Machine learning classification ppt.ppt
Machine learning classification  ppt.pptMachine learning classification  ppt.ppt
Machine learning classification ppt.ppt
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 

Denise Esserman MedicReS World Congress 2015

  • 1. The Future of Frequentist Hypothesis Testing Denise Esserman, PhD Yale Center for Analytical Sciences (YCAS) Yale School of Public Health October 19-25 | 2015 New York www.medicres.org
  • 2. Outline • Definition of Big Data • Expanding into Medicine • Statistical Concerns • The Future October 19-25 | 2015 New York www.medicres.org
  • 3. “Big data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks that everyone else is doing it, so everyone claims they are doing it…” Dan Ariely 2013 [Terry Speed talk, 2014] October 19-25 | 2015 New York www.medicres.org
  • 4. What are/is “Big Data”? October 19-25 | 2015 New York www.medicres.org • “Buzzword” • Multiple Definitions – The V’s: volume, velocity, variability, variety, veracity, value (and complexity) • Variability in the quantity and quality of the data
  • 5. Origin • Believed to have originated with Web search companies – Querying very large distributed aggregations of loosely-structured data1 • Does not always reference the volume of data October 19-25 | 2015 New York www.medicres.org
  • 6. Big Data Trends • Google trends October 19-25 | 2015 New York www.medicres.org
  • 7. “The hopeful vision of big data is that organizations will be able to harvest and harness every byte of relevant data and use it to make the best decisions. Big data technologies not only support the ability to collect large amounts, but more importantly, the ability to understand and take advantage of its full value.” Mark Troester14 October 19-25 | 2015 New York www.medicres.org
  • 8. Big Data in Medicine • “Potential lies in innovative ways it can be linked, related, and integrated to provide more detailed and personalized information than is possible with data from a single source”7 • For health care providers to offer personalized medicine October 19-25 | 2015 New York www.medicres.org
  • 9. NIH – Big Data to Knowledge6 “The ability to harvest the wealth of information contained in biomedical Big Data will advance our understanding of human health and disease; however, lack of appropriate tools, poor data accessibility, and insufficient training, are major impediments to rapid translational impact. To meet this challenge, the National Institutes of Health (NIH) launched the Big Data to Knowledge (BD2K) initiative in 2012.” October 19-25 | 2015 New York www.medicres.org
  • 10. BD2K Mission Statement “BD2K is a trans-NIH initiative established to enable biomedical research as a digital research enterprise, to facilitate discovery and support new knowledge, and to maximize community engagement.” October 19-25 | 2015 New York www.medicres.org
  • 11. BD2K Four Major Aims • To facilitate broad use of biomedical digital assets by making them discoverable, accessible, and citable. • To conduct research and develop the methods, software, and tools needed to analyze biomedical Big Data. • To enhance training in the development and use of methods and tools necessary for biomedical Big Data science. • To support a data ecosystem that accelerates discovery as part of a digital enterprise. October 19-25 | 2015 New York www.medicres.org
  • 12. Biomedical Big Data6 • More than just very large data or a large number of data sources. – Complexity, challenges, and new opportunities presented by the combined analysis of data. • Diverse and complex. – Imaging, phenotypic, molecular, exposure, health, behavioral, and many other types of data. October 19-25 | 2015 New York www.medicres.org
  • 13. • Faces many challenges. – Unwieldy amount of information – Lack of organization and access to data and tools – Insufficient training in data science methods • Spectacular opportunities. – Maximize the potential of existing data and enable new directions for research. October 19-25 | 2015 New York www.medicres.org
  • 14. Quantity of data does not mean one can ignore foundational issues of measurement and construct validity and reliability and dependencies among data5 October 19-25 | 2015 New York www.medicres.org
  • 15. Google Flu Trends2,3 • Machine Learning Algorithm – Predict number of flu cases based on Google Search Terms – Theory Free – Misunderstanding about uncertainties in data collection and modeling process • Inaccurate results over time – Lack of statistics: did not know what linked search terms to spread of flu October 19-25 | 2015 New York www.medicres.org
  • 16. Google Flu Trends (cont) • Correlation rather than causation – Cheaper and easier • Theory-free analysis is fragile • Intended as a “complementary signal” rather than stand alone forecasting tool4 October 19-25 | 2015 New York www.medicres.org
  • 17. “There are a lot of small data problems that occur in big data. They don’t disappear because you’ve got lots of the stuff. They get worse.” David Spielgelhalter “Big data is like a big trash dump. You have to know how to find the nuggets so it’s profitable.” Vin Gupta October 19-25 | 2015 New York www.medicres.org
  • 18. Indexing vs. Analyzing Big Data • Search companies index it – Make relevant data easy to use • Statisticians analyze it – Find structure within the data October 19-25 | 2015 New York www.medicres.org
  • 19. Data Scientists7 • People who draw insights from large quantities of data – Innovative problem solvers – Expertise in statistical modeling and machine learning – Specialized programming skills – Solid grasp of problem domain • Data science is blend of statistical, mathematical, and computational sciences October 19-25 | 2015 New York www.medicres.org
  • 20. Statistical Disconnect • Statisticians should be leaders of Big Data and data science movement8 – Scope goes beyond traditional activities • Statisticians need to be more engaged – Need to develop the skills to handle the sheer volume of data • Data scientists need to engage more statisticians (or more statisticians need to become data scientists) October 19-25 | 2015 New York www.medicres.org
  • 21. “Better data matters because simply having Big Data does not guarantee reliable answers for Big Questions.” Robert Rodriguez October 19-25 | 2015 New York www.medicres.org
  • 22. Some of the Statistical Concerns • Sampling populations – Sampling error – not representative • Confounders • Multiple Testing • Bias – Sampling bias – not randomly chosen • Overfitting October 19-25 | 2015 New York www.medicres.org
  • 23. Big n problem • Myth that problem is only computational in nature, not statistical because of large n • Standard errors can be large even with Big Data – Issue with large p October 19-25 | 2015 New York www.medicres.org
  • 24. • Scale of data requires spreading across cluster or grid of computers • Computational work to be distributed with the data – Google MapReduce Model for parallel programming (Apache Hadoop) October 19-25 | 2015 New York www.medicres.org
  • 25. Software Alchemy (SA) • Simple, powerful method to reduce computation • Partition the data in r groups and calculate average of estimator across groups – Requires partitions to be distributed similarly • May need initial “shuffle” step • Works well for any asymptotically normal estimator – Also works for p growing October 19-25 | 2015 New York www.medicres.org
  • 26. Big p problem • High-dimensional data • Dimension reduction – e.g. Principal components analysis, variable selection • Issues: – Multiple comparisons and simultaneous inference – Sparsity of data October 19-25 | 2015 New York www.medicres.org
  • 27. In the future… • Can we find methods that allow larger values of p than “safe” o(√n)? – Dimension reduction may distort results • Can we more easily verify technical assumptions? – e.g. lack of potential consistency of the LASSO • Can we find more general methods? October 19-25 | 2015 New York www.medicres.org
  • 28. Theoretical Null Distributions • Null distribution is most often not estimated, but hypothesized in classic hypothesis testing • Incorrect null might lead to false inference • Influences – Correlation – Incorrect assumptions – Unobserved covariates October 19-25 | 2015 New York www.medicres.org
  • 29. Empirical Null Distribution • Empirical null estimated from study’s data • Does not assume “nice” normal with variance going to 0 • Need independence assumption, but do not need identical assumption – Can have heterogeneous groups October 19-25 | 2015 New York www.medicres.org
  • 30. Big-data Clinical Trials (BCT)10 • Neglected problem in RCT – analysis typically based on different effectiveness of different interventions provided at baseline • Want to be able to analyze association of baseline treatment and its subsequent dynamic processes October 19-25 | 2015 New York www.medicres.org
  • 31. Example: Blood Pressure • RCT: – Effectiveness of Antihypertensive on BP control – BP measured at specified outcomes – Long term outcomes (e.g. stroke) • BCT: – Maintenance stable BP every day, hour, minute – “Dosing” equipment October 19-25 | 2015 New York www.medicres.org
  • 32. Epidemiologic Perspective • Big data is useful to detect rare drug-related side effects, not likely to be observed in RCT • Chance to look at rare diseases • Can contribute to an understanding of the strengths and limitations of new population sources12 October 19-25 | 2015 New York www.medicres.org
  • 33. Future of BCT10 • How will this be defined? • What is the “right” data to collect? • Who is in a position to design this trial? • How do we handle threats of big data? • How do we incorporate the non-static populations? October 19-25 | 2015 New York www.medicres.org
  • 34. Moving Beyond Rectangular Data • Structure of the data may be irregular, non- structured – Knowledge will change over time11 • Varying types of data – Pictures, videos, images – Unstructured and Sei-structured from Social Media October 19-25 | 2015 New York www.medicres.org
  • 35. Machine Learning “…a subfield of computer science that evolved from the study of pattern recognition and computational learning theory in artificial intelligence. Machine learning explores the study and construction of algorithms that can learn from and make predictions on data. Such algorithms operate by building a model from example inputs in order to make data- driven predictions or decisions, rather than following strictly statistic program instructions.”9 October 19-25 | 2015 New York www.medicres.org
  • 36. Machine Learning Tasks • Supervised learning – Example inputs and output – learn a rule that maps inputs to outputs • Semi-supervised learning – Incomplete training signal (i.e. target outputs missing) • Unsupervised learning – Leave on own to find structure in input • Reinforcement learning – Interaction with dynamic environment October 19-25 | 2015 New York www.medicres.org
  • 37. Example: Support Vector Machines • Supervised learning method • Used for classification and regression analysis • Can perform non-linear classification October 19-25 | 2015 New York www.medicres.org
  • 38. Challenges with Machine Learning • Need to choose the appropriate algorithm • Need to be able to define hyper-parameters – One or model parameters • Data needs to be in appropriate format – This is not trivial • Execution time grows with number of attributes and data instances October 19-25 | 2015 New York www.medicres.org
  • 39. Future of Machine Learning • Automatic searches for optimal algorithm and hyper-parameters – Very time consuming, limited usefulness at present • More user friendly approaches – e.g. allow healthcare researcher to efficiently and independently build predictive model13 October 19-25 | 2015 New York www.medicres.org
  • 40. Example: Machine Learning for Big Clinical Data (MLBCD)13 • Supports whole process of iterative machine learning on big clinical data – Clinical parameter extraction – Feature construction – Algorithm and hyper-parameter selection – Model building – Model evaluation • Can use after once (1) defined study population and research question, (2) obtained clinical data set, and (3) prepped data, including cleaning and filling in missing data October 19-25 | 2015 New York www.medicres.org
  • 41. Data Set Preparation • Tremendous amount of work goes into getting a data set together – Acquire, normalize, clean • e.g. Pivoting entity-attribute value format EMR to relational table formats13 October 19-25 | 2015 New York www.medicres.org ID (entity Test # (Attribute) Pulse (Value) 100100 Test 1 98 989021 Test 2 101 100100 Test 2 75 989021 Test 3 99 989021 Test 4 88 ID Test_1 Test_2 Test_3 100100 98 75 null 989021 null 101 99
  • 42. • Even bigger challenge with large data sets – Different formats – Different locations – Data quality and governance – Security, privacy and regulatory challenges • Surprisingly little work done here – and it should be a priority! October 19-25 | 2015 New York www.medicres.org
  • 43. Example: Fitbit • Study of Cardiac Risk • Use Fitbit to measure daily step counts for 3 years (4000 participants) • Participants need to upload to laptop/phone and then sync to Fitbit server • Devise holds one month of data • Need to then connect with other study data October 19-25 | 2015 New York www.medicres.org
  • 44. Where should the field head? • Need new techniques for data management • New tools for data analysis • New tools for data visualization • Ways to acquire and analyze unstructured text data October 19-25 | 2015 New York www.medicres.org
  • 45. “Big data is not about the technologies to store massive amounts of data, it is about creating a flexible infrastructure with high- performance computing, high-performance analytics and governance – in a deployment model that makes sense for the organization.”14 Mark Troester October 19-25 | 2015 New York www.medicres.org
  • 46. References 1. http://www.webopedia.com/TERM/B/big_data.html (accessed October 15, 2015) 2. http://simplystatistics.org/2014/05/07/why-big-data-is-in-trouble-they-forgot-about-applied-statistics/ (accessed October 15,2015) 3. http://www.ft.com/intl/cms/s/2/21a6e7d8-b479-11e3-a09a-00144feabdc0.html (accessed October 15, 2015) 4. http://bits.blogs.nytimes.com/2014/03/28/google-flu-trends-the-limits-of-big-data/?_r=0 (accessed October 16, 2015) 5. Lazer D, Kennedy R, King G, Vespignani. The Parable of Google Flu: Traps in Big Data Analysis. Science, 343: 1203-05, (2014). 6. https://datascience.nih.gov/bd2k/about/what (accessed October 15, 2015) 7. http://magazine.amstat.org/blog/2012/06/01/prescorner/ (accessed October 17, 2015) 8. http://magazine.amstat.org/blog/2013/06/01/the-asa-and-big-data/ (accessed October 18, 2015) 9. https://en.wikipedia.org/wiki/Machine_learning (accessed October 18, 2015) 10. Wang SD. Opportunities and challenges of clinical research in the big-data era: from RCT to BCT. J Thorac Dis, 5(6): 721- 723, (2013) 11. Wang SD, Shen Y. Redefining big-data clinical trial (BCT). Annals of Translational Medicine. 2(10): 96, (2014) 12. Gange SJ, Golub ET. From smallpox to big data: The Next 100 years of epidemiologic methods. American Journal of Epidemiology. DOI: 10.1093/aje/kwv150 13. Luo G. MLBCD: a machine learning tool for big clinical data. Inf Sci Syst. 3:3, 2015. 14. Big Data Meets Big Data Analytics. SAS White Paper October 19-25 | 2015 New York www.medicres.org