Meta-regression with DisMod-MR: how robust is the model?

•Download as PPTX, PDF•

2 likes•1,789 views

Institute for Health Metrics and Evaluation - University of Washington

GHME 2013 Conference Session: Dismod MR workshop Date: June 18 2013 Presenter: Hannah Peterson Institute: Institute for Health Metrics and Evaluation (IHME), University of Washington

Technology Education

Meta-regression with DisMod-MR:
how robust is the model?
June 18, 2013
Hannah M Peterson
Post-Bachelor Fellow

YLDs
• Measures morbidity
• Requires age-specific prevalence
o For 291 outcomes
o For 2 sexes
o For 187 countries
o For 3 years
3

Is negative-binomial distribution
the best choice?
DisMod-MR
4

Alternative distributions
5
Distribution Probability Density Function
Normal
Lognormal
Binomial
Negative-
binomial

Alternative distributions
6
Distribution Probability Density Function
Normal
Lognormal
Binomial
Negative-
binomial

Alternative distributions
7
Distribution Probability Density Function
Normal
Lognormal
Binomial
Negative-
binomial

Alternative distributions
8
Distribution Probability Density Function
Normal
Lognormal
Binomial
Negative-
binomial

Potential experimental frameworks
• Data collection
o Ideal
o Impractical
• Simulation
o Impossible to know true data distribution
• Out-of-sample cross validation
o Do not have to choose distribution
9

Out-of-sample predictive validity
• Randomly select 25% of
data to use as “test data”
11

Out-of-sample predictive validity
• Randomly select 25% of
data to use as “test data”
12

Out-of-sample predictive validity
• Randomly select 25% of
data to use as “test data”
• Fit the remaining 75% of
data (“training data”)
13

Out-of-sample predictive validity
• Randomly select 25% of
data to use as “test data”
• Fit the remaining 75% of
data (“training data”)
• Use fit to calculate statistics
for test data
14

Comparing distributions
16
How to determine the best distribution?

Results
18
Percent of wins (%)
Distribution Bias MAE PC Total
Normal 22.1 20.6 34.6 25.7
Lognormal 29.7 13.0 36.5 26.4
Binomial 26.3 48.3 1.9 25.5
Negative-
binomial
21.9 18.1 27.1 22.4

Conclusions
• Choice of distribution doesn’t greatly influence results
• Best overall performance: lognormal distribution
o Contingent on method to adjust data whose value is 0
• Further investigate when each distribution performs best
o Dependent on number of covariates, priors, amount of data?
19

Thank you
Hannah Peterson
peterhm@uw.edu
www.healthmetricsandevaluation.org

What's hot

Observational study is divided into descriptive and analytical studies. Non-experimental Observational because there is no individual intervention Treatment and exposures occur in a “non-controlled” environment Individuals can be observed prospectively or retrospectively COHORT STUDY- an “observational” design comparing individuals with a known risk factor or exposure with others without the risk factor or exposure. looking for a difference in the risk (incidence) of a disease over time. best observational design data usually collected prospectively (some retrospective) CASE CONTROL - EFFECT TO CAUSE Retrospective When disease is rare .

Analytical study designs.pptx

Aryasree L

4. Calculate samplesize for cross-sectional studies

Azmi Mohd Tamil

Types of data

Akanksha Gupta

Descriptive Epidemiology (including Measurement in epidemiology)

Dr. Animesh Gupta

Systematic review

Khalid Mahmood

0620 w13 qp_32

King Ali

Epidemiological Exercises on case control studies

Jayaramachandran S

Big Data: Learning from MIMIC- Celi

intensivecaresociety

GLOBAL PERSPECTIVE CAMBRIDGE IGCSE: KEY TERMS

George Dumitrache

L16 rm (systematic review and meta-analysis)-samer

Dr Ghaiath Hussein

Association & causation (2016)

Shyam Ashtekar

RCT Critical Appraisal - Validity

Dr. Majdi Al Jasim

Question study design

Anisur Rahman

Study design of Prof Zak

Professor M Zak Khalil, MD, MRCP (UK), FACC, FESC

Bias, confounding and causality in p'coepidemiological research

samthamby79

It is a fundamental but common mistake to regard clinical trials as being a form of representative inference. The key issue is comparability. Experiments do not involve typical material. In clinical trials; it is concurrent control that is key and randomisation is a device for calculating standard errors appropriately that should reflect the design. Generalisation beyond the clinical trial always involves theory.

Clinical trials are about comparability not generalisability V2.pptx

StephenSenn3

Critical Appraisal Of Research Essay Example Paper.docx

studywriters

Sample size calculation - a brief overview

Azmi Mohd Tamil

0620 28

King Ali

6. Calculate samplesize for cohort studies

Azmi Mohd Tamil

What's hot (20)

Analytical study designs.pptx

4. Calculate samplesize for cross-sectional studies

Types of data

Descriptive Epidemiology (including Measurement in epidemiology)

Systematic review

0620 w13 qp_32

Epidemiological Exercises on case control studies

Big Data: Learning from MIMIC- Celi

GLOBAL PERSPECTIVE CAMBRIDGE IGCSE: KEY TERMS

L16 rm (systematic review and meta-analysis)-samer

Association & causation (2016)

RCT Critical Appraisal - Validity

Question study design

Study design of Prof Zak

Bias, confounding and causality in p'coepidemiological research

Clinical trials are about comparability not generalisability V2.pptx

Critical Appraisal Of Research Essay Example Paper.docx

Sample size calculation - a brief overview

0620 28

6. Calculate samplesize for cohort studies

More from Institute for Health Metrics and Evaluation - University of Washington

Salud Mesoamerica Initiative: Mixed-Methods Evaluation Plan

Institute for Health Metrics and Evaluation - University of Washington

Salud Mesoamerica Initiative: Select results from the third operation measure...

Institute for Health Metrics and Evaluation - University of Washington

Salud Mesoamerica Process Evaluation: Evidence on Culture Change in Health Sy...

Institute for Health Metrics and Evaluation - University of Washington

Salud Mesoamérica Initiative: Mixed-Methods Evaluation Plan

Institute for Health Metrics and Evaluation - University of Washington

Salud Mesoamerica Initiative: Select results from the second operation measur...

Institute for Health Metrics and Evaluation - University of Washington

Salud Mesoamérica Initiative: Select results from the baseline measurement

Institute for Health Metrics and Evaluation - University of Washington

Verbal autopsy interviews were conducted with caretakers for all deaths of children under the age of 5 in Yucatán, Mexico during 2015-2016. Results from the verbal autopsy were triangulated with data from vital registration systems and medical records to check for concordance at both the individual and population level. Findings suggest that overall the vital registration system for deaths of children under 5 is strong, however concordance between vital registration systems and medical records varies based on cause of death and age of the deceased (neonatal vs. child). This presentation summarizes methods and results for the quality of mortality statistics analysis and was presented at the 2019 Instituto Nacional de Salud Public Conference in Cuernavaca, Mexico in March 2019.

Quality of under-5 mortality statistics in Yucatán, Mexico (Spanish)

Institute for Health Metrics and Evaluation - University of Washington

The first phase of the “Under-5 Child Health and Mortality Statistics Project” sough to strengthen the evidence and understanding of key factors related to under-5 mortality in Yucatán, Mexico using Verbal Autopsy data collection tools with an added battery on search for care processes for U5 deaths which occurred in Yucatán during 2015-2016, and the triangulation of Verbal Autopsy reports with data from vital registration systems and medical records. This presentation, presented to stakeholders at a results dissemination workshop in October 2017 in Mérida, Yucatán, provides an overview of the project and summarizes key results and learnings from the research.

Under-5 mortality and healthcare in Yucatán – 2017 Results dissemination work...

Institute for Health Metrics and Evaluation - University of Washington

The second phase of the “Under-5 Child Health and Mortality Statistics Project” sough to strengthen the evidence and understanding of key factors related to under-5 mortality in Yucatán, Mexico through the implementation and evaluation of both community and facility-based interventions, aimed at improving recognition of alarm signs among mothers and caretakers for common causes of death in children and improving the quality of cause of death certification for deaths of children under 5, respectively. This presentation, presented virtually to stakeholders at a results dissemination workshop in January 2021, provides an overview of the project and summarizes key results and learnings from the research.

Under-5 mortality and healthcare in Yucatán – 2021 Results dissemination work...

Institute for Health Metrics and Evaluation - University of Washington

The Global Fund Prospective Country Evaluation

Institute for Health Metrics and Evaluation - University of Washington

The Prospective Country Evaluation is an embedded mixed-methods evaluation platform designed to examine the Global Fund business model, investments and contribution to disease program outcomes and impact in eight countries. Findings were synthesized across the 8 countries to provide timely and actionable recommendations to support program improvements and accelerate progress towards the objectives of the Global Fund 2017-2022 Strategy.

Prospective Country Evaluation 2019 Synthesis Findings

Institute for Health Metrics and Evaluation - University of Washington

Global Burden of Disease (GBD) 2017 study findings

Institute for Health Metrics and Evaluation - University of Washington

In “Measuring human capital: a systematic analysis of 195 countries and territories, 1990–2016” IHME provides the first internationally comparable index of human capital. Building on past efforts, the study offers a measure of expected human capital that incorporates educational attainment, education quality or learning, functional health status, and survival for 195 countries, from 1990 to 2016.

Expected Human Capital: Key themes and talking points

Institute for Health Metrics and Evaluation - University of Washington

Global Health Financing

Institute for Health Metrics and Evaluation - University of Washington

Maternal and Child Mortality in the United States

Institute for Health Metrics and Evaluation - University of Washington

Salud Mesoamérica 2015 Initiative: Select results from the first operation me...

Institute for Health Metrics and Evaluation - University of Washington

Chronic diseases and their risk factors in the Kingdom of Saudi Arabia

Institute for Health Metrics and Evaluation - University of Washington

Speyer communicating dataforimpact_2015

Institute for Health Metrics and Evaluation - University of Washington

Understanding the costs of and constraints to health service delivery in Ghana

Institute for Health Metrics and Evaluation - University of Washington

ABCE: Understanding the costs of and constraints to health service delivery ...

Institute for Health Metrics and Evaluation - University of Washington

More from Institute for Health Metrics and Evaluation - University of Washington (20)

Salud Mesoamerica Initiative: Mixed-Methods Evaluation Plan

Salud Mesoamerica Initiative: Select results from the third operation measure...

Salud Mesoamerica Process Evaluation: Evidence on Culture Change in Health Sy...

Salud Mesoamérica Initiative: Mixed-Methods Evaluation Plan

Salud Mesoamerica Initiative: Select results from the second operation measur...

Salud Mesoamérica Initiative: Select results from the baseline measurement

Quality of under-5 mortality statistics in Yucatán, Mexico (Spanish)

Under-5 mortality and healthcare in Yucatán – 2017 Results dissemination work...

Under-5 mortality and healthcare in Yucatán – 2021 Results dissemination work...

The Global Fund Prospective Country Evaluation

Prospective Country Evaluation 2019 Synthesis Findings

Global Burden of Disease (GBD) 2017 study findings

Expected Human Capital: Key themes and talking points

Global Health Financing

Maternal and Child Mortality in the United States

Salud Mesoamérica 2015 Initiative: Select results from the first operation me...

Chronic diseases and their risk factors in the Kingdom of Saudi Arabia

Speyer communicating dataforimpact_2015

Understanding the costs of and constraints to health service delivery in Ghana

ABCE: Understanding the costs of and constraints to health service delivery ...

Recently uploaded

Building Digital Trust in a Digital Economy Veronica Tan, Director - Cyber Security Agency of Singapore Apidays Singapore 2024: Connecting Customers, Business and Technology (April 17 & 18, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

apidays

What is a good lead in your organisation? Which leads are priority? What happens to leads? When sales and marketing give different answers to these questions, or perhaps aren't sure of the answers at all, frustrations build and opportunities are left on the table. Join us for an illuminating session with Cian McLoughlin, HubSpot Principal Customer Success Manager, as we look at that crucial piece of the customer journey in which leads are transferred from marketing to sales.

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

HampshireHUG

BooK Now Call us at +918448380779 to hire a gorgeous and seductive call girl for sex. Take a Delhi Escort Service. The help of our escort agency is mostly meant for men who want sexual Indian Escorts In Delhi NCR. It should be noted that any impersonator will get 100 attention from our Young Girls Escorts in Delhi. They will assume the position of reliable allies. VIP Call Girl With Original Photos Book Tonight +918448380779 Our Cheap Price 1 Hour not available 2 Hours 5000 Full Night 8000 TAG: Call Girls in Delhi, Noida, Gurgaon, Ghaziabad, Connaught Place, Greater Kailash Delhi, Lajpat Nagar Delhi, Mayur Vihar Delhi, Chanakyapuri Delhi, New Friends Colony Delhi, Majnu Ka Tilla, Karol Bagh, Malviya Nagar, Saket, Khan Market, Noida Sector 18, Noida Sector 76, Noida Sector 51, Gurgaon Mg Road, Iffco Chowk Gurgaon, Rajiv Chowk Gurgaon All Delhi Ncr Free Home Deliver

08448380779 Call Girls In Civil Lines Women Seeking Men

Delhi Call girls

CNv6 Instructor Chapter 6 Quality of Service

giselly40

Choosing the right accounts payable services provider is a strategic decision that can significantly impact your business's financial performance and operational efficiency. By considering factors such as expertise, range of services, technology infrastructure, scalability, cost, and reputation, businesses can make informed decisions and select a provider that aligns with their unique needs and objectives. Partnering with the right provider can streamline accounts payable processes, drive cost savings, and position your business for long-term success. https://katprotech.com/accounts-payable-and-purchase-order-automation/

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

Katpro Technologies

GenCyber Cyber Security Day Presentation

Michael W. Hawkins

Enterprise Knowledge’s Urmi Majumder, Principal Data Architecture Consultant, and Fernando Aguilar Islas, Senior Data Science Consultant, presented "Driving Behavioral Change for Information Management through Data-Driven Green Strategy" on March 27, 2024 at Enterprise Data World (EDW) in Orlando, Florida. In this presentation, Urmi and Fernando discussed a case study describing how the information management division in a large supply chain organization drove user behavior change through awareness of the carbon footprint of their duplicated and near-duplicated content, identified via advanced data analytics. Check out their presentation to gain valuable perspectives on utilizing data-driven strategies to influence positive behavioral shifts and support sustainability initiatives within your organization. In this session, participants gained answers to the following questions: - What is a Green Information Management (IM) Strategy, and why should you have one? - How can Artificial Intelligence (AI) and Machine Learning (ML) support your Green IM Strategy through content deduplication? - How can an organization use insights into their data to influence employee behavior for IM? - How can you reap additional benefits from content reduction that go beyond Green IM?

Driving Behavioral Change for Information Management through Data-Driven Gree...

Enterprise Knowledge

What are drone anti-jamming systems? The drone anti-jamming systems and anti-spoof technology protect against interference, jamming, and spoofing of the UAVs. To protect their security, countries are beginning to research drone anti-jamming systems, also known as drone strike weapons. The anti-jam and anti-spoof technology protects against interference, jamming and spoofing. A drone strike weapon is a drone attack weapon that can attack and destroy enemy drones. So what is so unique about this amazing system?

What Are The Drone Anti-jamming Systems Technology?

Antenna Manufacturer Coco

Handwritten Text Recognition for manuscripts and early printed texts

Maria Levchenko

Imagine a world where information flows as swiftly as thought itself, making decision-making as fluid as the data driving it. Every moment is critical, and the right tools can significantly boost your organization’s performance. The power of real-time data automation through FME can turn this vision into reality. Aimed at professionals eager to leverage real-time data for enhanced decision-making and efficiency, this webinar will cover the essentials of real-time data and its significance. We’ll explore: FME’s role in real-time event processing, from data intake and analysis to transformation and reporting An overview of leveraging streams vs. automations FME’s impact across various industries highlighted by real-life case studies Live demonstrations on setting up FME workflows for real-time data Practical advice on getting started, best practices, and tips for effective implementation Join us to enhance your skills in real-time data automation with FME, and take your operational capabilities to the next level.

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Safe Software

Automating Google Workspace (GWS) & more with Apps Script

wesley chun

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Rafal Los

Slack Application Development 101 Slides

praypatel2

Scaling API-first – The story of a global engineering organization

Radu Cotescu

🐬 The future of MySQL is Postgres 🐘

RTylerCroy

Advantages of Hiring UIUX Design Service Providers for Your Business

Pixlogix Infotech

With more memory available, system performance of three Dell devices increased, which can translate to a better user experience Conclusion When your system has plenty of RAM to meet your needs, you can efficiently access the applications and data you need to finish projects and to-do lists without sacrificing time and focus. Our test results show that with more memory available, three Dell PCs delivered better performance and took less time to complete the Procyon Office Productivity benchmark. These advantages translate to users being able to complete workflows more quickly and multitask more easily. Whether you need the mobility of the Latitude 5440, the creative capabilities of the Precision 3470, or the high performance of the OptiPlex Tower Plus 7010, configuring your system with more RAM can help keep processes running smoothly, enabling you to do more without compromising performance.

Boost PC performance: How more available memory can improve productivity

Principled Technologies

2024: Domino Containers - The Next Step. News from the Domino Container commu...

Martijn de Jong

Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “The Role of Taxonomy and Ontology in Semantic Layers” at a webinar hosted by Progress Semaphore on April 16, 2024. Taxonomies at their core enable effective tagging and retrieval of content, and combined with ontologies they extend to the management and understanding of related data. There are even greater benefits of taxonomies and ontologies to enhance your enterprise information architecture when applying them to a semantic layer. A survey by DBP-Institute found that enterprises using a semantic layer see their business outcomes improve by four times, while reducing their data and analytics costs. Extending taxonomies to a semantic layer can be a game-changing solution, allowing you to connect information silos, alleviate knowledge gaps, and derive new insights. Hedden, who specializes in taxonomy design and implementation, presented how the value of taxonomies shouldn’t reside in silos but be integrated with ontologies into a semantic layer. Learn about: - The essence and purpose of taxonomies and ontologies in information and knowledge management; - Advantages of semantic layers leveraging organizational taxonomies; and - Components and approaches to creating a semantic layer, including the integration of taxonomies and ontologies

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Enterprise Knowledge

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Neo4j

Recently uploaded (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx

08448380779 Call Girls In Civil Lines Women Seeking Men

CNv6 Instructor Chapter 6 Quality of Service

Factors to Consider When Choosing Accounts Payable Services Providers.pptx

GenCyber Cyber Security Day Presentation

Driving Behavioral Change for Information Management through Data-Driven Gree...

What Are The Drone Anti-jamming Systems Technology?

Handwritten Text Recognition for manuscripts and early printed texts

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Automating Google Workspace (GWS) & more with Apps Script

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Slack Application Development 101 Slides

Scaling API-first – The story of a global engineering organization

🐬 The future of MySQL is Postgres 🐘

Advantages of Hiring UIUX Design Service Providers for Your Business

Boost PC performance: How more available memory can improve productivity

2024: Domino Containers - The Next Step. News from the Domino Container commu...

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...

Meta-regression with DisMod-MR: how robust is the model?

1. Meta-regression with DisMod-MR: how robust is the model? June 18, 2013 Hannah M Peterson Post-Bachelor Fellow

2. Global Burden of Disease Study 2010 2

3. YLDs • Measures morbidity • Requires age-specific prevalence o For 291 outcomes o For 2 sexes o For 187 countries o For 3 years 3

4. Is negative-binomial distribution the best choice? DisMod-MR 4

5. Alternative distributions 5 Distribution Probability Density Function Normal Lognormal Binomial Negative- binomial

6. Alternative distributions 6 Distribution Probability Density Function Normal Lognormal Binomial Negative- binomial

7. Alternative distributions 7 Distribution Probability Density Function Normal Lognormal Binomial Negative- binomial

8. Alternative distributions 8 Distribution Probability Density Function Normal Lognormal Binomial Negative- binomial

9. Potential experimental frameworks • Data collection o Ideal o Impractical • Simulation o Impossible to know true data distribution • Out-of-sample cross validation o Do not have to choose distribution 9

10. Out-of-sample cross validation 10

11. Out-of-sample predictive validity • Randomly select 25% of data to use as “test data” 11

12. Out-of-sample predictive validity • Randomly select 25% of data to use as “test data” 12

13. Out-of-sample predictive validity • Randomly select 25% of data to use as “test data” • Fit the remaining 75% of data (“training data”) 13

14. Out-of-sample predictive validity • Randomly select 25% of data to use as “test data” • Fit the remaining 75% of data (“training data”) • Use fit to calculate statistics for test data 14

15. Out-of-sample predictive validity • Randomly select 25% of data to use as “test data” • Fit the remaining 75% of data (“training data”) • Use fit to calculate statistics for test data • For each distribution • For 1000 test-train splits • For each disease data set 15

16. Comparing distributions 16 How to determine the best distribution?

17. Metrics of evaluation • 17

18. Results 18 Percent of wins (%) Distribution Bias MAE PC Total Normal 22.1 20.6 34.6 25.7 Lognormal 29.7 13.0 36.5 26.4 Binomial 26.3 48.3 1.9 25.5 Negative- binomial 21.9 18.1 27.1 22.4

19. Conclusions • Choice of distribution doesn’t greatly influence results • Best overall performance: lognormal distribution o Contingent on method to adjust data whose value is 0 • Further investigate when each distribution performs best o Dependent on number of covariates, priors, amount of data? 19

20. Thank you Hannah Peterson peterhm@uw.edu www.healthmetricsandevaluation.org

Editor's Notes

Global Burden of Disease Study 2010 (GBD)-huge endeavor to measure health loss from disease, injuries, and risk using the Disability Adjusted Life Year (DALY)-coarsely described in the this 18-step process-I am just going to focus on a small subsection, the calculation of DALYs for injuries and disease-further narrow focus to the calculation of YLDsfigure:Murray, Ezzati, et. al. 2013. “GBD 2010: design, definitions, and metrics”. The Lancet. 380(9859):2063-2066.
-YLDsmeasure morbidity, or years lived in less than full health-the YLD calculation needs age-specific prevalence estimates, for GBD, this means ---for 291 outcomes ---for 2 sexes---for 187 countries---for 3 years-however prevalence data is often less than ideal, -examples all available data in Western Europe for GDB2010 Study---sparse (fungal diseases) ---noisy (lower back pain) ---sparse and noisy (cannabis dependence data)-to calculate age-specific prevalence, used a tool called DisMod-MR
-DisMod-MR is designed to address missing data and inconsistency ---used epidemiologic data and covariate data to calculate the age-specific prevalence based on a negative-binomial distribution---assumes all epidemiological data follows a negative-binomial distribution-is it really the best distribution to model the epidemiologic data?figure: Vos, Flaxman, et. al. 2013. “Years lived with disability (YLDs) for 1160 sequelae of 289 diseases and injuries 1990-2010: a systematic analysis for the Global Burden of Disease Study 2010”. The Lancet. 380(9859):2163-2196.
Normal𝜇=𝑚𝑒𝑎𝑛𝜎=𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛-mathematically convenient-PROBLEM: allows negative estimates of prevalence, physiological impossibleNegative-binomial𝑁=𝑖𝑛𝑑𝑖𝑣𝑖𝑑𝑢𝑎𝑙𝑠 𝑡𝑒𝑠𝑡𝑒𝑑𝑥=𝑡𝑒𝑠𝑡𝑒𝑑 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑝=𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑡𝑦discrete modeltransformation yields an overdispersion parameter which allows the standard deviation to vary
Lognormal𝜇=𝑚𝑒𝑎𝑛𝜎=𝑠𝑡𝑎𝑛𝑑𝑎𝑟𝑑 𝑑𝑒𝑣𝑖𝑎𝑡𝑖𝑜𝑛-bounds estimates at 0-PROBLEM: doesn’t allow prevalence to be 0---can’t take the log of 0-changed values of 0 to be 1 observation-other options would be to use an offset lognormal distribution-but somehow, have to work around estimates of 0Negative-binomial𝑁=𝑖𝑛𝑑𝑖𝑣𝑖𝑑𝑢𝑎𝑙𝑠 𝑡𝑒𝑠𝑡𝑒𝑑𝑥=𝑡𝑒𝑠𝑡𝑒𝑑 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑝=𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑡𝑦discrete modeltransformation yields an overdispersion parameter which allows the standard deviation to vary
Binomial-which Dr. Flaxman already discussed-discrete model𝑁=𝑖𝑛𝑑𝑖𝑣𝑖𝑑𝑢𝑎𝑙𝑠 𝑡𝑒𝑠𝑡𝑒𝑑𝑥=𝑡𝑒𝑠𝑡𝑒𝑑 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑝=𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑡𝑦Negative-binomial𝑁=𝑖𝑛𝑑𝑖𝑣𝑖𝑑𝑢𝑎𝑙𝑠 𝑡𝑒𝑠𝑡𝑒𝑑𝑥=𝑡𝑒𝑠𝑡𝑒𝑑 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑝=𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑡𝑦discrete modeltransformation yields an overdispersion parameter which allows the standard deviation to vary
Negative-binomial𝑁=𝑖𝑛𝑑𝑖𝑣𝑖𝑑𝑢𝑎𝑙𝑠 𝑡𝑒𝑠𝑡𝑒𝑑𝑥=𝑡𝑒𝑠𝑡𝑒𝑑 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑝=𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑡𝑦discrete modeltransformation yields an overdispersion parameter which allows the standard deviation to varyNegative-binomial𝑁=𝑖𝑛𝑑𝑖𝑣𝑖𝑑𝑢𝑎𝑙𝑠 𝑡𝑒𝑠𝑡𝑒𝑑𝑥=𝑡𝑒𝑠𝑡𝑒𝑑 𝑝𝑜𝑠𝑖𝑡𝑖𝑣𝑒𝑝=𝑝𝑟𝑜𝑏𝑎𝑏𝑖𝑙𝑡𝑦discrete modeltransformation yields an overdispersion parameter which allows the standard deviation to vary
Several ways to test which distribution is the best-ideal-data collection---actually go to country (region??) and measure age-specific prevalence---expensiveimpractical-simulation---great for testing, not for validation---problem: have to choose from what distribution the simulated data/measurements come------this is what we’re testing------simulation can showwhatever you want------impossible to know from what distribution measurement-out-of-sample cross validation---way to evaluate and compare distributions---shows how model performs in real life------can test out-of-sample predictive validity------don’t have to choose data distribution---concerns------unstable with sparse data-----------not just the epidemiologic data-----------also covariates and priors
This experiment-57 different disease data sets---met inclusion criteria of more than 4 prevalence points in western europe---not a birth-condition meaning prevalence data is only at age 0-restricted to Western EuropeTo explain out-of-sample cross validation usedan example from GBD2010fungal diseases
Randomly select 25% of data to withhold as test datatest data used to evaluate results
Test data is withheld from DisMod-MR
And the remaining data is fit
From the fit, these estimates are compared to the test dataThis comparison of the estimate to the test data is where the statistics are calculatedthe same test-train split fits are created for each of the distribution so we can make a comparison
-process repeated 1000 times with different test-train splits-repeated for 57 different disease data set---met inclusion criteria of more than 4 prevalence points in western europe---not a birth-condition meaning prevalence data is only at age 057 disease/injury conditions met this criteria
metrics that capture different aspects of model performanceWant a model that is precise, accurate, well-calibrated -precise (bias)---measures average difference between the test data and prediction-accurate (median absolute error-MAE)---measure of overall error---many small errors create one large number---sensitive to mean and scale---less sensitive to outliers-calibrated (percent coverage-PC)---calibrated, meaning that our estimates are in the correct range of values------if we aim for 95% uncertainty, we expect 95% of our estimates to be good------more than that and the model is over confident------less than that and the model isn’t very good---percent of time the uncertainty interval of the prediction contains the observation---sensitive to discrete distributionsto determine which distribution performed the best, counted the the winner for each disease data set and split
-for different metrics different distributions are superior---makes sense, since each distribution has it’s strengths and weaknesses---smallest bias: lognormal---minimum MAE: binomial---closest percent coverage: lognormal-concern about most frequent results and not raw numbers:---differences are small ------bias, ten-thousandths (E-4), average bias is negative binomial------mae, hundreds-overall winner: lognormal
-previously saw, distribution choice doesn’t greatly influence DisMod-MR’s estimates of age-specific prev-results differ by metric-Best overall performance: lognormal distribution---STRESS:Contingent on method to adjust data whose value is 0-Further investigate when each distribution performs best---Dependent on number of covariates, priors, amount of data?DisMod-MR is robust in that choice of distribution for epidemiological values does not greatly influence estimates, but one distribution performs the best most frequently

Meta-regression with DisMod-MR: how robust is the model?

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

More from Institute for Health Metrics and Evaluation - University of Washington

More from Institute for Health Metrics and Evaluation - University of Washington (20)

Recently uploaded

Recently uploaded (20)

Meta-regression with DisMod-MR: how robust is the model?

Editor's Notes