SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
Sampling and statistical
power
Devesh Roy (on behalf of IFPRI-IFAD team)
Questionnaire of impact surveys
• Module A – Identification –cluster, household id, cluster
• Module B- Household demographics- sex, age, literacy
• Module C-Survey Questions- Dwellings, Drinking water supply,
Sanitation, Food security, Asset related questions, farming and
livestock questions, Anthropometry
Important details underlying this
questionnaire
• No tracking of households- straightforward identification module
• Limited set of household characteristics
• Sample size exogenously fixed and simple random sampling proposed
but effects across strata envisioned
Some elements of robust IE–leading to RIMS+
•Choosing the Sample
• Issue -what data needed and the sample required to precisely estimate differences in
outcomes between the treatment group and the comparison group.
• Determine both the sample size and how to draw the units in the sample from a
population of interest.
• What Kinds of Data Do I Need?
• Need SMART data- specific, measurable, attributable, realistic, and targeted.
• Good quality data are required to assess the impact of the intervention on the
outcomes of interest.
• The IE should not measure only outcomes for which the program is directly
accountable. Outcome indicators indirectly affected or indicators capturing unintended
program impact will maximize the value of the information that the IE generates.
Primer on sample and nature of data
• some indicators may not be amenable to IE in small samples. Detecting
impacts for outcomes that are
• extremely variable,
• rare events,
• or that are likely to be only marginally affected by an intervention may require
prohibitively large samples.
• Example Identifying the impact of an intervention on maternal mortality
rates will be feasible only in a sample that contains many pregnant women.
• Data on exogenous factors that may affect the outcome of interest. These
make it possible to control for outside influences.
• Data on other characteristics. Including additional controls or analyzing the
heterogeneity of the program’s effects along certain characteristics
Role of existing data
• To set benchmark value of indicators
• Power calculation for getting the minimum sample size
• Here in IFAD projects- One might not need to do power calculations
but get some sense of when you would need a larger sample vis-à-vis
smaller sample
How large the sample must be?
• Associated calculations are called power calculations.
• Avoid collecting too much data as well as too few data
• Remember too few data - If the sample is too small, you may not be
able to detect positive impact and may thus conclude that the
program had no effect
• Assume for simplicity that all who are intended to be beneficiary take
part and those intended to be non-beneficiary remain so.
Power calculations
• Most impact evaluations test a simple hypothesis
• Does the program have an impact? In other words, Is the program
impact different from zero? Answering this question requires two
steps:
• 1. Estimate the average outcomes for the beneficiary and non-
beneficiary groups.
• 2. Assess whether a difference exists between the average outcome
for the treatment beneficiary group and the average outcome for the
non-beneficiary group.
Large versus small sample (Large sample reduces
the chance of being unlucky (Gertler et al 2010)
Consider some related example
• Take a nutrition program
• Take anthropometric measure of the beneficiary and non-beneficiary
• Take a sample of 2 children (beneficiary and non- beneficiary) and do it many
times
• The estimates from the different samples taken repeatedly will bounce a lot-
implies the estimates are unreliable
• Take a children sample of 100 and repeat it many times –
• What do you see- which estimate bounces much more ?
• It is the smaller sample
Errors in Impact evaluation
• Type 1 error and type 2 error
• Type 1 error (conclude that average height in the beneficiary group is
higher than in non-beneficiary group when in fact it is not)
• Type 2 error (conclude that average height in the beneficiary group is no
different than in non-beneficiary group when in fact it is actually different)
• Likelihood of a type 1 error is called confidence level – usually set at 5
percent i.e. you would be 95 percent confident that program had an impact
• Many factors affect the likelihood of committing a type 2 error but sample
size is crucial- When a sample is large it is less likely that average height or
weight of children in the two groups is equal just by luck
Power calculation and errors in IE (Gertler et
al 2010)
• The statistical power of an impact evaluation is the probability that it will
detect a difference between the beneficiary and non-beneficiary groups
when in fact one exists. An impact evaluation has a high power if there is a
low risk of not detecting real program impacts, that is, of committing a
type II error.
• Under high power unlikely to be disappointed by results showing that the
program being evaluated has had no impact, when in reality it did have an
impact.
• From a policy perspective, underpowered impact evaluations are costly
• If you were to conclude that the program was not effective, even though it
was, policy makers would be likely to close down a good program.
• Carrying out power calculations could be crucial and relevant.
Steps in power calculations (Gertler et al
2010)
• Does the program create clusters?
• What is the outcome indicator?
• Do you aim to compare program impacts between subgroups?
• What is the minimum level of impact that would justify the
investment that has been made in the intervention?
• What is a reasonable level of power for the evaluation being
conducted?
• What are the baseline mean and variance of the outcome indicators?
Power calculation: Continued
• No clusters case- Intervention at the level at which impacts are observed (some treatments given
at school level and outcomes observed at the student level would comprise clusters)
• No clusters- take a random sample out of the entire population
• Identify the most important indicators that you want to evaluate
• If there are sub-groups (like SC/ST caste groups) then the sample size would be larger
• Then the sample size of the effect to be determined would be lager
• For an evaluation to identify a small impact, estimates of any difference in mean outcomes
between the treatment and comparison groups will need to be very precise, requiring a large
sample.
• Population baseline mean and variance
• Usual power of 80 percent is the norm
• Many statistical software can do the power calculations once these parameters are known
• Programs that create clusters have a different issue with power calculations
Power calculations with clusters
• Some programs assign benefits at the cluster level
• Here the principle is for sample size- number of clusters matters more
than number of households within clusters
• A sufficient number of clusters is required to test convincingly
whether a program has had an impact by comparing outcomes
• Compared to the steps before just add how variable are the outcomes
within clusters?
• Think within village incomes are same but different across villages-
adding an individual from another village would add more statistical
power
INTERNATIONAL FOOD POLICY RESEARCH INSTITUTE
Example: test of difference of means in two
populations
• The equation for sample size is derived from the
equation for the statistical test
• In a t-test the equation for the test is
t = (x1 - x2) - (m1 - m2)
(s1
2 n+ s2
2 n)12
• The derived equation for sample size is
n = (z1-/2 + z1-b)2(s1
2 + s2
2)
(m1 - m2)2
What next after sample size?
• A Sampling Strategy
• Steps in sampling
• 1. Determine the population of interest (eg children under certain
age).
• 2. Identify a sampling frame- should coincide with population of
interest but sometimes it does not.
• 3. Draw as many units from the sampling frame as required by power
calculations.
• Choose the sampling method
Sampling method
• Probability sampling (PS) methods-most rigorous- they assign a well-
defined probability of each unit’s being drawn. The 3 main PS methods are:
• Random sampling. Every unit in the population has exactly the same
probability of being drawn.
• Stratified random sampling. The population is divided into groups (for
example, male and female) and random sampling is performed within each
group (essential for comparing impacts in sub groups)
• Cluster sampling. Units are grouped in clusters, and a random sample of
clusters is drawn, after which either all units in those clusters constitute the
sample or a number of units within the cluster are randomly drawn. This
means that each cluster has a well-defined probability of being selected,
and units within a selected cluster also have a well-defined probability of
being drawn.
Sampling method and impact evaluation
• Drawing a sample depends on the rules of eligibility in the program
• Usually interventions take place at cluster level
• Implies in these cases should go for cluster sampling
• Under any cost avoid non-probabilistic like purposive or convenience
sampling

Contenu connexe

Tendances

A survey of random forest based methods for
A survey of random forest based methods forA survey of random forest based methods for
A survey of random forest based methods forNikhil Sharma
 
Impact Evaluation Overview
Impact Evaluation OverviewImpact Evaluation Overview
Impact Evaluation OverviewPhilip Jakob
 
Causal inference in practice
Causal inference in practiceCausal inference in practice
Causal inference in practiceAmit Sharma
 
What is impact evaluation?
What is impact evaluation?What is impact evaluation?
What is impact evaluation?David Evans
 
Forecasting Elections from Voters’ Perceptions
Forecasting Elections from Voters’ Perceptions Forecasting Elections from Voters’ Perceptions
Forecasting Elections from Voters’ Perceptions agraefe
 
Biomarker Strategies
Biomarker StrategiesBiomarker Strategies
Biomarker StrategiesTom Plasterer
 
Analytic Methods and Issues in CER from Observational Data
Analytic Methods and Issues in CER from Observational DataAnalytic Methods and Issues in CER from Observational Data
Analytic Methods and Issues in CER from Observational DataCTSI at UCSF
 
How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...
How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...
How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...Stockholm Institute of Transition Economics
 
Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...Joao Galdino Mello de Souza
 
Machine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talkMachine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talkGabriel Hughes PhD
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Amit Sharma
 
Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAmit Sharma
 
data management and analysis
 data management and analysis data management and analysis
data management and analysisabdullahi mohamed
 
Aron chpt 7 ed effect size
Aron chpt 7 ed effect sizeAron chpt 7 ed effect size
Aron chpt 7 ed effect sizeKaren Price
 
Analyze Phase Roadmap (Level 3)
Analyze Phase Roadmap (Level 3)Analyze Phase Roadmap (Level 3)
Analyze Phase Roadmap (Level 3)Matt Hansen
 
The Project Impact Pathway
The Project Impact PathwayThe Project Impact Pathway
The Project Impact PathwayPhilip Jakob
 

Tendances (20)

A survey of random forest based methods for
A survey of random forest based methods forA survey of random forest based methods for
A survey of random forest based methods for
 
Impact Evaluation Overview
Impact Evaluation OverviewImpact Evaluation Overview
Impact Evaluation Overview
 
Causal inference in practice
Causal inference in practiceCausal inference in practice
Causal inference in practice
 
Impact evaluation an overview
Impact evaluation  an overviewImpact evaluation  an overview
Impact evaluation an overview
 
What is impact evaluation?
What is impact evaluation?What is impact evaluation?
What is impact evaluation?
 
Forecasting Elections from Voters’ Perceptions
Forecasting Elections from Voters’ Perceptions Forecasting Elections from Voters’ Perceptions
Forecasting Elections from Voters’ Perceptions
 
Biomarker Strategies
Biomarker StrategiesBiomarker Strategies
Biomarker Strategies
 
How we can use impact evaluation to assure effective use of resources for dev...
How we can use impact evaluation to assure effective use of resources for dev...How we can use impact evaluation to assure effective use of resources for dev...
How we can use impact evaluation to assure effective use of resources for dev...
 
Analytic Methods and Issues in CER from Observational Data
Analytic Methods and Issues in CER from Observational DataAnalytic Methods and Issues in CER from Observational Data
Analytic Methods and Issues in CER from Observational Data
 
How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...
How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...
How Much Can We Generalize? Measuring the External Validity of Impact Evaluat...
 
Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...Did Something Change? - Using Statistical Techniques to Interpret Service and...
Did Something Change? - Using Statistical Techniques to Interpret Service and...
 
Machine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talkMachine learning meets user analytics - Metageni tech talk
Machine learning meets user analytics - Metageni tech talk
 
Propensity Score Matching Methods
Propensity Score Matching MethodsPropensity Score Matching Methods
Propensity Score Matching Methods
 
On Samples And Sampling
On Samples And SamplingOn Samples And Sampling
On Samples And Sampling
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...
 
Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal Models
 
data management and analysis
 data management and analysis data management and analysis
data management and analysis
 
Aron chpt 7 ed effect size
Aron chpt 7 ed effect sizeAron chpt 7 ed effect size
Aron chpt 7 ed effect size
 
Analyze Phase Roadmap (Level 3)
Analyze Phase Roadmap (Level 3)Analyze Phase Roadmap (Level 3)
Analyze Phase Roadmap (Level 3)
 
The Project Impact Pathway
The Project Impact PathwayThe Project Impact Pathway
The Project Impact Pathway
 

En vedette (12)

Seminario 8
Seminario 8Seminario 8
Seminario 8
 
Prueba T de student
Prueba T de studentPrueba T de student
Prueba T de student
 
Estadística: T de student
Estadística: T de studentEstadística: T de student
Estadística: T de student
 
Student t t est
Student t t estStudent t t est
Student t t est
 
Referencias Organizadores Gráficos
Referencias Organizadores GráficosReferencias Organizadores Gráficos
Referencias Organizadores Gráficos
 
Pruebas paramétricas y no paramétricas
Pruebas paramétricas y no paramétricasPruebas paramétricas y no paramétricas
Pruebas paramétricas y no paramétricas
 
Prueba t student
Prueba t studentPrueba t student
Prueba t student
 
T-test (Revised version)
T-test (Revised version)T-test (Revised version)
T-test (Revised version)
 
T student2
T student2T student2
T student2
 
T de student para dos muestras independientes
T de student para dos muestras independientesT de student para dos muestras independientes
T de student para dos muestras independientes
 
Analisis parametricos-y-no-parametricos
Analisis parametricos-y-no-parametricosAnalisis parametricos-y-no-parametricos
Analisis parametricos-y-no-parametricos
 
T student 5 ejemplos beeto
T student 5 ejemplos beetoT student 5 ejemplos beeto
T student 5 ejemplos beeto
 

Similaire à IFPRI- Impact Surveys 1

Sample-size-comprehensive.pptx
Sample-size-comprehensive.pptxSample-size-comprehensive.pptx
Sample-size-comprehensive.pptxssuser4eb7dd
 
3. How to Randomize
3. How to Randomize3. How to Randomize
3. How to Randomizevinhthedang
 
Sample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact EvaluationsSample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact EvaluationsMarcos Vera
 
PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,Naveen K L
 
8 sampling & sample size (Dr. Mai,2014)
8  sampling & sample size (Dr. Mai,2014)8  sampling & sample size (Dr. Mai,2014)
8 sampling & sample size (Dr. Mai,2014)Phong Đá
 
Sample size calculation
Sample  size calculationSample  size calculation
Sample size calculationSwati Singh
 
What is Evaluation
What is EvaluationWhat is Evaluation
What is Evaluationclearsateam
 
Ipac 2014
Ipac 2014Ipac 2014
Ipac 2014cwhms
 
Gilligan quantitative impact eval methods
Gilligan quantitative impact eval methodsGilligan quantitative impact eval methods
Gilligan quantitative impact eval methodsgenderassets
 
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual EventKey Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual EventMEASURE Evaluation
 
Bio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchBio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchShinjan Patra
 
Chapter 10Data Interpretation IssuesLearning Objec.docx
Chapter 10Data Interpretation IssuesLearning Objec.docxChapter 10Data Interpretation IssuesLearning Objec.docx
Chapter 10Data Interpretation IssuesLearning Objec.docxketurahhazelhurst
 
Basic Statistical Concepts.pdf
Basic Statistical Concepts.pdfBasic Statistical Concepts.pdf
Basic Statistical Concepts.pdfKwangheeJung
 

Similaire à IFPRI- Impact Surveys 1 (20)

Sample-size-comprehensive.pptx
Sample-size-comprehensive.pptxSample-size-comprehensive.pptx
Sample-size-comprehensive.pptx
 
What have we learnt from randomized control trials
What have we learnt from randomized control trialsWhat have we learnt from randomized control trials
What have we learnt from randomized control trials
 
3. How to Randomize
3. How to Randomize3. How to Randomize
3. How to Randomize
 
ICAR-IFPRI: Revisiting and other issues - Devesh Roy
ICAR-IFPRI: Revisiting and other issues - Devesh RoyICAR-IFPRI: Revisiting and other issues - Devesh Roy
ICAR-IFPRI: Revisiting and other issues - Devesh Roy
 
Sample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact EvaluationsSample Size Calculations for Impact Evaluations
Sample Size Calculations for Impact Evaluations
 
PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,PPT on Sample Size, Importance of Sample Size,
PPT on Sample Size, Importance of Sample Size,
 
8 sampling & sample size (Dr. Mai,2014)
8  sampling & sample size (Dr. Mai,2014)8  sampling & sample size (Dr. Mai,2014)
8 sampling & sample size (Dr. Mai,2014)
 
Sample size calculation
Sample  size calculationSample  size calculation
Sample size calculation
 
Sample size determination
Sample size determinationSample size determination
Sample size determination
 
What is Evaluation
What is EvaluationWhat is Evaluation
What is Evaluation
 
Ipac 2014
Ipac 2014Ipac 2014
Ipac 2014
 
Session 3 sample design
Session 3   sample designSession 3   sample design
Session 3 sample design
 
Gilligan quantitative impact eval methods
Gilligan quantitative impact eval methodsGilligan quantitative impact eval methods
Gilligan quantitative impact eval methods
 
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual EventKey Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
Key Issues in Impact Evaluation: A MEET and GEMNet-Health Virtual Event
 
Sampling
SamplingSampling
Sampling
 
Bio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical researchBio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical research
 
Sampling method
Sampling methodSampling method
Sampling method
 
Chapter 10Data Interpretation IssuesLearning Objec.docx
Chapter 10Data Interpretation IssuesLearning Objec.docxChapter 10Data Interpretation IssuesLearning Objec.docx
Chapter 10Data Interpretation IssuesLearning Objec.docx
 
Basic Statistical Concepts.pdf
Basic Statistical Concepts.pdfBasic Statistical Concepts.pdf
Basic Statistical Concepts.pdf
 
ICAR-IFPRI : Problems of Impact Evaluation Confounding Factors and Selection ...
ICAR-IFPRI : Problems of Impact Evaluation Confounding Factors and Selection ...ICAR-IFPRI : Problems of Impact Evaluation Confounding Factors and Selection ...
ICAR-IFPRI : Problems of Impact Evaluation Confounding Factors and Selection ...
 

Plus de International Food Policy Research Institute- South Asia Office

Plus de International Food Policy Research Institute- South Asia Office (20)

8. Dr Rao.pdf
8. Dr Rao.pdf8. Dr Rao.pdf
8. Dr Rao.pdf
 
13. Manish Patel.pdf
13. Manish Patel.pdf13. Manish Patel.pdf
13. Manish Patel.pdf
 
15. Smita Sirohi.pdf
15. Smita Sirohi.pdf15. Smita Sirohi.pdf
15. Smita Sirohi.pdf
 
6. CD Mayee.pdf
6. CD Mayee.pdf6. CD Mayee.pdf
6. CD Mayee.pdf
 
10. Keshavulu_1.pdf
10. Keshavulu_1.pdf10. Keshavulu_1.pdf
10. Keshavulu_1.pdf
 
12. Swati Nayak.pdf
12. Swati Nayak.pdf12. Swati Nayak.pdf
12. Swati Nayak.pdf
 
16. Dr Anjani.pdf
16. Dr Anjani.pdf16. Dr Anjani.pdf
16. Dr Anjani.pdf
 
9. Malavika Dadlani_1.pdf
9. Malavika Dadlani_1.pdf9. Malavika Dadlani_1.pdf
9. Malavika Dadlani_1.pdf
 
7. Neeru Bhooshan.pdf
7. Neeru Bhooshan.pdf7. Neeru Bhooshan.pdf
7. Neeru Bhooshan.pdf
 
4. Raj Ganesh.pdf
4. Raj Ganesh.pdf4. Raj Ganesh.pdf
4. Raj Ganesh.pdf
 
11. Surinder K Tikoo_1.pdf
11. Surinder K Tikoo_1.pdf11. Surinder K Tikoo_1.pdf
11. Surinder K Tikoo_1.pdf
 
3. DK Yadava.pdf
3. DK Yadava.pdf3. DK Yadava.pdf
3. DK Yadava.pdf
 
14. Paresh Verma_1.pdf
14. Paresh Verma_1.pdf14. Paresh Verma_1.pdf
14. Paresh Verma_1.pdf
 
5. Ram Kaundinya.pdf
5. Ram Kaundinya.pdf5. Ram Kaundinya.pdf
5. Ram Kaundinya.pdf
 
1. Dr Anjani.pdf
1. Dr Anjani.pdf1. Dr Anjani.pdf
1. Dr Anjani.pdf
 
2. David Spielman.pdf
2. David Spielman.pdf2. David Spielman.pdf
2. David Spielman.pdf
 
GFPR 2021 South Asia Launch Ppt - Dr. Shahidur Rashid
GFPR 2021 South Asia Launch Ppt - Dr. Shahidur RashidGFPR 2021 South Asia Launch Ppt - Dr. Shahidur Rashid
GFPR 2021 South Asia Launch Ppt - Dr. Shahidur Rashid
 
GFPR 2021 South Asia Launch Ppt - Dr. Johan Swinnen
GFPR 2021 South Asia Launch Ppt - Dr. Johan SwinnenGFPR 2021 South Asia Launch Ppt - Dr. Johan Swinnen
GFPR 2021 South Asia Launch Ppt - Dr. Johan Swinnen
 
Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...
Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...
Book Launch : Agricultural Transformation in Nepal: Trends, Prospects and Pol...
 
Understanding the landscape of pulse policy in India and implications for trade
Understanding the landscape of pulse policy in India and implications for tradeUnderstanding the landscape of pulse policy in India and implications for trade
Understanding the landscape of pulse policy in India and implications for trade
 

Dernier

Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are successPratikSingh115843
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etclalithasri22
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 

Dernier (17)

Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 
Presentation of project of business person who are success
Presentation of project of business person who are successPresentation of project of business person who are success
Presentation of project of business person who are success
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
DATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etcDATA ANALYSIS using various data sets like shoping data set etc
DATA ANALYSIS using various data sets like shoping data set etc
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use2023 Survey Shows Dip in High School E-Cigarette Use
2023 Survey Shows Dip in High School E-Cigarette Use
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 

IFPRI- Impact Surveys 1

  • 1. Sampling and statistical power Devesh Roy (on behalf of IFPRI-IFAD team)
  • 2. Questionnaire of impact surveys • Module A – Identification –cluster, household id, cluster • Module B- Household demographics- sex, age, literacy • Module C-Survey Questions- Dwellings, Drinking water supply, Sanitation, Food security, Asset related questions, farming and livestock questions, Anthropometry
  • 3. Important details underlying this questionnaire • No tracking of households- straightforward identification module • Limited set of household characteristics • Sample size exogenously fixed and simple random sampling proposed but effects across strata envisioned
  • 4. Some elements of robust IE–leading to RIMS+ •Choosing the Sample • Issue -what data needed and the sample required to precisely estimate differences in outcomes between the treatment group and the comparison group. • Determine both the sample size and how to draw the units in the sample from a population of interest. • What Kinds of Data Do I Need? • Need SMART data- specific, measurable, attributable, realistic, and targeted. • Good quality data are required to assess the impact of the intervention on the outcomes of interest. • The IE should not measure only outcomes for which the program is directly accountable. Outcome indicators indirectly affected or indicators capturing unintended program impact will maximize the value of the information that the IE generates.
  • 5. Primer on sample and nature of data • some indicators may not be amenable to IE in small samples. Detecting impacts for outcomes that are • extremely variable, • rare events, • or that are likely to be only marginally affected by an intervention may require prohibitively large samples. • Example Identifying the impact of an intervention on maternal mortality rates will be feasible only in a sample that contains many pregnant women. • Data on exogenous factors that may affect the outcome of interest. These make it possible to control for outside influences. • Data on other characteristics. Including additional controls or analyzing the heterogeneity of the program’s effects along certain characteristics
  • 6. Role of existing data • To set benchmark value of indicators • Power calculation for getting the minimum sample size • Here in IFAD projects- One might not need to do power calculations but get some sense of when you would need a larger sample vis-à-vis smaller sample
  • 7. How large the sample must be? • Associated calculations are called power calculations. • Avoid collecting too much data as well as too few data • Remember too few data - If the sample is too small, you may not be able to detect positive impact and may thus conclude that the program had no effect • Assume for simplicity that all who are intended to be beneficiary take part and those intended to be non-beneficiary remain so.
  • 8. Power calculations • Most impact evaluations test a simple hypothesis • Does the program have an impact? In other words, Is the program impact different from zero? Answering this question requires two steps: • 1. Estimate the average outcomes for the beneficiary and non- beneficiary groups. • 2. Assess whether a difference exists between the average outcome for the treatment beneficiary group and the average outcome for the non-beneficiary group.
  • 9. Large versus small sample (Large sample reduces the chance of being unlucky (Gertler et al 2010)
  • 10. Consider some related example • Take a nutrition program • Take anthropometric measure of the beneficiary and non-beneficiary • Take a sample of 2 children (beneficiary and non- beneficiary) and do it many times • The estimates from the different samples taken repeatedly will bounce a lot- implies the estimates are unreliable • Take a children sample of 100 and repeat it many times – • What do you see- which estimate bounces much more ? • It is the smaller sample
  • 11. Errors in Impact evaluation • Type 1 error and type 2 error • Type 1 error (conclude that average height in the beneficiary group is higher than in non-beneficiary group when in fact it is not) • Type 2 error (conclude that average height in the beneficiary group is no different than in non-beneficiary group when in fact it is actually different) • Likelihood of a type 1 error is called confidence level – usually set at 5 percent i.e. you would be 95 percent confident that program had an impact • Many factors affect the likelihood of committing a type 2 error but sample size is crucial- When a sample is large it is less likely that average height or weight of children in the two groups is equal just by luck
  • 12. Power calculation and errors in IE (Gertler et al 2010) • The statistical power of an impact evaluation is the probability that it will detect a difference between the beneficiary and non-beneficiary groups when in fact one exists. An impact evaluation has a high power if there is a low risk of not detecting real program impacts, that is, of committing a type II error. • Under high power unlikely to be disappointed by results showing that the program being evaluated has had no impact, when in reality it did have an impact. • From a policy perspective, underpowered impact evaluations are costly • If you were to conclude that the program was not effective, even though it was, policy makers would be likely to close down a good program. • Carrying out power calculations could be crucial and relevant.
  • 13. Steps in power calculations (Gertler et al 2010) • Does the program create clusters? • What is the outcome indicator? • Do you aim to compare program impacts between subgroups? • What is the minimum level of impact that would justify the investment that has been made in the intervention? • What is a reasonable level of power for the evaluation being conducted? • What are the baseline mean and variance of the outcome indicators?
  • 14. Power calculation: Continued • No clusters case- Intervention at the level at which impacts are observed (some treatments given at school level and outcomes observed at the student level would comprise clusters) • No clusters- take a random sample out of the entire population • Identify the most important indicators that you want to evaluate • If there are sub-groups (like SC/ST caste groups) then the sample size would be larger • Then the sample size of the effect to be determined would be lager • For an evaluation to identify a small impact, estimates of any difference in mean outcomes between the treatment and comparison groups will need to be very precise, requiring a large sample. • Population baseline mean and variance • Usual power of 80 percent is the norm • Many statistical software can do the power calculations once these parameters are known • Programs that create clusters have a different issue with power calculations
  • 15. Power calculations with clusters • Some programs assign benefits at the cluster level • Here the principle is for sample size- number of clusters matters more than number of households within clusters • A sufficient number of clusters is required to test convincingly whether a program has had an impact by comparing outcomes • Compared to the steps before just add how variable are the outcomes within clusters? • Think within village incomes are same but different across villages- adding an individual from another village would add more statistical power INTERNATIONAL FOOD POLICY RESEARCH INSTITUTE Example: test of difference of means in two populations • The equation for sample size is derived from the equation for the statistical test • In a t-test the equation for the test is t = (x1 - x2) - (m1 - m2) (s1 2 n+ s2 2 n)12 • The derived equation for sample size is n = (z1-/2 + z1-b)2(s1 2 + s2 2) (m1 - m2)2
  • 16. What next after sample size? • A Sampling Strategy • Steps in sampling • 1. Determine the population of interest (eg children under certain age). • 2. Identify a sampling frame- should coincide with population of interest but sometimes it does not. • 3. Draw as many units from the sampling frame as required by power calculations. • Choose the sampling method
  • 17. Sampling method • Probability sampling (PS) methods-most rigorous- they assign a well- defined probability of each unit’s being drawn. The 3 main PS methods are: • Random sampling. Every unit in the population has exactly the same probability of being drawn. • Stratified random sampling. The population is divided into groups (for example, male and female) and random sampling is performed within each group (essential for comparing impacts in sub groups) • Cluster sampling. Units are grouped in clusters, and a random sample of clusters is drawn, after which either all units in those clusters constitute the sample or a number of units within the cluster are randomly drawn. This means that each cluster has a well-defined probability of being selected, and units within a selected cluster also have a well-defined probability of being drawn.
  • 18. Sampling method and impact evaluation • Drawing a sample depends on the rules of eligibility in the program • Usually interventions take place at cluster level • Implies in these cases should go for cluster sampling • Under any cost avoid non-probabilistic like purposive or convenience sampling