SlideShare une entreprise Scribd logo
1  sur  33
Experimental evaluation in
education
Professor Carole Torgerson
School of Education, Durham University, United Kingdom
carole.torgerson@durham.ac.uk
International workshop Social Experiments and Innovation: a new
paradigm for public intervention?
Barcelona, September 26th 2013
Some landmark experimental evaluations
in education in the US and the developing
world
• Cambridge Somerville youth experiment (US)
– Early intervention to reduce ‘juvenile delinquency’
• Tennessee class-size experiment (US)
– Class-size reduction intervention in early years education
• PROGRESA experiment (Mexico)
– Rural anti-poverty intervention
• Balsakhi experiment (India)
– Teaching assistants intervention in literacy and numeracy
Why social experiments?
• Powerful tool for evaluating impacts in
education:
– Simple to understand
– Simple to analyse
– Ethical to use random allocation in the presence
of resource scarcity
– Scientifically the most rigorous evaluative
approach
– Deal with known and unknown confounding
Experimental evaluation in education in
an age of austerity
• Educational interventions that do not work waste
money or worse
• Educational budgets across Europe under strain
due to poor economic conditions
• Imperative that the most effective and cost-
effective interventions are adopted by policy
makers
• Randomised experiments offer the best evidence
for efficacy, effectiveness and cost-effectiveness
Some examples of expensive educational
interventions that don’t work (UK)
• Financial incentives (Brooks et al, 2008)
• ICT and spelling (Brooks et al, 2006)
• Nurse numeracy intervention (Ainsworth et al,
2011)
Example of educational intervention that
does work: Every Child Counts
• Every Child Counts (ECC) was previous UK
government’s flagship policy to help children (age 7)
at risk in numeracy
• Expensive one-to-one tutoring intervention
(Numbers Count) delivered each day over one school
term (12 weeks)
• Randomised experiment commissioned to establish
effectiveness and cost-effectiveness (Torgerson et al,
2013a; Torgerson et al, 2013b)
• Pre-post test evaluation (undertaken by developer)
demonstrated large effect size (>1 SD) and cost-
effectiveness (using weak design)
The ECC evaluation design
• Three linked randomised experiments:
– ECC (Numbers Count) vs. ‘business as usual’ (Trial 1):
44 schools
– ECC (Numbers Count) pairs vs. ECC one-to-one (Trial
2): 15 schools
– ECC triplets (Numbers Count) vs. ECC one-to-one (Trial
3): 7 schools
• Process evaluation:
– Random sample of schools
– Implementation and delivery
Design of Trial 1
• 12 children in 44 schools eligible for ‘Numbers Count’ intervention
• Numeracy test (Sandwell test) (pre-test) at beginning of autumn term
(administered by teachers)
• Random allocation of 12 children to term of delivery: autumn, spring or
summer: ‘waiting list’ design
• Intervention group: autumn children
• Control group: spring and summer children
• Numeracy test (Progress in Maths test) after 12 weeks (administered by
independent testers) (post-test)
• Simple analysis: compare the mean numeracy post-test score of
intervention children with mean numeracy score of control children and
conclude whether ‘Numbers Count’ is more effective than
• ‘business as usual’
• Rigorous design: excludes some alternative explanations for results
Design features that minimise alternative
explanations for results
• Large sample size: excludes chance finding
• Randomisation: intervention and control groups are
equivalent at start so design controls for history,
maturation, regression to the mean, selection bias
• Intervention and control conditions are both
numeracy interventions and both last for 30 minutes
each day for 12 weeks: the comparison is a ‘fair’ one
• Independent ‘blinded’ testing: eliminates possibility
of tester bias
Results
Intervention
Group
Control
Group
PIM 6 (0-30) 15.8
N = 144
14.0
N = 440
Results
Results
Intervention
Group
Control
Group
Effect Size
95% Confidence
Interval
PIM 6 (0-30) 15.8 (4.9)
N = 144
14.0 (4.5)
N = 440
0.33
(0.12 to 0.53)
Results
• ECC better than business as usual (0.33 SD)
but expensive
• No evidence that one-to-one was better than
ECC in pairs or triplets
• One-to-one not cost effective – more cost-
effective to deliver in pairs or triplets
Design limitations: Generalisability
• ECC schools were identified:
by policy-makers/funders of
programme - education
policy ‘roll out’ in England,
i.e., schools in
disadvantaged areas
• Ideally, a random sample of
all secondary schools in
England should have been
approached and asked to
take part
Design limitations: Intervention
• One-to-one teaching with
intervention children being
withdrawn from classroom
• Problem of attribution: was
effect due to Numbers
Count intervention? Or to
one-to-one teaching? Or to
withdrawal from classroom?
• Design could have included
additional one-to-one arm
Design limitations: ‘Contamination’/’spill
over’ effects
• Children withdrawn from
usual classroom teaching –
may have benefited
remaining children
• Teachers using intervention
have applied it to some
control children.
• Instead of randomising
individual children, design
could have randomised by
school (cluster
randomisation, where
school is the cluster) to
avoid these problems.
Design limitations: Long term effects
• Wait list design prevented
long term follow-up; effects
may have ‘washed out’
soon after intervention was
finished.
• Could have used cluster
randomisation;
• Could have recruited
children above threshold
and randomised these to
intervention or long term
follow-up;
• All options (above) rejected
by funder.
Conclusions
• Design and conduct warranted conclusion Numbers Count (as
delivered) more effective than usual classroom teaching BUT
because of design limitations couldn’t answer some really
important questions
• These questions could have been answered if a different
experimental design had been used: cluster randomisation
(randomisation of schools), long-term follow-up (control
group that didn’t receive intervention); one to one control
group (literacy or other numeracy)
3 EEF ‘transitions’ projects
• Background:
– Interventions aimed at children from disadvantaged
backgrounds and those struggling to reach national key stage
writing standards.
• Primary outcome measure:
– combined score on the 2 writing tasks within the Progress in
English test (GL assessment)
• Secondary outcomes:
– scores on the reading, spelling and grammar components of the
Progress in English test (GL assessment)
DISCOVER
• Research question:
– What is the effectiveness of the Discover summer
writing workshop intervention compared with a
‘business as usual’ control group on the writing
abilities of participating children?
• Individually randomised experiment
Discover: Trial Design Diagram
Improving Writing Quality Intervention:
Calderdale
• Research question:
– What is the effectiveness of the Improving Writing
Quality programme compared with ‘business as
usual’ on the writing skills of participating
children?
• Pragmatic cluster randomised design
Calderdale: Trial Design Diagram
Exeter Grammar for Writing
Intervention
• Research questions:
1. What is the effectiveness of the whole class Grammar for writing intervention
compared with a ‘business as usual’ control group on writing skills of participating
children?
2. What is the effectiveness of the whole class Grammar for writing intervention plus
additional small group intervention compared with a ‘business as usual’ control group
on writing skills of participating children?
3. What is the effectiveness of the whole class Grammar for writing intervention plus
additional small group intervention compared with the whole class Grammar for
writing intervention only on writing skills of participating children?
• Partial split plot design
Exeter: Trial Design Diagram
Exeter: Trial Design Diagram
Exeter: Trial Design Diagram
Answers research question 1:
effectiveness of whole class
intervention compared with “business
as usual” on writing skills?
Exeter: Trial Design Diagram
Exeter: Trial Design Diagram
Answers research question 2:
Effectiveness of whole class
intervention plus additional small
group intervention compared with
“business as usual” control group
on writing skills?
Exeter: Trial Design Diagram
Exeter: Trial Design Diagram
Answers research question 3:
effectiveness of whole class
intervention plus additional small
group intervention compared with
whole class intervention only on
writing skills?
Some challenges in promoting and
undertaking experimental evaluations in
education
• Resistance from within research community
• Lack of political will
• Lack of funding opportunities for individual
experiments and for capacity building
• Lack of capacity (experience and expertise) to
undertake rigorous experiments
• Potential for conflict of interest (Developer of
intervention)
• Recruitment and retention
References
• Ainsworth, H., Torgerson, D., Torgerson, C. et al (2011) Computer-based instruction
for improving student nurses’ general numeracy: Is it effective? Two RCTs,
Educational Studies
• Brooks, G., Burton, M., Coles, P., Miles, J., Torgerson, C., Torgerson, D. (2008)
Randomised controlled trial of incentives to improve attendance at adult literacy
classes, Oxford Review of Education, 34(4)
• Brooks, G., Miles, J.N.V., Torgerson, C.J. and Torgerson, D.J. (2006) Is an
intervention using computer software effective in literacy learning? A randomised
controlled trial, Educational Studies, 32(1)
• Torgerson, C.J.,Wiggins, A., Torgerson, D.J., Ainsworth, H., Hewitt, C. Every Child
Counts: Testing policy effectiveness using a randomized controlled trial, designed,
conducted and reported to CONSORT standards, Journal of Research in
Mathematics Education, July 2013
• Torgerson, C.J., Wiggins, A., Torgerson, D.J., Ainsworth, H., Hewitt, C. The
effectiveness of an intensive individual tutoring programme (Numbers Count)
delivered individually or to small groups of children: A randomised controlled trial,
Effective Education, Apr., 2013

Contenu connexe

Tendances

2017 Universal Education District Contacts Meeting
2017 Universal Education District Contacts Meeting2017 Universal Education District Contacts Meeting
2017 Universal Education District Contacts Meetingvthorvthor
 
Format for final defense
Format for final defenseFormat for final defense
Format for final defenseeyeshield8654
 
Item analysis
Item analysisItem analysis
Item analysisaneez103
 
Handout 2: SSE case study: The SSE journey for our case study school
Handout 2: SSE case study: The SSE journey for our case study schoolHandout 2: SSE case study: The SSE journey for our case study school
Handout 2: SSE case study: The SSE journey for our case study schoolMartin Brown
 
Connecting evidence based instructional practices to rti
Connecting evidence based instructional practices to rtiConnecting evidence based instructional practices to rti
Connecting evidence based instructional practices to rtiEast Central ISD
 
Pp sse booklet website
Pp sse booklet websitePp sse booklet website
Pp sse booklet websiteMartin Brown
 
MET_Gathering_Feedback_Practioner_Brief
MET_Gathering_Feedback_Practioner_BriefMET_Gathering_Feedback_Practioner_Brief
MET_Gathering_Feedback_Practioner_BriefPaul Fleischman
 
ITEM ANALYSIS
ITEM ANALYSIS ITEM ANALYSIS
ITEM ANALYSIS Waqas Khan
 
The nittygritty of language testing
The nittygritty of language testingThe nittygritty of language testing
The nittygritty of language testingMarzs
 
تصميم عمليات تقويم المناهج
تصميم عمليات تقويم المناهجتصميم عمليات تقويم المناهج
تصميم عمليات تقويم المناهجIEFE
 
Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...
Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...
Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...William Kritsonis
 
TESTA Interactive Masterclass
TESTA Interactive MasterclassTESTA Interactive Masterclass
TESTA Interactive MasterclassTansy Jessop
 
Handout 3 SSE case study school (self-evaluation report: literacy)
Handout 3 SSE case study school (self-evaluation report: literacy)Handout 3 SSE case study school (self-evaluation report: literacy)
Handout 3 SSE case study school (self-evaluation report: literacy)Martin Brown
 
CypherWorx OST Effiacy Study Results 2015
CypherWorx OST Effiacy Study Results 2015CypherWorx OST Effiacy Study Results 2015
CypherWorx OST Effiacy Study Results 2015Steve Stookey
 

Tendances (20)

Group1 Assessment
Group1 AssessmentGroup1 Assessment
Group1 Assessment
 
2017 Universal Education District Contacts Meeting
2017 Universal Education District Contacts Meeting2017 Universal Education District Contacts Meeting
2017 Universal Education District Contacts Meeting
 
Format for final defense
Format for final defenseFormat for final defense
Format for final defense
 
Item analysis
Item analysisItem analysis
Item analysis
 
The Use of Online Quizzes for Continuous Assessment and Self-Assessment of Se...
The Use of Online Quizzes for Continuous Assessment and Self-Assessment of Se...The Use of Online Quizzes for Continuous Assessment and Self-Assessment of Se...
The Use of Online Quizzes for Continuous Assessment and Self-Assessment of Se...
 
Handout 2: SSE case study: The SSE journey for our case study school
Handout 2: SSE case study: The SSE journey for our case study schoolHandout 2: SSE case study: The SSE journey for our case study school
Handout 2: SSE case study: The SSE journey for our case study school
 
3.7 Evaluating impacts of VLE use
3.7 Evaluating impacts of VLE use3.7 Evaluating impacts of VLE use
3.7 Evaluating impacts of VLE use
 
Item analysis
Item analysisItem analysis
Item analysis
 
Connecting evidence based instructional practices to rti
Connecting evidence based instructional practices to rtiConnecting evidence based instructional practices to rti
Connecting evidence based instructional practices to rti
 
Pp sse booklet website
Pp sse booklet websitePp sse booklet website
Pp sse booklet website
 
MET_Gathering_Feedback_Practioner_Brief
MET_Gathering_Feedback_Practioner_BriefMET_Gathering_Feedback_Practioner_Brief
MET_Gathering_Feedback_Practioner_Brief
 
Test item analysis
Test item analysisTest item analysis
Test item analysis
 
ITEM ANALYSIS
ITEM ANALYSIS ITEM ANALYSIS
ITEM ANALYSIS
 
The nittygritty of language testing
The nittygritty of language testingThe nittygritty of language testing
The nittygritty of language testing
 
تصميم عمليات تقويم المناهج
تصميم عمليات تقويم المناهجتصميم عمليات تقويم المناهج
تصميم عمليات تقويم المناهج
 
Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...
Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...
Dr. Teresa Ann Hughes, PhD Dissertation Defense, Dr. William Allan Kritsonis,...
 
Sreb March 2010 5
Sreb March 2010 5Sreb March 2010 5
Sreb March 2010 5
 
TESTA Interactive Masterclass
TESTA Interactive MasterclassTESTA Interactive Masterclass
TESTA Interactive Masterclass
 
Handout 3 SSE case study school (self-evaluation report: literacy)
Handout 3 SSE case study school (self-evaluation report: literacy)Handout 3 SSE case study school (self-evaluation report: literacy)
Handout 3 SSE case study school (self-evaluation report: literacy)
 
CypherWorx OST Effiacy Study Results 2015
CypherWorx OST Effiacy Study Results 2015CypherWorx OST Effiacy Study Results 2015
CypherWorx OST Effiacy Study Results 2015
 

En vedette

Auditoria de Comunicacion - 1426NA
Auditoria de Comunicacion - 1426NAAuditoria de Comunicacion - 1426NA
Auditoria de Comunicacion - 1426NA1426NA
 
Social Media Marketing Plan methodology
Social Media Marketing Plan methodologySocial Media Marketing Plan methodology
Social Media Marketing Plan methodologyDemand Metric
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsLinkedIn
 
Hype vs. Reality: The AI Explainer
Hype vs. Reality: The AI ExplainerHype vs. Reality: The AI Explainer
Hype vs. Reality: The AI ExplainerLuminary Labs
 

En vedette (6)

Pasos para analizar hoy tu sitio web (2)
Pasos para analizar hoy tu sitio web (2)Pasos para analizar hoy tu sitio web (2)
Pasos para analizar hoy tu sitio web (2)
 
Auditoria de Comunicacion - 1426NA
Auditoria de Comunicacion - 1426NAAuditoria de Comunicacion - 1426NA
Auditoria de Comunicacion - 1426NA
 
Social Media Marketing Plan methodology
Social Media Marketing Plan methodologySocial Media Marketing Plan methodology
Social Media Marketing Plan methodology
 
Ejemplos de auditoria. cedulas
Ejemplos de auditoria. cedulasEjemplos de auditoria. cedulas
Ejemplos de auditoria. cedulas
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving Cars
 
Hype vs. Reality: The AI Explainer
Hype vs. Reality: The AI ExplainerHype vs. Reality: The AI Explainer
Hype vs. Reality: The AI Explainer
 

Similaire à Experimental evaluation in education

Inspiring change in assessment and feedback
Inspiring change in assessment and feedbackInspiring change in assessment and feedback
Inspiring change in assessment and feedbackTansy Jessop
 
Peer Teaching PPT 02-02-23.ppt
Peer Teaching PPT 02-02-23.pptPeer Teaching PPT 02-02-23.ppt
Peer Teaching PPT 02-02-23.pptMuxITLinks
 
Learning-oriented assessment in an era of high-stakes and insecure testing
Learning-oriented assessment in an era of high-stakes and insecure testingLearning-oriented assessment in an era of high-stakes and insecure testing
Learning-oriented assessment in an era of high-stakes and insecure testingMark Carver
 
Improving student learning through programme assessment
Improving student learning through programme assessmentImproving student learning through programme assessment
Improving student learning through programme assessmentTansy Jessop
 
QUANTI-LESSON-1.pptx
QUANTI-LESSON-1.pptxQUANTI-LESSON-1.pptx
QUANTI-LESSON-1.pptxAldineRaytan
 
TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan...
 TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan... TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan...
TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan...TESTA winch
 
Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...
Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...
Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...Amber D. Marcu, Ph.D.
 
Learner Engagement, QM Standards and the Blended Course
Learner Engagement, QM Standards and the Blended CourseLearner Engagement, QM Standards and the Blended Course
Learner Engagement, QM Standards and the Blended CourseLakeland Community College
 
Stephen Phillips - Implementing an inquiry based approach into our school
Stephen Phillips - Implementing an inquiry based approach into our schoolStephen Phillips - Implementing an inquiry based approach into our school
Stephen Phillips - Implementing an inquiry based approach into our schoolSails-project
 
Louise Naylor
Louise NaylorLouise Naylor
Louise NaylorSEDA
 
TESTA to FASTECH Presentation
TESTA to FASTECH PresentationTESTA to FASTECH Presentation
TESTA to FASTECH PresentationTansy_Jessop
 
Developing assessment patterns that work through TESTA
Developing assessment patterns that work through TESTADeveloping assessment patterns that work through TESTA
Developing assessment patterns that work through TESTATansy Jessop
 
Instructional leardership chpt.8
Instructional leardership chpt.8Instructional leardership chpt.8
Instructional leardership chpt.8Michael Anthony
 
Dodson kelly symposium
Dodson kelly symposiumDodson kelly symposium
Dodson kelly symposiumKelly Dodson
 
Sse workshop 2 spring 2014
Sse workshop 2 spring 2014Sse workshop 2 spring 2014
Sse workshop 2 spring 2014Martin Brown
 
Squeezing assessment and stretching learning
Squeezing assessment and stretching learningSqueezing assessment and stretching learning
Squeezing assessment and stretching learningTansy Jessop
 
TESTA Summit, Woburn House London (September 2013)
TESTA Summit, Woburn House London (September 2013)TESTA Summit, Woburn House London (September 2013)
TESTA Summit, Woburn House London (September 2013)TESTA winch
 
Building Assessment Literacy with teachers and students
Building Assessment Literacy with teachers and studentsBuilding Assessment Literacy with teachers and students
Building Assessment Literacy with teachers and studentsahmadnaimullah1
 
Assessment in Education: Principles, Policy & Practice
Assessment in Education: Principles, Policy & PracticeAssessment in Education: Principles, Policy & Practice
Assessment in Education: Principles, Policy & Practicejaoverla
 

Similaire à Experimental evaluation in education (20)

Inspiring change in assessment and feedback
Inspiring change in assessment and feedbackInspiring change in assessment and feedback
Inspiring change in assessment and feedback
 
Peer Teaching PPT 02-02-23.ppt
Peer Teaching PPT 02-02-23.pptPeer Teaching PPT 02-02-23.ppt
Peer Teaching PPT 02-02-23.ppt
 
Learning-oriented assessment in an era of high-stakes and insecure testing
Learning-oriented assessment in an era of high-stakes and insecure testingLearning-oriented assessment in an era of high-stakes and insecure testing
Learning-oriented assessment in an era of high-stakes and insecure testing
 
Improving student learning through programme assessment
Improving student learning through programme assessmentImproving student learning through programme assessment
Improving student learning through programme assessment
 
QUANTI-LESSON-1.pptx
QUANTI-LESSON-1.pptxQUANTI-LESSON-1.pptx
QUANTI-LESSON-1.pptx
 
lec #1.pptx
lec #1.pptxlec #1.pptx
lec #1.pptx
 
TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan...
 TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan... TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan...
TESTA, Presentation to the SDG Course Leaders, University of West of Scotlan...
 
Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...
Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...
Relationship of Self-Efficacy to Stages of Concerns in the Adoption of Innova...
 
Learner Engagement, QM Standards and the Blended Course
Learner Engagement, QM Standards and the Blended CourseLearner Engagement, QM Standards and the Blended Course
Learner Engagement, QM Standards and the Blended Course
 
Stephen Phillips - Implementing an inquiry based approach into our school
Stephen Phillips - Implementing an inquiry based approach into our schoolStephen Phillips - Implementing an inquiry based approach into our school
Stephen Phillips - Implementing an inquiry based approach into our school
 
Louise Naylor
Louise NaylorLouise Naylor
Louise Naylor
 
TESTA to FASTECH Presentation
TESTA to FASTECH PresentationTESTA to FASTECH Presentation
TESTA to FASTECH Presentation
 
Developing assessment patterns that work through TESTA
Developing assessment patterns that work through TESTADeveloping assessment patterns that work through TESTA
Developing assessment patterns that work through TESTA
 
Instructional leardership chpt.8
Instructional leardership chpt.8Instructional leardership chpt.8
Instructional leardership chpt.8
 
Dodson kelly symposium
Dodson kelly symposiumDodson kelly symposium
Dodson kelly symposium
 
Sse workshop 2 spring 2014
Sse workshop 2 spring 2014Sse workshop 2 spring 2014
Sse workshop 2 spring 2014
 
Squeezing assessment and stretching learning
Squeezing assessment and stretching learningSqueezing assessment and stretching learning
Squeezing assessment and stretching learning
 
TESTA Summit, Woburn House London (September 2013)
TESTA Summit, Woburn House London (September 2013)TESTA Summit, Woburn House London (September 2013)
TESTA Summit, Woburn House London (September 2013)
 
Building Assessment Literacy with teachers and students
Building Assessment Literacy with teachers and studentsBuilding Assessment Literacy with teachers and students
Building Assessment Literacy with teachers and students
 
Assessment in Education: Principles, Policy & Practice
Assessment in Education: Principles, Policy & PracticeAssessment in Education: Principles, Policy & Practice
Assessment in Education: Principles, Policy & Practice
 

Dernier

ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...Nguyen Thanh Tu Collection
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...JojoEDelaCruz
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptxmary850239
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 

Dernier (20)

ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
HỌC TỐT TIẾNG ANH 11 THEO CHƯƠNG TRÌNH GLOBAL SUCCESS ĐÁP ÁN CHI TIẾT - CẢ NĂ...
 
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
ENG 5 Q4 WEEk 1 DAY 1 Restate sentences heard in one’s own words. Use appropr...
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx4.18.24 Movement Legacies, Reflection, and Review.pptx
4.18.24 Movement Legacies, Reflection, and Review.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 

Experimental evaluation in education

  • 1. Experimental evaluation in education Professor Carole Torgerson School of Education, Durham University, United Kingdom carole.torgerson@durham.ac.uk International workshop Social Experiments and Innovation: a new paradigm for public intervention? Barcelona, September 26th 2013
  • 2. Some landmark experimental evaluations in education in the US and the developing world • Cambridge Somerville youth experiment (US) – Early intervention to reduce ‘juvenile delinquency’ • Tennessee class-size experiment (US) – Class-size reduction intervention in early years education • PROGRESA experiment (Mexico) – Rural anti-poverty intervention • Balsakhi experiment (India) – Teaching assistants intervention in literacy and numeracy
  • 3. Why social experiments? • Powerful tool for evaluating impacts in education: – Simple to understand – Simple to analyse – Ethical to use random allocation in the presence of resource scarcity – Scientifically the most rigorous evaluative approach – Deal with known and unknown confounding
  • 4. Experimental evaluation in education in an age of austerity • Educational interventions that do not work waste money or worse • Educational budgets across Europe under strain due to poor economic conditions • Imperative that the most effective and cost- effective interventions are adopted by policy makers • Randomised experiments offer the best evidence for efficacy, effectiveness and cost-effectiveness
  • 5. Some examples of expensive educational interventions that don’t work (UK) • Financial incentives (Brooks et al, 2008) • ICT and spelling (Brooks et al, 2006) • Nurse numeracy intervention (Ainsworth et al, 2011)
  • 6. Example of educational intervention that does work: Every Child Counts • Every Child Counts (ECC) was previous UK government’s flagship policy to help children (age 7) at risk in numeracy • Expensive one-to-one tutoring intervention (Numbers Count) delivered each day over one school term (12 weeks) • Randomised experiment commissioned to establish effectiveness and cost-effectiveness (Torgerson et al, 2013a; Torgerson et al, 2013b) • Pre-post test evaluation (undertaken by developer) demonstrated large effect size (>1 SD) and cost- effectiveness (using weak design)
  • 7. The ECC evaluation design • Three linked randomised experiments: – ECC (Numbers Count) vs. ‘business as usual’ (Trial 1): 44 schools – ECC (Numbers Count) pairs vs. ECC one-to-one (Trial 2): 15 schools – ECC triplets (Numbers Count) vs. ECC one-to-one (Trial 3): 7 schools • Process evaluation: – Random sample of schools – Implementation and delivery
  • 8. Design of Trial 1 • 12 children in 44 schools eligible for ‘Numbers Count’ intervention • Numeracy test (Sandwell test) (pre-test) at beginning of autumn term (administered by teachers) • Random allocation of 12 children to term of delivery: autumn, spring or summer: ‘waiting list’ design • Intervention group: autumn children • Control group: spring and summer children • Numeracy test (Progress in Maths test) after 12 weeks (administered by independent testers) (post-test) • Simple analysis: compare the mean numeracy post-test score of intervention children with mean numeracy score of control children and conclude whether ‘Numbers Count’ is more effective than • ‘business as usual’ • Rigorous design: excludes some alternative explanations for results
  • 9. Design features that minimise alternative explanations for results • Large sample size: excludes chance finding • Randomisation: intervention and control groups are equivalent at start so design controls for history, maturation, regression to the mean, selection bias • Intervention and control conditions are both numeracy interventions and both last for 30 minutes each day for 12 weeks: the comparison is a ‘fair’ one • Independent ‘blinded’ testing: eliminates possibility of tester bias
  • 12. Results Intervention Group Control Group Effect Size 95% Confidence Interval PIM 6 (0-30) 15.8 (4.9) N = 144 14.0 (4.5) N = 440 0.33 (0.12 to 0.53)
  • 13. Results • ECC better than business as usual (0.33 SD) but expensive • No evidence that one-to-one was better than ECC in pairs or triplets • One-to-one not cost effective – more cost- effective to deliver in pairs or triplets
  • 14. Design limitations: Generalisability • ECC schools were identified: by policy-makers/funders of programme - education policy ‘roll out’ in England, i.e., schools in disadvantaged areas • Ideally, a random sample of all secondary schools in England should have been approached and asked to take part
  • 15. Design limitations: Intervention • One-to-one teaching with intervention children being withdrawn from classroom • Problem of attribution: was effect due to Numbers Count intervention? Or to one-to-one teaching? Or to withdrawal from classroom? • Design could have included additional one-to-one arm
  • 16. Design limitations: ‘Contamination’/’spill over’ effects • Children withdrawn from usual classroom teaching – may have benefited remaining children • Teachers using intervention have applied it to some control children. • Instead of randomising individual children, design could have randomised by school (cluster randomisation, where school is the cluster) to avoid these problems.
  • 17. Design limitations: Long term effects • Wait list design prevented long term follow-up; effects may have ‘washed out’ soon after intervention was finished. • Could have used cluster randomisation; • Could have recruited children above threshold and randomised these to intervention or long term follow-up; • All options (above) rejected by funder.
  • 18. Conclusions • Design and conduct warranted conclusion Numbers Count (as delivered) more effective than usual classroom teaching BUT because of design limitations couldn’t answer some really important questions • These questions could have been answered if a different experimental design had been used: cluster randomisation (randomisation of schools), long-term follow-up (control group that didn’t receive intervention); one to one control group (literacy or other numeracy)
  • 19. 3 EEF ‘transitions’ projects • Background: – Interventions aimed at children from disadvantaged backgrounds and those struggling to reach national key stage writing standards. • Primary outcome measure: – combined score on the 2 writing tasks within the Progress in English test (GL assessment) • Secondary outcomes: – scores on the reading, spelling and grammar components of the Progress in English test (GL assessment)
  • 20. DISCOVER • Research question: – What is the effectiveness of the Discover summer writing workshop intervention compared with a ‘business as usual’ control group on the writing abilities of participating children? • Individually randomised experiment
  • 22. Improving Writing Quality Intervention: Calderdale • Research question: – What is the effectiveness of the Improving Writing Quality programme compared with ‘business as usual’ on the writing skills of participating children? • Pragmatic cluster randomised design
  • 24. Exeter Grammar for Writing Intervention • Research questions: 1. What is the effectiveness of the whole class Grammar for writing intervention compared with a ‘business as usual’ control group on writing skills of participating children? 2. What is the effectiveness of the whole class Grammar for writing intervention plus additional small group intervention compared with a ‘business as usual’ control group on writing skills of participating children? 3. What is the effectiveness of the whole class Grammar for writing intervention plus additional small group intervention compared with the whole class Grammar for writing intervention only on writing skills of participating children? • Partial split plot design
  • 27. Exeter: Trial Design Diagram Answers research question 1: effectiveness of whole class intervention compared with “business as usual” on writing skills?
  • 29. Exeter: Trial Design Diagram Answers research question 2: Effectiveness of whole class intervention plus additional small group intervention compared with “business as usual” control group on writing skills?
  • 31. Exeter: Trial Design Diagram Answers research question 3: effectiveness of whole class intervention plus additional small group intervention compared with whole class intervention only on writing skills?
  • 32. Some challenges in promoting and undertaking experimental evaluations in education • Resistance from within research community • Lack of political will • Lack of funding opportunities for individual experiments and for capacity building • Lack of capacity (experience and expertise) to undertake rigorous experiments • Potential for conflict of interest (Developer of intervention) • Recruitment and retention
  • 33. References • Ainsworth, H., Torgerson, D., Torgerson, C. et al (2011) Computer-based instruction for improving student nurses’ general numeracy: Is it effective? Two RCTs, Educational Studies • Brooks, G., Burton, M., Coles, P., Miles, J., Torgerson, C., Torgerson, D. (2008) Randomised controlled trial of incentives to improve attendance at adult literacy classes, Oxford Review of Education, 34(4) • Brooks, G., Miles, J.N.V., Torgerson, C.J. and Torgerson, D.J. (2006) Is an intervention using computer software effective in literacy learning? A randomised controlled trial, Educational Studies, 32(1) • Torgerson, C.J.,Wiggins, A., Torgerson, D.J., Ainsworth, H., Hewitt, C. Every Child Counts: Testing policy effectiveness using a randomized controlled trial, designed, conducted and reported to CONSORT standards, Journal of Research in Mathematics Education, July 2013 • Torgerson, C.J., Wiggins, A., Torgerson, D.J., Ainsworth, H., Hewitt, C. The effectiveness of an intensive individual tutoring programme (Numbers Count) delivered individually or to small groups of children: A randomised controlled trial, Effective Education, Apr., 2013

Notes de l'éditeur

  1. Cluster Randomised Aspect of TrialComparison: Group A and B v Group C and D and E – Answer Research Question 1.Comparison: Group B v Group E – Answer Research question 2Individually Randomised Aspect of TrialComparison: Group D v Group E – Answer research question 3.
  2. Cluster Randomised Aspect of TrialComparison: Group A and B v Group C and D and E – Answer Research Question 1.Comparison: Group B v Group E – Answer Research question 2Individually Randomised Aspect of TrialComparison: Group D v Group E – Answer research question 3.
  3. Cluster Randomised Aspect of TrialComparison: Group A and B v Group C and D and E – Answer Research Question 1.Comparison: Group B v Group E – Answer Research question 2Individually Randomised Aspect of TrialComparison: Group D v Group E – Answer research question 3.
  4. Cluster Randomised Aspect of TrialComparison: Group A and B v Group C and D and E – Answer Research Question 1.Comparison: Group B v Group E – Answer Research question 2Individually Randomised Aspect of TrialComparison: Group D v Group E – Answer research question 3.
  5. Cluster Randomised Aspect of TrialComparison: Group A and B v Group C and D and E – Answer Research Question 1.Comparison: Group B v Group E – Answer Research question 2Individually Randomised Aspect of TrialComparison: Group D v Group E – Answer research question 3.
  6. Cluster Randomised Aspect of TrialComparison: Group A and B v Group C and D and E – Answer Research Question 1.Comparison: Group B v Group E – Answer Research question 2Individually Randomised Aspect of TrialComparison: Group D v Group E – Answer research question 3.
  7. Cluster Randomised Aspect of TrialComparison: Group A and B v Group C and D and E – Answer Research Question 1.Comparison: Group B v Group E – Answer Research question 2Individually Randomised Aspect of TrialComparison: Group D v Group E – Answer research question 3.