SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
1 Het begint met een idee
Experiment validity
Ivano Malavolta
Vrije Universiteit Amsterdam
2 Ivano Malavolta / S2 group / Empirical software engineering
Planning phases
Scope of this
lecture
Vrije Universiteit Amsterdam
3 Ivano Malavolta / S2 group / Green Lab
Experiment validity
● We aim for adequate validity, not universal validity
○ What matters is our population of interest
Validity is the extent to which our results are sound and
applicable to the real world
● Validity is in trade-off with experiment scope
Vrije Universiteit Amsterdam
Threats Identification
4
● Identifying threats helps to plan for adequate validity
● Each threat needs appropriate mitigation
● Several classifications of validity threats:
○ Campbell and Stanley [1]
○ Cook and Campbell [2]
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
5
Types of threat to validity
Theory
Observation
Cause EffectCausation
e.g. encoding algorithms e.g. Energy efficiency
Treatment Experiment Outcome
e.g. JPEG e.g. energy per image
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
Causation
Experiment
6
Types of threat to validity
Theory
Observation
Cause Effect
Treatment Outcome
Construct
Internal
Conclusion
Construct
External
Ivano Malavolta / S2 group / Green Lab
e.g. encoding algorithms e.g. Energy efficiency
e.g. JPEG e.g. energy per image
Vrije Universiteit Amsterdam
7
Internal validity
Internal Validity: causality between treatment and outcome
● Strongly related to the experiment design and operation
○ Are my results caused by the treatment?
○ Have I considered all possible factors?
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
8
Internal validity: types of threat
● History
○ Different trials of the experiment performed in different time frames (eg,
after holidays vs normal days)
● Maturation
○ Subjects may react differently over time (eg, learning effect, tiresome,
boredome)
● Selection
○ Some subjects may abandon the experiment
○ Event worse, some specific type of subjects may leave it
● Direction of causal influence
○ A causes B, B causes A, or X causes A and B?
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
9
Internal validity: mitigation
Analyze and identify confounding factors/noise
Choose appropriate experiment design
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
Conclusion Validity: statistical correctness and significance
● Are my conclusions correct?
● Are my results significant enough?
10
Conclusion validity
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
11
Conclusion validity: types of threat
● Low statistical power
○ Results not statistically significant
○ There is a significant difference but the statistical test does not reveal it
due to the low number of data points
● Violated assumptions of statistical tests
○ eg, many tests assume normally distributed samples
● Fishing and error rate
○ If you are combining multiple statistical tests, also their significance
should be adapted
● Reliability of measures
○ If you repeat the measurement you should get similar results → same
conclusions
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
12
Conclusion validity: mitigation
Select appropriate tests
Use only as much significance as needed
Keep environment under control
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
13
Construct validity
● Have I defined my constructs properly?
● Am I analyzing the correct variables for the effects?
Construct Validity: relation between theory and observation
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
14
Construct validity: types of threat
● Inadequate preoperational explication of constructs
○ construct not well defined before being translated into measures
○ Theory unclear
○ Comparing two methods, but not clear what does mean that a method is
better than another
● Mono-operation bias
○ I have one independent variable only, one single object or treatment
→ the experiment could not represent the theory
○ eg, inspection conducted on a single document not representative of the
set of documents on which the technique is often applied
● Mono-method bias
○ When you use a single type of measures or observations
○ The experimenter may bias the measures
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
15
Construct validity: mitigation
Early definition of constructs (GQM)
Use appropriate experiment design
Introduce redundancy for cross-checks
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
16
External validity
● Are my results valid for the whole target population?
● Have I selected a representative sample?
External Validity: generalizability of the results
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
17
External validity: types of threat
● Interaction of selection and treatment
○ the population of subjects is not representative of the one for which I
would like to generalize my results
○ eg, performing experiments with students to use results in industry
● Interaction of setting and treatment
○ the experimental setting or the material are not representative
○ e.g. I let the subjects using tools that they don’t use in the reality
○ e.g. Web development using textual editors
○ Use of toy objects
● Interaction of history and treatment
○ the experiment is conducted on a special time or day which affects the
results
○ eg, our experiment on green software is performed after a big congress at
which some subjects participated
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
18
External validity: mitigation
Use an environment as realistic as possible
Explicitly define and model your context
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
● You know that you have to explicitly take into account the
threats to validity of your experiment
● Discussing threats actually makes your experiment stronger
○ you are not showing your weaknesses, but you are playing for
replicability
● You will make tradeoffs between threats to validity in your
experiment
● Consider threats to validity from the beginning
○ Reasoning on them will make you feel more confident about the scope
and design of your experiment
19
What this lecture means to you?
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
20 Ivano Malavolta / S2 group / Empirical software engineering
Readings
Chapter 8
[1] Campbell and Stanley, Experimental and Quasi- Experimental designs for Research (1963).
(Blackboard)
[2] Cook and Campbell, Quasi-experimentation - Design and Analysis Issues for Field Settings
(1979). Available at the VU library.
Ivano Malavolta / S2 group / Green Lab
Vrije Universiteit Amsterdam
21 Ivano Malavolta / S2 group / Empirical software engineering
Some contents of lecture extracted from:
● Giuseppe Procaccianti’s lectures at VU
Acknowledgements

Contenu connexe

Tendances

The Green Lab - [07-A] Data Analysis
The Green Lab - [07-A] Data AnalysisThe Green Lab - [07-A] Data Analysis
The Green Lab - [07-A] Data AnalysisGiuseppe Procaccianti
 
[02-B] Experiment scoping
[02-B] Experiment scoping[02-B] Experiment scoping
[02-B] Experiment scopingIvano Malavolta
 
[13 - B] Experiment reporting
[13 - B] Experiment reporting[13 - B] Experiment reporting
[13 - B] Experiment reportingIvano Malavolta
 
The Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data VisualizationThe Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data VisualizationGiuseppe Procaccianti
 
[05-A] Experiment design (basics)
[05-A] Experiment design (basics)[05-A] Experiment design (basics)
[05-A] Experiment design (basics)Ivano Malavolta
 
The Green Lab - [01 C] Empirical software engineering
The Green Lab - [01 C] Empirical software engineeringThe Green Lab - [01 C] Empirical software engineering
The Green Lab - [01 C] Empirical software engineeringIvano Malavolta
 
The Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scopingThe Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scopingIvano Malavolta
 
The Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and toolsThe Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and toolsGiuseppe Procaccianti
 
The Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in RThe Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in RGiuseppe Procaccianti
 
The Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challengesThe Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challengesIvano Malavolta
 
[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect size[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect sizeIvano Malavolta
 
[03-B] Measurement theory basics
[03-B] Measurement theory basics[03-B] Measurement theory basics
[03-B] Measurement theory basicsIvano Malavolta
 
[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testing[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testingIvano Malavolta
 
Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...Ivano Malavolta
 
Odin2018_Minh_ML_Risk_Prediction
Odin2018_Minh_ML_Risk_PredictionOdin2018_Minh_ML_Risk_Prediction
Odin2018_Minh_ML_Risk_PredictionMinh Nguyen
 

Tendances (15)

The Green Lab - [07-A] Data Analysis
The Green Lab - [07-A] Data AnalysisThe Green Lab - [07-A] Data Analysis
The Green Lab - [07-A] Data Analysis
 
[02-B] Experiment scoping
[02-B] Experiment scoping[02-B] Experiment scoping
[02-B] Experiment scoping
 
[13 - B] Experiment reporting
[13 - B] Experiment reporting[13 - B] Experiment reporting
[13 - B] Experiment reporting
 
The Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data VisualizationThe Green Lab - [11-A] Data Visualization
The Green Lab - [11-A] Data Visualization
 
[05-A] Experiment design (basics)
[05-A] Experiment design (basics)[05-A] Experiment design (basics)
[05-A] Experiment design (basics)
 
The Green Lab - [01 C] Empirical software engineering
The Green Lab - [01 C] Empirical software engineeringThe Green Lab - [01 C] Empirical software engineering
The Green Lab - [01 C] Empirical software engineering
 
The Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scopingThe Green Lab - [02 B] Experiment scoping
The Green Lab - [02 B] Experiment scoping
 
The Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and toolsThe Green Lab - [04-A] Lab environment and tools
The Green Lab - [04-A] Lab environment and tools
 
The Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in RThe Green Lab - [12-A] Data visualization in R
The Green Lab - [12-A] Data visualization in R
 
The Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challengesThe Green Lab - [13 B] Future research challenges
The Green Lab - [13 B] Future research challenges
 
[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect size[09-A] Statistical tests and effect size
[09-A] Statistical tests and effect size
 
[03-B] Measurement theory basics
[03-B] Measurement theory basics[03-B] Measurement theory basics
[03-B] Measurement theory basics
 
[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testing[07-B] Statistical hypothesis testing
[07-B] Statistical hypothesis testing
 
Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...Collaborative Model-Driven Software Engineering: a Classification Framework a...
Collaborative Model-Driven Software Engineering: a Classification Framework a...
 
Odin2018_Minh_ML_Risk_Prediction
Odin2018_Minh_ML_Risk_PredictionOdin2018_Minh_ML_Risk_Prediction
Odin2018_Minh_ML_Risk_Prediction
 

En vedette

The Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis TestingThe Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis TestingGiuseppe Procaccianti
 
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...Ivano Malavolta
 
The Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setupThe Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setupIvano Malavolta
 
The Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web appsThe Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web appsIvano Malavolta
 
The Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentationThe Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentationGiuseppe Procaccianti
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...Lora Aroyo
 
WebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in CrowdsourcingWebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in CrowdsourcingLora Aroyo
 
PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors Victor de Boer
 
Talk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament ProceedingsTalk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament ProceedingsAstrid van Aggelen
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Lora Aroyo
 
Dive exploring history presentation
Dive exploring history presentationDive exploring history presentation
Dive exploring history presentationVictor de Boer
 
SealincMedia Accurator Demos
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator DemosLora Aroyo
 
Future TV is Now: Personalized & Social
Future TV is Now: Personalized & SocialFuture TV is Now: Personalized & Social
Future TV is Now: Personalized & SocialLora Aroyo
 
BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigData_Europe
 

En vedette (14)

The Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis TestingThe Green Lab - [07-B] Hypothesis Testing
The Green Lab - [07-B] Hypothesis Testing
 
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...Beyond Native Apps:  Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
Beyond Native Apps: Web Technologies to the Rescue! [SPLASH 2016 - Mobile! k...
 
The Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setupThe Green Lab - [04 B] [PWA] Experiment setup
The Green Lab - [04 B] [PWA] Experiment setup
 
The Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web appsThe Green Lab - [02 C] [case study] Progressive web apps
The Green Lab - [02 C] [case study] Progressive web apps
 
The Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentationThe Green Lab - [01-B] Case study presentation
The Green Lab - [01-B] Case study presentation
 
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
CHIP Project: Personalized Museum Tour with Real-Time Adaptation on a Mobile ...
 
WebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in CrowdsourcingWebSci2013 Harnessing Disagreement in Crowdsourcing
WebSci2013 Harnessing Disagreement in Crowdsourcing
 
PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors PiLOD talk: Dutch Ships and Sailors
PiLOD talk: Dutch Ships and Sailors
 
Talk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament ProceedingsTalk of Europe – Linking European Parliament Proceedings
Talk of Europe – Linking European Parliament Proceedings
 
Agora User Committee Meeting 2013
Agora User Committee Meeting 2013Agora User Committee Meeting 2013
Agora User Committee Meeting 2013
 
Dive exploring history presentation
Dive exploring history presentationDive exploring history presentation
Dive exploring history presentation
 
SealincMedia Accurator Demos
SealincMedia Accurator DemosSealincMedia Accurator Demos
SealincMedia Accurator Demos
 
Future TV is Now: Personalized & Social
Future TV is Now: Personalized & SocialFuture TV is Now: Personalized & Social
Future TV is Now: Personalized & Social
 
BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & Health
 

Similaire à The Green Lab - [09 B] Experiment validity

[13 - A] Experiment validity
[13 - A] Experiment validity[13 - A] Experiment validity
[13 - A] Experiment validityIvano Malavolta
 
[03-A] Experiment planning
[03-A] Experiment planning[03-A] Experiment planning
[03-A] Experiment planningIvano Malavolta
 
[01-B] Empirical software engineering
[01-B] Empirical software engineering[01-B] Empirical software engineering
[01-B] Empirical software engineeringIvano Malavolta
 
Hawaii International Conference on System Science - Chabada&Molka-Danielsen
Hawaii International Conference on System Science - Chabada&Molka-DanielsenHawaii International Conference on System Science - Chabada&Molka-Danielsen
Hawaii International Conference on System Science - Chabada&Molka-DanielsenMichal Chabada
 
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...Promise 2011: Panel - "Practical Software Project Improvements using Actionab...
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...CS, NcState
 
Psych 610 week 5 individual assignment homework exercise
Psych 610 week 5 individual assignment homework exercisePsych 610 week 5 individual assignment homework exercise
Psych 610 week 5 individual assignment homework exerciseicraltioclim1984
 
Visual Learning Pulse - Final Thesis presentation
Visual Learning Pulse - Final Thesis presentationVisual Learning Pulse - Final Thesis presentation
Visual Learning Pulse - Final Thesis presentationDaniele Di Mitri
 
DESIGNING AND CONDUCTING CASE STUDIES.ppt
DESIGNING AND CONDUCTING CASE STUDIES.pptDESIGNING AND CONDUCTING CASE STUDIES.ppt
DESIGNING AND CONDUCTING CASE STUDIES.pptMFeldyRiza1
 
Research Design
Research DesignResearch Design
Research DesignAngel Rose
 
RESEARCH METHODOLOGY FOR COMPUTER ENGINEERS
RESEARCH METHODOLOGY FOR COMPUTER ENGINEERSRESEARCH METHODOLOGY FOR COMPUTER ENGINEERS
RESEARCH METHODOLOGY FOR COMPUTER ENGINEERSElisée Ndjabu
 
Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...
Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...
Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...Shuhei Iitsuka
 
PM2 ARTICALS
PM2 ARTICALSPM2 ARTICALS
PM2 ARTICALSra na
 
How To Research
How To ResearchHow To Research
How To ResearchFengyi
 

Similaire à The Green Lab - [09 B] Experiment validity (20)

[13 - A] Experiment validity
[13 - A] Experiment validity[13 - A] Experiment validity
[13 - A] Experiment validity
 
[03-A] Experiment planning
[03-A] Experiment planning[03-A] Experiment planning
[03-A] Experiment planning
 
[01-B] Empirical software engineering
[01-B] Empirical software engineering[01-B] Empirical software engineering
[01-B] Empirical software engineering
 
Introduction to meta analysis
Introduction to meta analysisIntroduction to meta analysis
Introduction to meta analysis
 
Online Recitation Sessions
Online Recitation SessionsOnline Recitation Sessions
Online Recitation Sessions
 
ODiP: Open data and the scientific gift culture
ODiP: Open data and the scientific gift cultureODiP: Open data and the scientific gift culture
ODiP: Open data and the scientific gift culture
 
Critical Appraisal
Critical AppraisalCritical Appraisal
Critical Appraisal
 
Hawaii International Conference on System Science - Chabada&Molka-Danielsen
Hawaii International Conference on System Science - Chabada&Molka-DanielsenHawaii International Conference on System Science - Chabada&Molka-Danielsen
Hawaii International Conference on System Science - Chabada&Molka-Danielsen
 
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...Promise 2011: Panel - "Practical Software Project Improvements using Actionab...
Promise 2011: Panel - "Practical Software Project Improvements using Actionab...
 
Psych 610 week 5 individual assignment homework exercise
Psych 610 week 5 individual assignment homework exercisePsych 610 week 5 individual assignment homework exercise
Psych 610 week 5 individual assignment homework exercise
 
Visual Learning Pulse - Final Thesis presentation
Visual Learning Pulse - Final Thesis presentationVisual Learning Pulse - Final Thesis presentation
Visual Learning Pulse - Final Thesis presentation
 
DESIGNING AND CONDUCTING CASE STUDIES.ppt
DESIGNING AND CONDUCTING CASE STUDIES.pptDESIGNING AND CONDUCTING CASE STUDIES.ppt
DESIGNING AND CONDUCTING CASE STUDIES.ppt
 
Research Design
Research DesignResearch Design
Research Design
 
RESEARCH METHODOLOGY FOR COMPUTER ENGINEERS
RESEARCH METHODOLOGY FOR COMPUTER ENGINEERSRESEARCH METHODOLOGY FOR COMPUTER ENGINEERS
RESEARCH METHODOLOGY FOR COMPUTER ENGINEERS
 
Design of esperiment
Design of esperimentDesign of esperiment
Design of esperiment
 
Syllabus
SyllabusSyllabus
Syllabus
 
Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...
Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...
Improving the Sensitivity of Online Controlled Experiments by Utilizing Pre-E...
 
Applications of Machine Learning
Applications of Machine LearningApplications of Machine Learning
Applications of Machine Learning
 
PM2 ARTICALS
PM2 ARTICALSPM2 ARTICALS
PM2 ARTICALS
 
How To Research
How To ResearchHow To Research
How To Research
 

Plus de Ivano Malavolta

Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...Ivano Malavolta
 
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)Ivano Malavolta
 
Software sustainability and Green IT
Software sustainability and Green ITSoftware sustainability and Green IT
Software sustainability and Green ITIvano Malavolta
 
Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...Ivano Malavolta
 
How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]Ivano Malavolta
 
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...Ivano Malavolta
 
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...Ivano Malavolta
 
Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...Ivano Malavolta
 
Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...Ivano Malavolta
 
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...Ivano Malavolta
 
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...Ivano Malavolta
 
Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...Ivano Malavolta
 
[2017/2018] Agile development
[2017/2018] Agile development[2017/2018] Agile development
[2017/2018] Agile developmentIvano Malavolta
 
Reconstructing microservice-based architectures
Reconstructing microservice-based architecturesReconstructing microservice-based architectures
Reconstructing microservice-based architecturesIvano Malavolta
 
[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design Language[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design LanguageIvano Malavolta
 
[2017/2018] Architectural languages
[2017/2018] Architectural languages[2017/2018] Architectural languages
[2017/2018] Architectural languagesIvano Malavolta
 
[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software Architecture[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software ArchitectureIvano Malavolta
 
[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineering[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineeringIvano Malavolta
 
Mobile Apps quality - a tale about energy, performance, and users’ perception
Mobile Apps quality - a tale about energy, performance, and users’ perceptionMobile Apps quality - a tale about energy, performance, and users’ perception
Mobile Apps quality - a tale about energy, performance, and users’ perceptionIvano Malavolta
 

Plus de Ivano Malavolta (20)

Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
Conducting Experiments on the Software Architecture of Robotic Systems (QRARS...
 
The H2020 experience
The H2020 experienceThe H2020 experience
The H2020 experience
 
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)The Green Lab - Research cocktail  @Vrije Universiteit Amsterdam (October 2020)
The Green Lab - Research cocktail @Vrije Universiteit Amsterdam (October 2020)
 
Software sustainability and Green IT
Software sustainability and Green ITSoftware sustainability and Green IT
Software sustainability and Green IT
 
Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...Navigation-aware and Personalized Prefetching of Network Requests in Android ...
Navigation-aware and Personalized Prefetching of Network Requests in Android ...
 
How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]How Maintainability Issues of Android Apps Evolve [ICSME 2018]
How Maintainability Issues of Android Apps Evolve [ICSME 2018]
 
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
Experimenting on Mobile Apps Quality - a tale about Energy, Performance, and ...
 
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
Modeling objects interaction via UML sequence diagrams [Software Design] [Com...
 
Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...Modeling behaviour via UML state machines [Software Design] [Computer Science...
Modeling behaviour via UML state machines [Software Design] [Computer Science...
 
Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...Object-oriented design patterns in UML [Software Design] [Computer Science] [...
Object-oriented design patterns in UML [Software Design] [Computer Science] [...
 
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
Structure modeling with UML [Software Design] [Computer Science] [Vrije Unive...
 
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
Requirements engineering with UML [Software Design] [Computer Science] [Vrije...
 
Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...Modeling and abstraction, software development process [Software Design] [Com...
Modeling and abstraction, software development process [Software Design] [Com...
 
[2017/2018] Agile development
[2017/2018] Agile development[2017/2018] Agile development
[2017/2018] Agile development
 
Reconstructing microservice-based architectures
Reconstructing microservice-based architecturesReconstructing microservice-based architectures
Reconstructing microservice-based architectures
 
[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design Language[2017/2018] AADL - Architecture Analysis and Design Language
[2017/2018] AADL - Architecture Analysis and Design Language
 
[2017/2018] Architectural languages
[2017/2018] Architectural languages[2017/2018] Architectural languages
[2017/2018] Architectural languages
 
[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software Architecture[2017/2018] Introduction to Software Architecture
[2017/2018] Introduction to Software Architecture
 
[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineering[2017/2018] RESEARCH in software engineering
[2017/2018] RESEARCH in software engineering
 
Mobile Apps quality - a tale about energy, performance, and users’ perception
Mobile Apps quality - a tale about energy, performance, and users’ perceptionMobile Apps quality - a tale about energy, performance, and users’ perception
Mobile Apps quality - a tale about energy, performance, and users’ perception
 

Dernier

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Dernier (20)

Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

The Green Lab - [09 B] Experiment validity

  • 1. 1 Het begint met een idee Experiment validity Ivano Malavolta
  • 2. Vrije Universiteit Amsterdam 2 Ivano Malavolta / S2 group / Empirical software engineering Planning phases Scope of this lecture
  • 3. Vrije Universiteit Amsterdam 3 Ivano Malavolta / S2 group / Green Lab Experiment validity ● We aim for adequate validity, not universal validity ○ What matters is our population of interest Validity is the extent to which our results are sound and applicable to the real world ● Validity is in trade-off with experiment scope
  • 4. Vrije Universiteit Amsterdam Threats Identification 4 ● Identifying threats helps to plan for adequate validity ● Each threat needs appropriate mitigation ● Several classifications of validity threats: ○ Campbell and Stanley [1] ○ Cook and Campbell [2] Ivano Malavolta / S2 group / Green Lab
  • 5. Vrije Universiteit Amsterdam 5 Types of threat to validity Theory Observation Cause EffectCausation e.g. encoding algorithms e.g. Energy efficiency Treatment Experiment Outcome e.g. JPEG e.g. energy per image Ivano Malavolta / S2 group / Green Lab
  • 6. Vrije Universiteit Amsterdam Causation Experiment 6 Types of threat to validity Theory Observation Cause Effect Treatment Outcome Construct Internal Conclusion Construct External Ivano Malavolta / S2 group / Green Lab e.g. encoding algorithms e.g. Energy efficiency e.g. JPEG e.g. energy per image
  • 7. Vrije Universiteit Amsterdam 7 Internal validity Internal Validity: causality between treatment and outcome ● Strongly related to the experiment design and operation ○ Are my results caused by the treatment? ○ Have I considered all possible factors? Ivano Malavolta / S2 group / Green Lab
  • 8. Vrije Universiteit Amsterdam 8 Internal validity: types of threat ● History ○ Different trials of the experiment performed in different time frames (eg, after holidays vs normal days) ● Maturation ○ Subjects may react differently over time (eg, learning effect, tiresome, boredome) ● Selection ○ Some subjects may abandon the experiment ○ Event worse, some specific type of subjects may leave it ● Direction of causal influence ○ A causes B, B causes A, or X causes A and B? Ivano Malavolta / S2 group / Green Lab
  • 9. Vrije Universiteit Amsterdam 9 Internal validity: mitigation Analyze and identify confounding factors/noise Choose appropriate experiment design Ivano Malavolta / S2 group / Green Lab
  • 10. Vrije Universiteit Amsterdam Conclusion Validity: statistical correctness and significance ● Are my conclusions correct? ● Are my results significant enough? 10 Conclusion validity Ivano Malavolta / S2 group / Green Lab
  • 11. Vrije Universiteit Amsterdam 11 Conclusion validity: types of threat ● Low statistical power ○ Results not statistically significant ○ There is a significant difference but the statistical test does not reveal it due to the low number of data points ● Violated assumptions of statistical tests ○ eg, many tests assume normally distributed samples ● Fishing and error rate ○ If you are combining multiple statistical tests, also their significance should be adapted ● Reliability of measures ○ If you repeat the measurement you should get similar results → same conclusions Ivano Malavolta / S2 group / Green Lab
  • 12. Vrije Universiteit Amsterdam 12 Conclusion validity: mitigation Select appropriate tests Use only as much significance as needed Keep environment under control Ivano Malavolta / S2 group / Green Lab
  • 13. Vrije Universiteit Amsterdam 13 Construct validity ● Have I defined my constructs properly? ● Am I analyzing the correct variables for the effects? Construct Validity: relation between theory and observation Ivano Malavolta / S2 group / Green Lab
  • 14. Vrije Universiteit Amsterdam 14 Construct validity: types of threat ● Inadequate preoperational explication of constructs ○ construct not well defined before being translated into measures ○ Theory unclear ○ Comparing two methods, but not clear what does mean that a method is better than another ● Mono-operation bias ○ I have one independent variable only, one single object or treatment → the experiment could not represent the theory ○ eg, inspection conducted on a single document not representative of the set of documents on which the technique is often applied ● Mono-method bias ○ When you use a single type of measures or observations ○ The experimenter may bias the measures Ivano Malavolta / S2 group / Green Lab
  • 15. Vrije Universiteit Amsterdam 15 Construct validity: mitigation Early definition of constructs (GQM) Use appropriate experiment design Introduce redundancy for cross-checks Ivano Malavolta / S2 group / Green Lab
  • 16. Vrije Universiteit Amsterdam 16 External validity ● Are my results valid for the whole target population? ● Have I selected a representative sample? External Validity: generalizability of the results Ivano Malavolta / S2 group / Green Lab
  • 17. Vrije Universiteit Amsterdam 17 External validity: types of threat ● Interaction of selection and treatment ○ the population of subjects is not representative of the one for which I would like to generalize my results ○ eg, performing experiments with students to use results in industry ● Interaction of setting and treatment ○ the experimental setting or the material are not representative ○ e.g. I let the subjects using tools that they don’t use in the reality ○ e.g. Web development using textual editors ○ Use of toy objects ● Interaction of history and treatment ○ the experiment is conducted on a special time or day which affects the results ○ eg, our experiment on green software is performed after a big congress at which some subjects participated Ivano Malavolta / S2 group / Green Lab
  • 18. Vrije Universiteit Amsterdam 18 External validity: mitigation Use an environment as realistic as possible Explicitly define and model your context Ivano Malavolta / S2 group / Green Lab
  • 19. Vrije Universiteit Amsterdam ● You know that you have to explicitly take into account the threats to validity of your experiment ● Discussing threats actually makes your experiment stronger ○ you are not showing your weaknesses, but you are playing for replicability ● You will make tradeoffs between threats to validity in your experiment ● Consider threats to validity from the beginning ○ Reasoning on them will make you feel more confident about the scope and design of your experiment 19 What this lecture means to you? Ivano Malavolta / S2 group / Green Lab
  • 20. Vrije Universiteit Amsterdam 20 Ivano Malavolta / S2 group / Empirical software engineering Readings Chapter 8 [1] Campbell and Stanley, Experimental and Quasi- Experimental designs for Research (1963). (Blackboard) [2] Cook and Campbell, Quasi-experimentation - Design and Analysis Issues for Field Settings (1979). Available at the VU library. Ivano Malavolta / S2 group / Green Lab
  • 21. Vrije Universiteit Amsterdam 21 Ivano Malavolta / S2 group / Empirical software engineering Some contents of lecture extracted from: ● Giuseppe Procaccianti’s lectures at VU Acknowledgements