SlideShare une entreprise Scribd logo
1  sur  46
Learning Analytics for the Evaluation of
Competencies and Behaviors in Serious Games
José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es
Introductions
Where are you coming from?
playful.mit.edu
@playfulMIT
What we believe in
Who we are
Main contributors to this research
José A. Ruipérez-Valiente
BEng Telecomunications Systems (UCAM),
MEng Telecomunications, MSc y PhD
Telematics (UC3M), Postdoc (MIT)
6 years working in learning analytics across
many objectives and contexts
Currently focused in large scale trends in
MOOCs and game-based assessment
Juan de la Cierva Researcher at UMU and
affiliate at MIT Playful Journey Lab
YJ (Yoon Jeon) Kim
Executive Director Playful
Journey Lab located at
MIT Open Learning
Assessment scientist
Focus on games and
playful approaches for
assessment
Topics related to this talk
- Games for Learning
- Game-based Assessment
- Learning Analytics
- … and Design (which is transverse to numerous areas and applications)
Motivations
Why and how are we doing this?
A game is a voluntary interactive
activity, in which one or more players
follow rules that constrain their
behavior, enacting an artificial conflict
that ends in a quantifiable outcome.
~Eric Zimmerman (2004)
Why Games?
● Games are “flexible enough for players to
inhabit and explore through meaningful
play” (Salen & Zimmerman) (deep learning)
● Majority of children grow up playing games
● Learners have more freedom related to
how much effort they choose to expend,
how often they fail and try again (Osterweil,
2014) (real life)
Assessment is a process of reasoning
from evidence. Therefore, an
assessment is a tool designed to
observe students’ behavior and
produce data that can be used to draw
reasonable inferences about what
students know.
~ Bob Mislevy
Why Games for Assessment?
● Games incorporate multiple pathways to solution(s) where learners can make
meaningful choices and demonstrate multiple ways of solving problems
● Use complex and authentic problems → hard-to-measure constructs
o We need to assess 21st century skills
● Games are motivating and engaging → accurate assessment (Sundre &
Wise, 2003)
● It doesn’t feel like assessment (i.e. stealth assessment)
o Less stresful situations for students
Metaphor
The Broad view of Learning Analytics
…collection, analysis and reporting of data about learners and
their contexts, for purposes of understanding and optimising
learning and the environments in which it occurs…
Source: First Learning Analytics
and Knowledge Conference
The Learning Analytics data-driven Process
Raw data
generation
Feature
engineering
Visualizations
Recommendation
Report generator
Meaningful features
Which raw data is
necessary?
What to do with the processed
data?
What to obtain and How
to do it?
Technology as an engine to enhance learning
Exploration,
Correlation,
clustering,
prediction,
causes…
Learning
environments
Conclusions generate feedback and close the LA loop
Game-based Assessment
Design, model implementation and evaluation process
Design, Development and Evaluation Process of Game-based
Assessment
Design
● Design and implementation of game system
○ Game mechanics that can generate evidence
from the constructs and a data infrastructure that
effectively stores that evidence
○ The most iterative step of the process with very
frequent playtesting
1. Start with paper prototypes
2. Move to drafty digital prototypes
3. End with advanced digital prototypes
● Data collection
○ Diverse audiences and contexts
○ Very important for game mechanics and tech side
○ Face-to-face playtesting
○ Amazon MTurk
Face-to-face playtesting
Amazon Mechnical Turk as part of the design process
Amazon Mechnical Turk as part of the design process
Balance between Game Design and Assessment Design
Meet Shadowspect!
More at https://shadowspect.org/
Model development
● Implementation of the assessment machinery:
○ Process of turning evidence into constructs
○ Content knowledge assessment: Following a
traditional Evidence-centered Design
○ Cognitive and behavioral assessment: Combining
knowledge engineering process and ML with expert
labelling
● Data collection:
○ Same high school context, age, and settings
○ Two sessions of one hour each
○ Around 10 US high school classes and more than 200
hundred students
Model development:
Content knowledge assessment
Implementation via Evidence-centered Design
Common Core Geometry Standards
● Competency model: We focus on the common core geometry standards
o MG.A.1: Use geometric shapes, their measures, and their properties to describe
objects (e.g., modeling a tree trunk or a human torso as a cylinder)
o GMD.B.4: Identify the shapes of two-dimensional cross-sections of three-
dimensional objects, and identify three-dimensional objects generated by rotations
of two-dimensional objects
o CO.A.5: Given a geometric figure and a rotation, reflection, or translation, draw the
transformed figure
o CO.B.6: Use geometric descriptions of rigid motions to transform figures and to
predict the effect of a given rigid motion on a given figure
ECD Summary for Geometry Common Standards Assessmement
● Collaboration with geometry specialist, game designer and assessment designer
○ Evidence model: We generate puzzles that generate evidence from the Geometry Common Standards
○ Task model: We map the relationship (none, weak or strong) of each puzzle with the common standard
○ Assembly model: We put all the evidence from a student together to assess their content knowledge
○ Presentation & Delivery model: Reports and dashboards by student/standard. Difficulty by exercise
Puzzle MG.A.1 GMD.B.4 …
Puzzle 1 Weak Weak …
Puzzle 2 None None …
… … … …
Student Puzzle 1 Puzzle 2 …
Student 1
OK, # 1
attempt
OK, # 3
attempts
…
Student 1 NA
Fail, # 5
attempt
…
… … … …
Our simplified case scenario right now
Evidence Standards
map
Model development:
Cognitive and Behavioral Assessment
Implementation via a Learning Analytics Knowledge Engineering Process
Knowledge Engineering Process
● We acquire knowledge about the construct that we want to measure
1. Reading about the construct
2. Conducting interview with experts
3. Reviewing related scientific literature
● We algorithmically implement features that use the data/evidence that can inform the
construct that we want to measure
Our simplified case scenario now updates to:
Evidence Constructs
map
Data Features
data schema inform
algorithms
Efficiency construct
- Efficiency is the ability to do things well, successfully, and without waste. It
often specifically comprises the capability of a specific application of effort
to produce a specific outcome with a minimum amount or quantity of
waste, expense, or unnecessary effort (Wikipedia)
Evidence in Shadowspect related to efficiency
● Ability to do things well:
○ Solving puzzles correctly
● Expense or effort:
○ Time invested
○ Number of attempts to solve a problem
Mapping evidence into necessary data in Shadowspect
● We need: puzzles solved correctly, time invested and attempts
○ Necessary types of events for that:
■ puzzle_start (timestamp, student, puzzle_id)
■ leave_to_menu (timestamp, student, puzzle_id)
■ puzzle_attempt (timestamp, student, puzzle_id, correct)
How does data in Shadowspect actually looks like?
Algorithm to compute features from data (pseudo-code)
# note this is a VERY simplified version that do not aim to be the most effective implementation of this algorithm
computeEfficiencyFeatures(student):
student_events = getStudentEvents(student)
correct_exercises_list = list(); number_attempts = 0; total_time = 0; puzzle_started_event = None
for event in student_events:
if(event[‘type’] == ‘puzzle_started’) then
puzzle_started_event = event
elif(event[‘type’] == ‘leave_to_menu’) then
total_time += (event[‘timestamp’] - puzzle_started_event[‘timestamp’])
puzzle_started_event = None
elif(event[‘type’] == ‘puzzle_attempt’):
number_attempts += 1
if(event[‘correct’] == True) then
correct_exercises_list.add(event[‘puzzle_id’])
attempts_per_correct_problem = length(unique(correct_exercises_list))/number_attempts
time_per_correct_problem = length(unique(correct_exercises_list))/total_time
return(attempts_per_correct_problem, time_per_correct_problem)
The previous general scenario
Evidence Constructs
map
Data Features
data schema inform
algorithms
Model for efficiency in Shadowspect
Evidence
● Correct puzzles
● Time
● Number attempts
Data
● puzzle_start
● leave_to_menu
● puzzle_attempt
data schema inform
computeEfficiency
Features(student)
Construct
Efficiency
Features
attempts_per_correct_problem
time_per_correct_problem
map
Model development:
Cognitive and Behavioral Assessment
Implementation via Learning Analytics with Experts and Machine Learning
Expert Labelling and Machine Learning Process
● Two or more experts label text or video replays that can be visually assessed
○ We divide all level interactions in replays that can be labelled
○ Experts review replays and label them for each construct that we want to measure
■ They might use rubrics and we are looking for expert inter-agreement (Cohen’s kappa)
○ We implement a supervised machine learning assessment model based on these labels
● Challenges here include achieving good inter-agreement, technical logistics, replay
resolution and final implementation of the ML model
Example of simplified text replay: 1. Start puzzle – 2. Create shape square – 3. Move square – 4. Create cone
5. Rotate cone – 6. Change perspective – 7. Snapshot – 8. Move cone – 9. Submit – 10 Puzzle correct
Expert Labelling and Machine Learning Process
Evidence
Constructsmap
Data Features
data schema
inform
algorithms
expert
assessment
ML/AI
Evaluation
● We are not here yet! Future plans:
● Data collection:
○ Implementation as part of the curriculum in high
school classes
○ Demographic and school data with external measures
● Game analytics: How is the game being used by
students? Improvements, enjoyment…
● Model performance evaluation: How are the
models working? What do teachers think about
models?
● Psychometric evaluation: Are our models
correlated to other external tests, e.g. geometry
traditional tests or spatial reasoning validated
instruments
It’s time to say goodbye
But let’s conclude before that
Conclusions
● Alternative assessment method with great potential
○ Focus on complex constructs, can focus on the process (on only outcomes), is less stressful
and more enjoyable for students
● Highly challenging and multidisciplinary field, main problems:
○ Cost, scalability and generalization across GBA tools, model validity, trustworthiness, and
teacher literacy
● Some companies are already using GBA as part pre-hiring
● Difference between Assessment and assessment
● Opportunities for collaboration!
Thank you!
José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es

Contenu connexe

Similaire à Learning Analytics for the Evaluation of Competencies and Behaviors in Serious Games

2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...
2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...
2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...eMadrid network
 
Introduction to Machine learning
Introduction to Machine learningIntroduction to Machine learning
Introduction to Machine learningKnoldus Inc.
 
Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...Bart Rienties
 
How AI will change the way you help students succeed - SchooLinks
How AI will change the way you help students succeed - SchooLinksHow AI will change the way you help students succeed - SchooLinks
How AI will change the way you help students succeed - SchooLinksKatie Fang
 
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...KnowledgeGraph
 
林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning台灣資料科學年會
 
Dr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based LearningDr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based LearningCITE
 
Chapter01.ppt
Chapter01.pptChapter01.ppt
Chapter01.pptbutest
 
Learning Analytics and Serious Games: Trends and Considerations
Learning Analytics and Serious Games: Trends and ConsiderationsLearning Analytics and Serious Games: Trends and Considerations
Learning Analytics and Serious Games: Trends and ConsiderationsLaila Shoukry
 
Umap17 learner modelingforintegrationskills_yunhuang
Umap17 learner modelingforintegrationskills_yunhuangUmap17 learner modelingforintegrationskills_yunhuang
Umap17 learner modelingforintegrationskills_yunhuangYun Huang
 
ToTCOOP+i Lesson plan unit_3_final_version_en
ToTCOOP+i Lesson plan unit_3_final_version_enToTCOOP+i Lesson plan unit_3_final_version_en
ToTCOOP+i Lesson plan unit_3_final_version_enToTCOOPiTech
 
On Evaluating Serious Games
On Evaluating Serious GamesOn Evaluating Serious Games
On Evaluating Serious Gamesstemrpm
 
On Evaluating The 80 Days Geography Game
On Evaluating The 80 Days Geography GameOn Evaluating The 80 Days Geography Game
On Evaluating The 80 Days Geography Gamearammann
 
On Evaluating Serious Games
On Evaluating Serious GamesOn Evaluating Serious Games
On Evaluating Serious Gamesstemprm
 
2webquest Srkthomas Statistics
2webquest Srkthomas Statistics2webquest Srkthomas Statistics
2webquest Srkthomas Statisticssrthomas
 
Descriptive Statistics in the NCAA
Descriptive Statistics in the NCAADescriptive Statistics in the NCAA
Descriptive Statistics in the NCAAsrkthomas
 
Webquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAAWebquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAAsrthomas
 

Similaire à Learning Analytics for the Evaluation of Competencies and Behaviors in Serious Games (20)

2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...
2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...
2021_06_30 «Collaborative Design with Classroom Teachers for Impactful Game-B...
 
Introduction to Machine learning
Introduction to Machine learningIntroduction to Machine learning
Introduction to Machine learning
 
Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...Investigating learning strategies in a dispositional learning analytics conte...
Investigating learning strategies in a dispositional learning analytics conte...
 
How AI will change the way you help students succeed - SchooLinks
How AI will change the way you help students succeed - SchooLinksHow AI will change the way you help students succeed - SchooLinks
How AI will change the way you help students succeed - SchooLinks
 
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...Report on the First Knowledge Graph Reasoning Challenge  2018 -Toward the eXp...
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXp...
 
林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning林守德/Practical Issues in Machine Learning
林守德/Practical Issues in Machine Learning
 
Dr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based LearningDr. David Gibson: Challenge-Based Learning
Dr. David Gibson: Challenge-Based Learning
 
Applying learning analytics in serious games
Applying learning analytics in serious games Applying learning analytics in serious games
Applying learning analytics in serious games
 
Chapter01.ppt
Chapter01.pptChapter01.ppt
Chapter01.ppt
 
Learning Analytics and Serious Games: Trends and Considerations
Learning Analytics and Serious Games: Trends and ConsiderationsLearning Analytics and Serious Games: Trends and Considerations
Learning Analytics and Serious Games: Trends and Considerations
 
Week 1.pdf
Week 1.pdfWeek 1.pdf
Week 1.pdf
 
EDPC605_7&8
EDPC605_7&8EDPC605_7&8
EDPC605_7&8
 
Umap17 learner modelingforintegrationskills_yunhuang
Umap17 learner modelingforintegrationskills_yunhuangUmap17 learner modelingforintegrationskills_yunhuang
Umap17 learner modelingforintegrationskills_yunhuang
 
ToTCOOP+i Lesson plan unit_3_final_version_en
ToTCOOP+i Lesson plan unit_3_final_version_enToTCOOP+i Lesson plan unit_3_final_version_en
ToTCOOP+i Lesson plan unit_3_final_version_en
 
On Evaluating Serious Games
On Evaluating Serious GamesOn Evaluating Serious Games
On Evaluating Serious Games
 
On Evaluating The 80 Days Geography Game
On Evaluating The 80 Days Geography GameOn Evaluating The 80 Days Geography Game
On Evaluating The 80 Days Geography Game
 
On Evaluating Serious Games
On Evaluating Serious GamesOn Evaluating Serious Games
On Evaluating Serious Games
 
2webquest Srkthomas Statistics
2webquest Srkthomas Statistics2webquest Srkthomas Statistics
2webquest Srkthomas Statistics
 
Descriptive Statistics in the NCAA
Descriptive Statistics in the NCAADescriptive Statistics in the NCAA
Descriptive Statistics in the NCAA
 
Webquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAAWebquest Descriptive Statistics of the NCAA
Webquest Descriptive Statistics of the NCAA
 

Plus de MIT

Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...
Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...
Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...MIT
 
Investigación en Learning Analytics vs. Learning Analytics en la Universidad
Investigación en Learning Analyticsvs.Learning Analytics en la UniversidadInvestigación en Learning Analyticsvs.Learning Analytics en la Universidad
Investigación en Learning Analytics vs. Learning Analytics en la UniversidadMIT
 
Ph.D. Defense - Dr. Jose A. Ruiperez Valiente
Ph.D. Defense - Dr. Jose A. Ruiperez Valiente Ph.D. Defense - Dr. Jose A. Ruiperez Valiente
Ph.D. Defense - Dr. Jose A. Ruiperez Valiente MIT
 
A Data-driven Method for the Detection of Close Submitters in Online Learning...
A Data-driven Method for the Detection of Close Submitters in Online Learning...A Data-driven Method for the Detection of Close Submitters in Online Learning...
A Data-driven Method for the Detection of Close Submitters in Online Learning...MIT
 
Using Multiple Accounts for Harvesting Solutions in MOOCs
Using Multiple Accounts for Harvesting Solutions in MOOCs Using Multiple Accounts for Harvesting Solutions in MOOCs
Using Multiple Accounts for Harvesting Solutions in MOOCs MIT
 
Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...
Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...
Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...MIT
 

Plus de MIT (6)

Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...
Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...
Multiplatform MOOC Analytics: Comparing Global and Regional Patterns in edX a...
 
Investigación en Learning Analytics vs. Learning Analytics en la Universidad
Investigación en Learning Analyticsvs.Learning Analytics en la UniversidadInvestigación en Learning Analyticsvs.Learning Analytics en la Universidad
Investigación en Learning Analytics vs. Learning Analytics en la Universidad
 
Ph.D. Defense - Dr. Jose A. Ruiperez Valiente
Ph.D. Defense - Dr. Jose A. Ruiperez Valiente Ph.D. Defense - Dr. Jose A. Ruiperez Valiente
Ph.D. Defense - Dr. Jose A. Ruiperez Valiente
 
A Data-driven Method for the Detection of Close Submitters in Online Learning...
A Data-driven Method for the Detection of Close Submitters in Online Learning...A Data-driven Method for the Detection of Close Submitters in Online Learning...
A Data-driven Method for the Detection of Close Submitters in Online Learning...
 
Using Multiple Accounts for Harvesting Solutions in MOOCs
Using Multiple Accounts for Harvesting Solutions in MOOCs Using Multiple Accounts for Harvesting Solutions in MOOCs
Using Multiple Accounts for Harvesting Solutions in MOOCs
 
Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...
Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...
Diseño e Implementación de un Módulo de Analítica de Aprendizaje en la Plataf...
 

Dernier

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...ZurliaSoop
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxPooja Bhuva
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxUmeshTimilsina1
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17Celine George
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17Celine George
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxJisc
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxRamakrishna Reddy Bijjam
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxEsquimalt MFRC
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Pooja Bhuva
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxmarlenawright1
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024Elizabeth Walsh
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibitjbellavia9
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...Amil baba
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jisc
 

Dernier (20)

TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptxExploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
Exploring_the_Narrative_Style_of_Amitav_Ghoshs_Gun_Island.pptx
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
NO1 Top Black Magic Specialist In Lahore Black magic In Pakistan Kala Ilam Ex...
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 

Learning Analytics for the Evaluation of Competencies and Behaviors in Serious Games

  • 1. Learning Analytics for the Evaluation of Competencies and Behaviors in Serious Games José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es
  • 6. Main contributors to this research José A. Ruipérez-Valiente BEng Telecomunications Systems (UCAM), MEng Telecomunications, MSc y PhD Telematics (UC3M), Postdoc (MIT) 6 years working in learning analytics across many objectives and contexts Currently focused in large scale trends in MOOCs and game-based assessment Juan de la Cierva Researcher at UMU and affiliate at MIT Playful Journey Lab YJ (Yoon Jeon) Kim Executive Director Playful Journey Lab located at MIT Open Learning Assessment scientist Focus on games and playful approaches for assessment
  • 7. Topics related to this talk - Games for Learning - Game-based Assessment - Learning Analytics - … and Design (which is transverse to numerous areas and applications)
  • 8. Motivations Why and how are we doing this?
  • 9. A game is a voluntary interactive activity, in which one or more players follow rules that constrain their behavior, enacting an artificial conflict that ends in a quantifiable outcome. ~Eric Zimmerman (2004)
  • 10. Why Games? ● Games are “flexible enough for players to inhabit and explore through meaningful play” (Salen & Zimmerman) (deep learning) ● Majority of children grow up playing games ● Learners have more freedom related to how much effort they choose to expend, how often they fail and try again (Osterweil, 2014) (real life)
  • 11. Assessment is a process of reasoning from evidence. Therefore, an assessment is a tool designed to observe students’ behavior and produce data that can be used to draw reasonable inferences about what students know. ~ Bob Mislevy
  • 12. Why Games for Assessment? ● Games incorporate multiple pathways to solution(s) where learners can make meaningful choices and demonstrate multiple ways of solving problems ● Use complex and authentic problems → hard-to-measure constructs o We need to assess 21st century skills ● Games are motivating and engaging → accurate assessment (Sundre & Wise, 2003) ● It doesn’t feel like assessment (i.e. stealth assessment) o Less stresful situations for students
  • 14. The Broad view of Learning Analytics …collection, analysis and reporting of data about learners and their contexts, for purposes of understanding and optimising learning and the environments in which it occurs… Source: First Learning Analytics and Knowledge Conference
  • 15. The Learning Analytics data-driven Process Raw data generation Feature engineering Visualizations Recommendation Report generator Meaningful features Which raw data is necessary? What to do with the processed data? What to obtain and How to do it? Technology as an engine to enhance learning Exploration, Correlation, clustering, prediction, causes… Learning environments Conclusions generate feedback and close the LA loop
  • 16. Game-based Assessment Design, model implementation and evaluation process
  • 17. Design, Development and Evaluation Process of Game-based Assessment
  • 18. Design ● Design and implementation of game system ○ Game mechanics that can generate evidence from the constructs and a data infrastructure that effectively stores that evidence ○ The most iterative step of the process with very frequent playtesting 1. Start with paper prototypes 2. Move to drafty digital prototypes 3. End with advanced digital prototypes ● Data collection ○ Diverse audiences and contexts ○ Very important for game mechanics and tech side ○ Face-to-face playtesting ○ Amazon MTurk
  • 20. Amazon Mechnical Turk as part of the design process
  • 21. Amazon Mechnical Turk as part of the design process
  • 22. Balance between Game Design and Assessment Design
  • 23. Meet Shadowspect! More at https://shadowspect.org/
  • 24. Model development ● Implementation of the assessment machinery: ○ Process of turning evidence into constructs ○ Content knowledge assessment: Following a traditional Evidence-centered Design ○ Cognitive and behavioral assessment: Combining knowledge engineering process and ML with expert labelling ● Data collection: ○ Same high school context, age, and settings ○ Two sessions of one hour each ○ Around 10 US high school classes and more than 200 hundred students
  • 25. Model development: Content knowledge assessment Implementation via Evidence-centered Design
  • 26.
  • 27. Common Core Geometry Standards ● Competency model: We focus on the common core geometry standards o MG.A.1: Use geometric shapes, their measures, and their properties to describe objects (e.g., modeling a tree trunk or a human torso as a cylinder) o GMD.B.4: Identify the shapes of two-dimensional cross-sections of three- dimensional objects, and identify three-dimensional objects generated by rotations of two-dimensional objects o CO.A.5: Given a geometric figure and a rotation, reflection, or translation, draw the transformed figure o CO.B.6: Use geometric descriptions of rigid motions to transform figures and to predict the effect of a given rigid motion on a given figure
  • 28. ECD Summary for Geometry Common Standards Assessmement ● Collaboration with geometry specialist, game designer and assessment designer ○ Evidence model: We generate puzzles that generate evidence from the Geometry Common Standards ○ Task model: We map the relationship (none, weak or strong) of each puzzle with the common standard ○ Assembly model: We put all the evidence from a student together to assess their content knowledge ○ Presentation & Delivery model: Reports and dashboards by student/standard. Difficulty by exercise Puzzle MG.A.1 GMD.B.4 … Puzzle 1 Weak Weak … Puzzle 2 None None … … … … … Student Puzzle 1 Puzzle 2 … Student 1 OK, # 1 attempt OK, # 3 attempts … Student 1 NA Fail, # 5 attempt … … … … …
  • 29. Our simplified case scenario right now Evidence Standards map
  • 30. Model development: Cognitive and Behavioral Assessment Implementation via a Learning Analytics Knowledge Engineering Process
  • 31. Knowledge Engineering Process ● We acquire knowledge about the construct that we want to measure 1. Reading about the construct 2. Conducting interview with experts 3. Reviewing related scientific literature ● We algorithmically implement features that use the data/evidence that can inform the construct that we want to measure
  • 32. Our simplified case scenario now updates to: Evidence Constructs map Data Features data schema inform algorithms
  • 33. Efficiency construct - Efficiency is the ability to do things well, successfully, and without waste. It often specifically comprises the capability of a specific application of effort to produce a specific outcome with a minimum amount or quantity of waste, expense, or unnecessary effort (Wikipedia)
  • 34. Evidence in Shadowspect related to efficiency ● Ability to do things well: ○ Solving puzzles correctly ● Expense or effort: ○ Time invested ○ Number of attempts to solve a problem
  • 35. Mapping evidence into necessary data in Shadowspect ● We need: puzzles solved correctly, time invested and attempts ○ Necessary types of events for that: ■ puzzle_start (timestamp, student, puzzle_id) ■ leave_to_menu (timestamp, student, puzzle_id) ■ puzzle_attempt (timestamp, student, puzzle_id, correct)
  • 36. How does data in Shadowspect actually looks like?
  • 37. Algorithm to compute features from data (pseudo-code) # note this is a VERY simplified version that do not aim to be the most effective implementation of this algorithm computeEfficiencyFeatures(student): student_events = getStudentEvents(student) correct_exercises_list = list(); number_attempts = 0; total_time = 0; puzzle_started_event = None for event in student_events: if(event[‘type’] == ‘puzzle_started’) then puzzle_started_event = event elif(event[‘type’] == ‘leave_to_menu’) then total_time += (event[‘timestamp’] - puzzle_started_event[‘timestamp’]) puzzle_started_event = None elif(event[‘type’] == ‘puzzle_attempt’): number_attempts += 1 if(event[‘correct’] == True) then correct_exercises_list.add(event[‘puzzle_id’]) attempts_per_correct_problem = length(unique(correct_exercises_list))/number_attempts time_per_correct_problem = length(unique(correct_exercises_list))/total_time return(attempts_per_correct_problem, time_per_correct_problem)
  • 38. The previous general scenario Evidence Constructs map Data Features data schema inform algorithms
  • 39. Model for efficiency in Shadowspect Evidence ● Correct puzzles ● Time ● Number attempts Data ● puzzle_start ● leave_to_menu ● puzzle_attempt data schema inform computeEfficiency Features(student) Construct Efficiency Features attempts_per_correct_problem time_per_correct_problem map
  • 40. Model development: Cognitive and Behavioral Assessment Implementation via Learning Analytics with Experts and Machine Learning
  • 41. Expert Labelling and Machine Learning Process ● Two or more experts label text or video replays that can be visually assessed ○ We divide all level interactions in replays that can be labelled ○ Experts review replays and label them for each construct that we want to measure ■ They might use rubrics and we are looking for expert inter-agreement (Cohen’s kappa) ○ We implement a supervised machine learning assessment model based on these labels ● Challenges here include achieving good inter-agreement, technical logistics, replay resolution and final implementation of the ML model Example of simplified text replay: 1. Start puzzle – 2. Create shape square – 3. Move square – 4. Create cone 5. Rotate cone – 6. Change perspective – 7. Snapshot – 8. Move cone – 9. Submit – 10 Puzzle correct
  • 42. Expert Labelling and Machine Learning Process Evidence Constructsmap Data Features data schema inform algorithms expert assessment ML/AI
  • 43. Evaluation ● We are not here yet! Future plans: ● Data collection: ○ Implementation as part of the curriculum in high school classes ○ Demographic and school data with external measures ● Game analytics: How is the game being used by students? Improvements, enjoyment… ● Model performance evaluation: How are the models working? What do teachers think about models? ● Psychometric evaluation: Are our models correlated to other external tests, e.g. geometry traditional tests or spatial reasoning validated instruments
  • 44. It’s time to say goodbye But let’s conclude before that
  • 45. Conclusions ● Alternative assessment method with great potential ○ Focus on complex constructs, can focus on the process (on only outcomes), is less stressful and more enjoyable for students ● Highly challenging and multidisciplinary field, main problems: ○ Cost, scalability and generalization across GBA tools, model validity, trustworthiness, and teacher literacy ● Some companies are already using GBA as part pre-hiring ● Difference between Assessment and assessment ● Opportunities for collaboration!
  • 46. Thank you! José A. Ruipérez Valiente — @JoseARuiperez — jruiperez@um.es

Notes de l'éditeur

  1. begins by identifying what should be assessed in terms of knowledge, skills, or other learner attributes. These variables cannot be observed directly, so behaviors and performances that demonstrate these variables need to be identified instead. The next step is determining the types of tasks or situations that would draw out such behaviors or performances. Example around simple math knowledge in a game: