This document discusses using player rating systems to balance task difficulty in human computation games. It proposes treating tasks as players and using player rating algorithms to sequence tasks to match a player's changing skill level over time. The study tests whether the bipartite graph structure between players and tasks negatively affects the prediction accuracy of player rating algorithms. It finds that bipartiteness does not affect accuracy, and that unbalanced graphs with "super vertices" may even improve accuracy by providing more information. The approach shows promise for difficulty balancing, but requires further testing on retention and with different games.
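To make the core idea concrete, here is a minimal Elo-style sketch (not from the paper; all names and constants are illustrative) in which each task receives a rating and is updated exactly like a player:

```python
# Minimal Elo-style sketch: tasks are rated exactly like players.
# All names and constants here are illustrative, not from the paper.

def expected_score(rating_a, rating_b):
    """Probability that A beats B under the Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))

def update(player_rating, task_rating, player_won, k=32.0):
    """Treat a player's attempt at a task as a match between them.

    player_won is 1.0 if the player solved the task, 0.0 otherwise.
    Returns the new (player, task) ratings.
    """
    expected = expected_score(player_rating, task_rating)
    delta = k * (player_won - expected)
    return player_rating + delta, task_rating - delta

# Example: a 1500-rated player fails a 1500-rated task.
player, task = update(1500.0, 1500.0, player_won=0.0)
print(player, task)  # player drops to 1484.0, task rises to 1516.0
```

Failed tasks gain rating and solved tasks lose it, so task ratings converge toward their difficulty, which is what allows them to be sequenced against a player's current skill.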
Player Rating Algorithms for Balancing Human Computation Games: Testing the Effect of Bipartiteness
1. Player Rating Systems for Balancing Human Computation Games: testing the effect of bipartiteness
Seth Cooper, Sebastian Deterding, Theo Tsapakos
DiGRA 2016, August 6, 2016
CC BY
29. research question
does a bipartite (user-task rather than player-player) match graph negatively affect the prediction accuracy of player rating algorithms? does graph balancedness affect accuracy?
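The structural difference behind this question can be shown in a few lines (a sketch assuming the networkx library; the toy graphs are made up): chess matches form a player-player graph that can contain odd cycles, while player-task matches form a graph that is bipartite by construction:

```python
import networkx as nx

# Chess-style match graph: players play each other directly.
chess = nx.Graph([("alice", "bob"), ("bob", "carol"), ("carol", "alice")])

# Human computation game: every match pairs a player with a task,
# so the match graph is bipartite by construction.
hcg = nx.Graph([("alice", "task1"), ("bob", "task1"), ("alice", "task2")])

print(nx.is_bipartite(chess))  # False: the 3-cycle has odd length
print(nx.is_bipartite(hcg))    # True: players and tasks form two sides
```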
38. main contributions
• Identified 4 challenges to difficulty balancing in human computation games, crowdsourcing, and UGC
• Introduced content sequencing through adapting player rating algorithms as a novel approach
• Identified bipartiteness of the user-task graph as a potential issue
• Found that bipartiteness does not affect the prediction accuracy of Elo, Glicko-2, and TrueSkill in chess matches or the human computation game Paradox (illustrated in the sketch below)
• Found that unbalanced graphs improve prediction accuracy, presumably because “super vertices” (highly connected players) provide more information
• Provided first support that our approach is viable
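One plausible reading of "prediction accuracy", sketched below under assumed defaults (initial rating 1500, Elo with K = 32; the study's exact evaluation details may differ): replay the match log in order, predict each outcome from the current ratings before updating them, and score the fraction predicted correctly:

```python
def prediction_accuracy(matches, k=32.0):
    """Replay (player, task, player_won) records in chronological order,
    predicting each outcome from the current ratings before updating
    them. Illustrative only: initial ratings, K, and tie handling are
    assumptions, not the study's exact setup."""
    ratings = {}  # players and tasks share one rating table
    correct = 0
    for player, task, player_won in matches:
        rp = ratings.get(player, 1500.0)
        rt = ratings.get(task, 1500.0)
        expected = 1.0 / (1.0 + 10 ** ((rt - rp) / 400.0))
        correct += int((expected >= 0.5) == bool(player_won))
        delta = k * (player_won - expected)
        ratings[player], ratings[task] = rp + delta, rt - delta
    return correct / len(matches)

log = [("alice", "task1", 1.0), ("bob", "task1", 0.0), ("alice", "task2", 1.0)]
print(prediction_accuracy(log))  # 2 of 3 predicted correctly on this toy log
```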
39. limitations & future work I
• Approach requires previous/initial data
  • Use super-users to provide initial data
  • Use “calibration” tasks in tutorials
  • Use mixed-method data to identify skill & difficulty indicators, data & machine learning to validate & extract additional indicators
• Current algorithms only compute win/loss/draw
  • Graded success measures could improve accuracy and learning speed (see the sketch after this list)
• Study trained on large data sets (10,000, 37 edges)
  • Testing learning speed of algorithms w/ current default retention in human computation games
• Study tested only one human computation game
  • Replication with multiple games
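The graded-success idea flagged above could look like the following sketch; the Elo-style update is standard, but the mapping from game performance to a score in [0, 1] is an assumption, not the authors' design:

```python
def graded_update(player_rating, task_rating, score, k=32.0):
    """Elo-style update with a graded outcome instead of win/loss/draw.

    score is in [0, 1], e.g. the fraction of a task's subgoals solved
    or a normalized puzzle score; that mapping is an assumption here.
    """
    expected = 1.0 / (1.0 + 10 ** ((task_rating - player_rating) / 400.0))
    delta = k * (score - expected)
    return player_rating + delta, task_rating - delta

# A partial success (70% of subgoals) moves ratings less than a full win.
print(graded_update(1500.0, 1500.0, score=0.7))  # (1506.4, 1493.6)
```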
40. limitations & future work II
• Study didn’t test direct effect on retention
  • Follow-up user study
• Task pool might not contain tasks of best-fitting difficulty (similar to an empty bar in multiplayer games; see the sketch after this list)
  • Procedural content generation to generate training/filler tasks
• Many human computation tasks don’t vary much in difficulty
  • Expand matching approach to other factors like curiosity/variety
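The empty-pool problem above can be illustrated with a hedged sketch of rating-based task selection (the target probability and the Elo success model are assumptions): pick the pool task whose expected success chance for this player is closest to a target; with a sparse pool, even the best fit may be far off, which is where procedurally generated filler tasks would help:

```python
def pick_task(player_rating, task_ratings, target=0.5):
    """Choose the task whose expected success probability for this
    player is closest to the target (0.5 = an even challenge). The
    target value and the Elo success model are illustrative assumptions."""
    def expected(task_rating):
        return 1.0 / (1.0 + 10 ** ((task_rating - player_rating) / 400.0))
    return min(task_ratings, key=lambda t: abs(expected(t) - target))

# Sparse pool: nothing close to an even match for a 1500-rated player.
pool = [900.0, 1000.0, 2100.0]
print(pick_task(1500.0, pool))  # 1000.0 -- best fit, but still far too easy
```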