Discovering Playing Patterns:
Time Series Clustering of Free-To-Play Game Data
Alain Saas, Anna Guitart and África Periáñez (Silicon Studio)
IEEE CIG 2016 Santorini
21 September, 2016
About us
• Who are we?
◦ Game studio and graphics
middleware company based in Tokyo
◦ Research project to provide Game
Data Science as a service
◦ Goals: predict player behavior, scale
to big data and intuitive result
visualization
• Which data?
◦ RPG free-to-play games
◦ TS of two games
◦ TS of in-app purchases and activity
behavioral data
2 of 17
Challenge
Unsupervised clustering of Time Series of player activity
• Why?
◦ discover temporal player patterns
◦ evaluation of game events and business diagnosis
◦ assess common characteristics of players belonging to the same cluster
• How?
1. representation techniques: reducing the high dimensionality of TS
2. similarity measures for free-to-play game data
3. hierarchical clustering
4. visual validation of the results
3 of 17
Representation methods
Symbolic Aggregate Approximation
Trend Extraction
Discrete Wavelet Transform
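As an illustration of the first representation method, here is a minimal Python sketch of Symbolic Aggregate Approximation. It assumes the standard three steps (z-normalisation, piecewise aggregate approximation into w segments, discretisation with Gaussian breakpoints for an alphabet of size a). The function name `sax` and the handling of constant series are assumptions made for this example, not details taken from the slides.

```python
from bisect import bisect_right
from statistics import NormalDist, fmean, pstdev


def sax(series, w, a):
    """Return a SAX word of length w over an alphabet of size a."""
    n = len(series)
    if not 1 <= w <= n:
        raise ValueError("w must be between 1 and len(series)")
    if not 2 <= a <= 26:
        raise ValueError("a must be between 2 and 26")
    # 1. z-normalise; a constant series maps to all zeros (assumption).
    mu, sigma = fmean(series), pstdev(series)
    z = [(v - mu) / sigma if sigma > 0 else 0.0 for v in series]
    # 2. Piecewise aggregate approximation: mean of each of the w segments.
    paa = [fmean(z[k * n // w:(k + 1) * n // w]) for k in range(w)]
    # 3. Breakpoints that cut N(0, 1) into a equiprobable regions.
    breakpoints = [NormalDist().inv_cdf(k / a) for k in range(1, a)]
    return "".join(chr(ord("a") + bisect_right(breakpoints, v)) for v in paa)


# A series that is low for the first half and high for the second half.
print(sax([0, 0, 0, 0, 10, 10, 10, 10], w=2, a=2))  # prints "ab"
```

The word length w controls how much the series is shortened, and the alphabet size a controls how coarsely values are binned.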
4 of 17
Similarity measures
Dynamic Time Warping
$$\mathrm{DTW}(X, Y) = \min_{r \in M} \sum_{m=1}^{M} \left| x_{i_m} - y_{j_m} \right|$$
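The minimisation over warping paths is usually computed by dynamic programming. The sketch below is a plain, unconstrained version with the absolute difference as local cost; the function name `dtw_distance` is an assumption for this example, and no warping window is applied.

```python
def dtw_distance(x, y):
    """Dynamic time warping cost with |x_i - y_j| as the local cost."""
    n, m = len(x), len(y)
    inf = float("inf")
    # cost[i][j] is the minimal accumulated cost of aligning x[:i] with y[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(x[i - 1] - y[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance in x only
                cost[i][j - 1],      # advance in y only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


# Same activity burst, shifted by one day: the warping absorbs the shift.
burst_early = [0, 0, 5, 5, 0, 0, 0]
burst_late = [0, 0, 0, 5, 5, 0, 0]
print(dtw_distance(burst_early, burst_late))  # prints 0.0
```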
Correlation-based measure
$$\mathrm{COR}(X, Y) = \frac{\sum_{n=1}^{N} (x_n - \bar{X})(y_n - \bar{Y})}{\sqrt{\sum_{n=1}^{N} (x_n - \bar{X})^2} \, \sqrt{\sum_{n=1}^{N} (y_n - \bar{Y})^2}}$$
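A small sketch of the correlation coefficient in plain Python follows. Turning it into a dissimilarity as 1 - COR is an assumption made for this example (other conversions exist), and the function names are hypothetical.

```python
import math


def cor(x, y):
    """Pearson correlation between two equal-length series."""
    if len(x) != len(y):
        raise ValueError("series must have the same length")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    num = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mean_x) ** 2 for a in x)) * math.sqrt(
        sum((b - mean_y) ** 2 for b in y)
    )
    if den == 0:
        raise ValueError("correlation is undefined for a constant series")
    return num / den


def cor_dissimilarity(x, y):
    """Assumed conversion: 0 for perfectly correlated series, 2 for opposite."""
    return 1.0 - cor(x, y)
```

Because the coefficient ignores scale, a player with twice the daily play time but the same weekly shape gets a dissimilarity close to zero.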
Temporal Correlation and Raw Values
Behaviors measure
$$\mathrm{CORT}(X, Y) = \frac{\sum_{n=1}^{N-1} (x_{n+1} - x_n)(y_{n+1} - y_n)}{\sqrt{\sum_{n=1}^{N-1} (x_{n+1} - x_n)^2} \, \sqrt{\sum_{n=1}^{N-1} (y_{n+1} - y_n)^2}}$$
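The temporal correlation coefficient can be sketched the same way, working on day-to-day differences instead of deviations from the mean. The function name `cort` and the error raised for flat series are assumptions for this example.

```python
import math


def cort(x, y):
    """Temporal correlation: cosine similarity of first differences."""
    if len(x) != len(y):
        raise ValueError("series must have the same length")
    dx = [x[n + 1] - x[n] for n in range(len(x) - 1)]
    dy = [y[n + 1] - y[n] for n in range(len(y) - 1)]
    num = sum(a * b for a, b in zip(dx, dy))
    den = math.sqrt(sum(a * a for a in dx)) * math.sqrt(sum(b * b for b in dy))
    if den == 0:
        raise ValueError("CORT is undefined when a series never changes")
    return num / den
```

Values near 1 mean the two series rise and fall together, and values near -1 mean they move in opposite directions.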
Complexity-Invariant Distance
measure
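$$\mathrm{CID}(X, Y) = \mathrm{dist}(X, Y) \cdot \mathrm{CF}(X, Y)$$
where CF is the complexity correction factor
$$\mathrm{CF}(X, Y) = \frac{\max(\mathrm{CE}(X), \mathrm{CE}(Y))}{\min(\mathrm{CE}(X), \mathrm{CE}(Y))}$$
and CE is the complexity estimate
$$\mathrm{CE}(X) = \sqrt{\sum_{n=1}^{N-1} (x_n - x_{n+1})^2}$$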
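A minimal sketch of the complexity-invariant distance follows, assuming the Euclidean distance as the base dist(X, Y). How to treat flat series, whose complexity estimate is zero, is not covered on the slide; the choices below (factor 1 when both series are flat, infinite factor when only one is) are assumptions for this example.

```python
import math


def complexity_estimate(x):
    """CE(X): length of the series when drawn as a line over unit time steps."""
    return math.sqrt(sum((x[n] - x[n + 1]) ** 2 for n in range(len(x) - 1)))


def euclidean(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def cid(x, y, dist=euclidean):
    """Base distance scaled by the complexity correction factor CF."""
    if len(x) != len(y):
        raise ValueError("series must have the same length")
    ce_x, ce_y = complexity_estimate(x), complexity_estimate(y)
    high, low = max(ce_x, ce_y), min(ce_x, ce_y)
    if high == 0:
        cf = 1.0       # both series flat (assumption)
    elif low == 0:
        cf = math.inf  # only one series flat (assumption)
    else:
        cf = high / low
    d = dist(x, y)
    return d * cf if d > 0 else 0.0
```

The factor is always at least 1, so series with different amounts of variation are pushed further apart than the base distance alone would put them.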
5 of 17
Similarity measure comparison
[Figure: four pairwise comparisons of similarity measures. Panels: Euclidean vs. Correlation; Correlation vs. Complexity-Invariant Distance; Dynamic Time Warping vs. Correlation; Correlation vs. Discrete Wavelet Transform]
6 of 17
Comparison of clustering methods
• DTW Dynamic Time Warping
◦ similar player profiles with a
shift on the time axis
◦ different patterns but at
different scale
• DWT Discrete Wavelet Transform
◦ dimensionality reduction
◦ frequency of the series
• SAX Symbolic Aggregate
Approximation
◦ parameters w,a
• COR Correlation
◦ similar geometric and
synchronous profiles
◦ sensitive to noisy data and
outliers
• CORT Temporal Correlation
◦ similar to COR but with time
consideration?
• CID Complexity-Invariant distance
◦ similar complexity patterns
◦ good for sparse time series
• COR+trend Correlation and trend extraction
◦ addresses COR’s sensitivity to noise
◦ does not work well with sparse time series
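The slides do not say which trend extraction technique is used. As one possible illustration, the sketch below smooths a daily series with a 7-day moving average before any correlation is computed; the method, the window length, and the function name `moving_average_trend` are all assumptions for this example.

```python
def moving_average_trend(series, window=7):
    """Smooth a daily series with a sliding mean over `window` days.

    The output has len(series) - window + 1 points because only complete
    windows are averaged.
    """
    if not 1 <= window <= len(series):
        raise ValueError("window must be between 1 and len(series)")
    return [
        sum(series[i:i + window]) / window
        for i in range(len(series) - window + 1)
    ]


print(moving_average_trend([1, 2, 3, 4, 5], window=3))  # prints [2.0, 3.0, 4.0]
```

Averaging over a full week removes the weekday pattern and single-day spikes, which is one way to reduce the noise sensitivity noted for COR. On sparse series most windows average to nearly the same small value, which is consistent with the limitation listed above.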
7 of 17
Hierarchical clustering
Agglomerative Ward method:
Each merge joins the two clusters that lead to a minimum increase of total within-cluster variance
Single Linkage
Complete Linkage
Average Linkage
Centroid Method
Ward Method
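The Ward criterion can be sketched in a few lines of plain Python. This toy version works directly on feature vectors with Euclidean geometry and stops when k clusters remain; it is an assumption-laden illustration (hypothetical function name `ward_clusters`, quadratic search at every step), not the implementation used for the results in these slides.

```python
def ward_clusters(points, k):
    """Agglomerative clustering with Ward's criterion, down to k clusters.

    Returns the clusters as sorted lists of point indices.
    """
    # Each cluster is (size, centroid, member indices).
    clusters = [(1, list(p), [i]) for i, p in enumerate(points)]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                n_i, c_i, _ = clusters[i]
                n_j, c_j, _ = clusters[j]
                sq_dist = sum((a - b) ** 2 for a, b in zip(c_i, c_j))
                # Increase in within-cluster sum of squares caused by the merge.
                increase = n_i * n_j / (n_i + n_j) * sq_dist
                if best is None or increase < best[0]:
                    best = (increase, i, j)
        _, i, j = best
        n_i, c_i, m_i = clusters[i]
        n_j, c_j, m_j = clusters[j]
        total = n_i + n_j
        centroid = [(n_i * a + n_j * b) / total for a, b in zip(c_i, c_j)]
        merged = (total, centroid, m_i + m_j)
        clusters = [c for idx, c in enumerate(clusters) if idx not in (i, j)]
        clusters.append(merged)
    return sorted(sorted(members) for _, _, members in clusters)


print(ward_clusters([[0, 0], [0, 1], [10, 10], [10, 11]], k=2))
# prints [[0, 1], [2, 3]]
```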
8 of 17
Our data
Time series measured per user per day.
Game Activity
Behavioral data
Time: The amount of time spent in the game
Sessions: The total number of playing sessions
Actions: The total number of actions performed
In-app Sales
Purchase: The total amount of in-app purchases
9 of 17
Data selection, constraints
Time Series: Multi-dimensional data
⇒ selection of period P
• in our data weekly game events
• period P of length 21 days
• played time → active users
min connections 6/7 days a week
• purchases → paying users
at least one purchase in period P
• players alive during period P
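The selection rules above can be sketched as simple filters over a 21-day window. Reading "min connections 6/7 days a week" as "at least 6 days with play time in every week of the period" is an assumption, as are the function names and the use of a positive value to mean a connection or a purchase.

```python
PERIOD_DAYS = 21  # length of period P from the slide


def is_active(daily_play_time, min_days_per_week=6):
    """True if the player connected on enough days in every week of P."""
    if len(daily_play_time) != PERIOD_DAYS:
        raise ValueError("expected one value per day of period P")
    weeks = [daily_play_time[i:i + 7] for i in range(0, PERIOD_DAYS, 7)]
    return all(
        sum(1 for minutes in week if minutes > 0) >= min_days_per_week
        for week in weeks
    )


def is_paying(daily_purchases):
    """True if the player made at least one purchase during P."""
    return any(amount > 0 for amount in daily_purchases)


regular = [30] * PERIOD_DAYS
irregular = [0, 0] + [30] * (PERIOD_DAYS - 2)  # misses two days in week 1
print(is_active(regular), is_active(irregular))  # prints True False
```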
10 of 17
Datasets and tests
Game            | Data              | Technique | Clusters | Date range
Age of Ishtaria | Daily played time | COR-trend | 8        | Oct 2014 - Jan 2016
Age of Ishtaria | Daily purchase    | CID       | 5        | Oct 2014 - Jan 2016
Grand Sphere    | Daily played time | COR-trend | 8        | Jun 2015 - Mar 2016
11 of 17
Clustering time series of time played
1. representation method: trend extraction
2. similarity measure: correlation
3. hierarchical clustering: Ward method
4. validation of results: visualization with
heatmap (raw data)
12 of 17
Extraction of player characteristics
13 of 17
Clustering time series of time played
Differentiated patterns can also be extracted here, as in Age of Ishtaria
14 of 17
Clustering time series of purchases
1. similarity measure:
complexity-invariant distance
2. hierarchical clustering: Ward method
3. validation of results: visualization with
heatmap (raw data)
15 of 17
Summary and Next Steps
• Unsupervised clustering of time series data from two free-to-play
games
• Evaluate several similarity measures and representation methods
• Extract meaningful behavioral patterns of players
• Assess impact of weekly game events
• Discover hidden playing dynamics regarding purchases and time
played
• Feature for churn prediction
• Event recommender
• Cluster level behaviour
16 of 17
http://www.siliconstudio.co.jp/rd/4front/
Thank you!
17 of 17

