SlideShare une entreprise Scribd logo
1  sur  123
Recommending Actions,
Not Content


       david @ayman shamma
         internet experiences
   microeconomics & social systems
Internet Experiences Group




(David)
Ayman

   Shamma        Lyndon
Kennedy   Jude
Yew   Elizabeth
Churchill
Disclaimer: I !<3 Recommendation Systems
Disclaimer: I !<3 Recommendation Systems

           I <3 Engagement
#FAIL
#FAIL
Really? What are we doing?
Really? What are we doing?
What are we recommending?
Why are we doing that?
Image
Search
Image
Search
Image
Search   A
Text

               Box!!!!
Search as Recommendation
Play
the

 Music
Click
Through
on
Search
Pages

                                                          BoIom
of

                                                            “fold”


                                                                        BoIom
of

                                                                          Page

                                                                                           BoIom
of

                                                                                          2nd
Window




Adapted
from
“A
Dynamic
Bayesian
Network
Click
Model
for
Web
Search
Ranking,”
by
Olivier
Chapelle,
Ya
Zhang,
WWW’09.
Is Recent a Relevant Recommendation?
Recent




Normal
Does Relevance Matter?
• Bottom of the page
   – Normally low click through
   – Show alternate results
Does Relevance Matter?
• Bottom of the page
   – Normally low click through
   – Show alternate results
Does Relevance Matter?
• Bottom of the page
   – Normally low click through
                     G
             WRON
   – Show alternate results
Does Relevance Matter?
• Bottom of the page              Precision/recall
   – Normally low click through
                     G            doesn’t (always)
             WRON                      matter!!
   – Show alternate results
                                  (for multimedia)
Un-related images at the bottom of the page
                       should be here.


                                                          BoIom
of

                                                            “fold”


                                                                        BoIom
of

                                                                          Page

                                                                                           BoIom
of

                                                                                          2nd
Window




Adapted
from
“A
Dynamic
Bayesian
Network
Click
Model
for
Web
Search
Ranking,”
by
Olivier
Chapelle,
Ya
Zhang,
WWW’09.
Un-related images at the bottom of the page
                         are here!!!


                                                          BoIom
of

                                                            “fold”


                                                                        BoIom
of

                                                                          Page

                                                                                           BoIom
of

                                                                                          2nd
Window




Adapted
from
“A
Dynamic
Bayesian
Network
Click
Model
for
Web
Search
Ranking,”
by
Olivier
Chapelle,
Ya
Zhang,
WWW’09.
What’s
Similar?
Have
a
listen.


    Song
1            Song
2




             Song
3




                                 32
Song 1
Song 1
Song 2
Song 2
Song 3
Song 3
Context

        Song

Rater
Context

        Song

Rater
Context

        Song

Rater
So
what
do
you
like?


Song
1            Song
2




         Song
3




                           37
Song 1
Song 1
Song 2
Song 2
Song 3
Song 3
Think about ratings
Song
Similarity
Example

                  Song 1   Song 2   Song 3
    Jazz Lover      5        0        5
   Rock Lover       5        0        5
Classical Lover     0        5        0
Song
Similarity
Example

                  Song 1    Song 2      Song 3
    Jazz Lover      5          0          5
   Rock Lover       5          0          5
Classical Lover     0          5          0


                        Similar
Songs
A
Small
Experiment
(by
M.
Slaney)
• 380,911
Subjects
• 1000
Jazz
Songs
• 1,449,335
Rabngs




  Never
Play
this
Again   Love
It!
Users
do
not
rate
everything….
 Self‐Selected
Rabng
Histogram                                            True
Rabng
Histogram




               (1.5B
rabngs)                                                        (350k
rabngs)


From:
Marlin,
Zemel,
Roweis,
Slaney.
“Collaborabve
Filtering
and
the
missing
at
random
assumpbon.”
UAI
2007
About
the
Data











             46
About
the
Data







• Real
rabng
data
  – Y!
Music




                       Y!
Data
  – 700M
rabngs




                                 46
About
the
Data







• Real
rabng
data
  – Y!
Music




                       Y!
Data
  – 700M
rabngs




                       True
Distribubon




                                          46
About
the
Data







• Real
rabng
data
  – Y!
Music




                       Y!
Data
  – 700M
rabngs



                                                               

                                                          d 
of
                                                          o
                       True
Distribubon             l iho
                                                ike ing
                                               L y
                                                  pla



                                          46
Neilix
Compebbon
• Create
new
recommendabon
algorithm
  – 10%
beIer
than
Neilix
algorithm
• Data
  – 100M
rabngs
  – 480k
users,
17k
movies
• Winner
  – BellKorPragmabcChaos
Movie
rabng
data
                                Training
data              Test
data
• Training
data            score     movie      user          movie    user
   – 100
million
rabngs     1          21        1     ?        62      1
                            5         213        1     ?        96      1
   – 480,000
users
                            4         345        2     ?         7      2
   – 17,770
movies
                            4         123        2     ?         3      2
   – 6
years
of
data:

                            3         768        2     ?        47      3
      2000‐2005
                            5          76        3     ?        15      3
• Test
data
                            4          45        4     ?        41      4
   – Last
few
rabngs
of

                            1         568        5     ?        28      4
      each
user
(2.8

                            2         342        5     ?        93      5
      million)
                            2         234        5     ?        74      5
• Dates
of
rabngs
are

                            5          76        6     ?        69      6
  given
                            4          56        6     ?        83      6
Components
of
a
rabng
predictor

       user
bias                movie
bias               user‐movie
interacbon




                Baseline
predictor                    User‐movie
interacbon
   •     Separates
users
and
movies           •   Characterizes
the
matching

   •     Onen
overlooked
                         between
users
and
movies
   •     Benefits
from
insights
into
users’
   •   AIracts
most
research
in
the
field
         behavior
                                              •   Benefits
from
algorithmic
and

   •     Among
the
main
pracbcal

         contribubons
of
the
compebbon            mathemabcal
innovabons




Courtesy
of
YehudaKoren
This is kinda why we are here...
Legacy Video
Traditional Comments and Tags
Left in Whole, Unattached.
Quickly...let me tell you why I hate tags...
Tag this.
Tag this.
Tag This
Tag
Noise
Who’s
Christmas?




Canada                Australia
Hey aren’t categories tags anyhow?
Double Rainbow   Pick a category
Anyway, back on track...
Social Conversations Happen Around Media
Dolores Park, San Francisco, 2006
Social Conversations Happen Around Media
Dolores Park, San Francisco, 2006
Social Conversations happen around videos
Well – actually people join in a session and converse afterwards.
What to Collect to measure
• Type of event
  (Zync player command or a normal chat message)
• Anonymous hash
  (uniquely identifies the sender and the receiver, without
  exposing personal account data)
• URL to the shared video
• Timestamp for the event
• The player time (with respect to the specific video) at the
  point the event occurred
• The number of characters and the number words typed
  (for chat messages)
• Emoticons used in the chat message
A Short Movie
Percent of actions over time.
Chat follows the video!


                      CHAT
http://www.flickr.com/photos/wvs/3833148925/
Reciprocity
• 43.6% of the sessions the invitee played at
  least one video back to the session’s initiator.
• 77.7% sharing reciprocation
• Pairs of people often exchanged more than
  one set of videos in a session.
• In the categories of Nonprofit, Technology
  and Shows, the invitees shared more videos
How do we know what people are watching?
How can we give them better things to watch?

CLASSIFICATION
Types of features on YouTube
5 star ratings has been the golden egg for recommendation systems
so far; implicit human cooperative sharing activity works better.



CLASSIFICATION BASED ON
IMPLICIT CONNECTED SOCIAL
20 random videos sent to 43 people.
60.3% identified the category correctly.
52.3% identified the comedies correctly.

PEOPLE REALLY STINK AT THIS
Used and Unused Data
You Tube              Zync
Duration (video)      Duration (session)*
Views (video)
Duration              # of Play/Pause*
                      Duration (session)*
Rating*
Views                 # of Scrubs*
                      # of Play/Pause*
Rating*               # of Chats*
                      # of Scrubs*

You Tube (not used)   Zync (not used)
Tags                  Emoticons
Comments              User ID data
Favorites             # of Sessions
                      # of Loads
Phone in your favorite ML technique.

FIRST ORDER DATA WASN’T
PRETTY
Naïve Bayes Classification
  Type                        Accuracy
  Random Chance                 23.0%
  You Tube Features             14.6%
  You Tube Top 5 Categories     32.4%
  Zync Features                 53.9%
  Humans                        60.9%
What about these three videos? Which one you like?
Nominal Factorization
Ratings doen’t particularly specify order.
Nominal Factorization
Classification with Factoring
   Type                                                         Accuracy
   Random Chance                                                   23.0%
   You Tube Features                                               14.6%
   You Tube Top 5 Categories                                       32.4%
   YT Top 5 Factoring Duration                                     51.8%
   Humans                                                          60.9%
   YT Top 5 Factoring Views                                        66.9%
   YT Top 5 Factoring Ratings                                      75.5%
   YT Top 5 Factoring All Features                                 75.9%


  psst, yes we know that more training will do the same thing eventually,
                                                   I just don’t like waiting.
Classification w/ Zync features
    Type                                                   Accuracy
    Random Chance                                              23.0%
    You Tube Features                                          14.6%
    You Tube Top 5 Categories                                  32.4%
    YT Top 5 Factoring Duration                                51.8%
    Humans                                                     60.9%
    YT Top 5 Factoring Views                                   66.9%
    YT Top 5 Factoring Ratings                                 75.5%
    YT Top 5 Factoring All Features                            75.9%
    Zync Factored All Features                                 87.8%
    psst, we are looking at using Gradient Boosted Decision Trees in our
                                                            future work.
Finding the viral.

Can we predict if a video has over 10M views?
More so, can we do so with say 10 people across 5 sessions?
Remember this is what
    we have for data
Viral Classification w/ Zync features

    Does the video have over 10 M views?   Accuracy

    Guessing Yes                              6.3%

    Guessing No                              93.7%

    Guessing Randomly                        88.3%

    Naive Bayes (25% training set)           89.2%

    Naive Bayes (50% training set)           95.5%

    Naive Bayes (80% training set)           96.6%
Three pieces

              Classifier




Survey Data                Interviews
Audience Perception
                      Just ask Homer
             is Key
I !<3 Recommendation Systems
3 areas prime for social recommendation for disrupt:
1: Understanding the temporal and the recent.
Social Conversations Happen Around Media
Dolores Park, San Francisco, 2006
Social Conversations Happen Around Media
Dolores Park, San Francisco, 2006
Come see my talk!
Lets find a moment   Here’s an example.
All Tweets        Inauguration Tweets




Left: All tweet sample.
Right: Tweets with Inauguration keywords.
All Tweets                  Inauguration Tweets   All Tweets with @




Left: All tweet sample.
Right: Tweets with Inauguration keywords.
12:04 is what you want to
                  watch.
2: Q & A
Likes
                                          Generalization
 Questioning
the Question




                                            Clarification


                                           One Answer

               Finding answers...   ...kinda like Watson.
3: Challenges
Me: You’re in China, go to the night market for   !!
Me: You’re in China, go to the night market for           !!

My friend: Street food? Are you kidding? I’ll get sick!
Me: You’re in China, go to the night market for           !!

My friend: Street food? Are you kidding? I’ll get sick!

Me: I dare you not to!!
Me: You’re in China, go to the night market for     !!

You: Street food? Are you kidding? I’ll get sick!

Me: I dare you not to! (It’s delicious!)
Man vs. Food   http://www.travelchannel.com/TV_Shows/
               Man_V_Food
Why try to understand engagement?

                               Better advertising.




     Better understanding of the relationship between users and the sharing/
                        consumption of media content.



Better organization and classification of media for efficient navigation and content
                                    retrieval.




                     Better recommendations!
Find me: @ayman • aymans@acm.org

                                           Fin & Thanks!
Thanks to D. DuBois, M. Slaney, E. Churchill, L. Kennedy, J.Yew, S. Pentland, A.
                 Brooks, J. Dunning, B. Pardo, M. Cooper.

Knowing Funny: Genre Perception and Categorization in Social Video Sharing Jude Yew; David A. Shamma; Elizabeth F.
      Churchill, CHI 2011, ACM, 2011
Peaks and Persistence: Modeling the Shape of Microblog Conversations David A. Shamma; Lyndon Kennedy; Elizabeth F.
      Churchill, CSCW 2011, ACM, 2011
In the Limelight Over Time: Temporalities of Network Centrality David A. Shamma; Lyndon Kennedy; Elizabeth F. Churchill,
      CSCW 2011, ACM, 2011
Tweet the Debates: Understanding Community Annotation of Uncollected Sources David A. Shamma; Lyndon Kennedy;
      Elizabeth F. Churchill, ACM Multimedia, ACM, 2009
Understanding the Creative Conversation: Modeling to Engagement David A. Shamma; Dan Perkel; Kurt Luther, Creativity and
      Cognition, ACM, 2009
Spinning Online: A Case Study of Internet Broadcasting by DJs David A. Shamma; Elizabeth Churchill; Nikhil Bobb; Matt
      Fukuda, Communities & Technology, ACM, 2009
Zync with Me: Synchronized Sharing of Video through Instant Messaging David A. Shamma; Yiming Liu; Pablo Cesar, David
      Geerts, Konstantinos Chorianopoulos, Social Interactive Television: Immersive Shared Experiences and Perspectives,
      Information Science Reference, IGI Global, 2009
Enhancing online personal connections through the synchronized sharing of online video Shamma, D. A.; Bastéa-Forte, M.;
      Joubert, N.; Liu, Y., Human Factors in Computing Systems (CHI), ACM, 2008
Supporting creative acts beyond dissemination David A. Shamma; Ryan Shaw, Creativity and Cognition, ACM, 2007
Watch what I watch: using community activity to understand content David A. Shamma; Ryan Shaw; Peter Shafton; Yiming
      Liu, ACM Multimedia Workshop on Multimedia Information Retrival (MIR), ACM, 2007
Zync: the design of synchronized video sharing Yiming Liu; David A. Shamma; Peter Shafton; Jeannie Yang, Designing for
      User eXperiences, ACM, 2007

Contenu connexe

Dernier

Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Visualising and forecasting stocks using Dash
Visualising and forecasting stocks using DashVisualising and forecasting stocks using Dash
Visualising and forecasting stocks using Dashnarutouzumaki53779
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 

Dernier (20)

Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate AgentsRyan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
Ryan Mahoney - Will Artificial Intelligence Replace Real Estate Agents
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Visualising and forecasting stocks using Dash
Visualising and forecasting stocks using DashVisualising and forecasting stocks using Dash
Visualising and forecasting stocks using Dash
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 

En vedette

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by HubspotMarius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTExpeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsPixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfmarketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsKurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summarySpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentLily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best PracticesVit Horky
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project managementMindGenius
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...RachelPearson36
 

En vedette (20)

2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 

Recommending Actions, Not Content

  • 1. Recommending Actions, Not Content david @ayman shamma internet experiences microeconomics & social systems
  • 2. Internet Experiences Group (David)
Ayman
 Shamma Lyndon
Kennedy Jude
Yew Elizabeth
Churchill
  • 3. Disclaimer: I !<3 Recommendation Systems
  • 4. Disclaimer: I !<3 Recommendation Systems I <3 Engagement
  • 5.
  • 6.
  • 7.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14. #FAIL
  • 15.
  • 16. Really? What are we doing?
  • 17. Really? What are we doing? What are we recommending? Why are we doing that?
  • 18.
  • 19.
  • 22. Image
Search A
Text
 Box!!!!
  • 24.
  • 26. Click
Through
on
Search
Pages BoIom
of
 “fold” BoIom
of
 Page BoIom
of
 2nd
Window Adapted
from
“A
Dynamic
Bayesian
Network
Click
Model
for
Web
Search
Ranking,”
by
Olivier
Chapelle,
Ya
Zhang,
WWW’09.
  • 27. Is Recent a Relevant Recommendation?
  • 29.
  • 30. Does Relevance Matter? • Bottom of the page – Normally low click through – Show alternate results
  • 31. Does Relevance Matter? • Bottom of the page – Normally low click through – Show alternate results
  • 32. Does Relevance Matter? • Bottom of the page – Normally low click through G WRON – Show alternate results
  • 33. Does Relevance Matter? • Bottom of the page Precision/recall – Normally low click through G doesn’t (always) WRON matter!! – Show alternate results (for multimedia)
  • 34. Un-related images at the bottom of the page should be here. BoIom
of
 “fold” BoIom
of
 Page BoIom
of
 2nd
Window Adapted
from
“A
Dynamic
Bayesian
Network
Click
Model
for
Web
Search
Ranking,”
by
Olivier
Chapelle,
Ya
Zhang,
WWW’09.
  • 35. Un-related images at the bottom of the page are here!!! BoIom
of
 “fold” BoIom
of
 Page BoIom
of
 2nd
Window Adapted
from
“A
Dynamic
Bayesian
Network
Click
Model
for
Web
Search
Ranking,”
by
Olivier
Chapelle,
Ya
Zhang,
WWW’09.
  • 36. What’s
Similar?
Have
a
listen. Song
1 Song
2 Song
3 32
  • 43. Context Song Rater
  • 44. Context Song Rater
  • 45. Context Song Rater
  • 54. Song
Similarity
Example Song 1 Song 2 Song 3 Jazz Lover 5 0 5 Rock Lover 5 0 5 Classical Lover 0 5 0
  • 55. Song
Similarity
Example Song 1 Song 2 Song 3 Jazz Lover 5 0 5 Rock Lover 5 0 5 Classical Lover 0 5 0 Similar
Songs
  • 57. Users
do
not
rate
everything…. Self‐Selected
Rabng
Histogram True
Rabng
Histogram (1.5B
rabngs) (350k
rabngs) From:
Marlin,
Zemel,
Roweis,
Slaney.
“Collaborabve
Filtering
and
the
missing
at
random
assumpbon.”
UAI
2007
  • 59. About
the
Data






 • Real
rabng
data – Y!
Music Y!
Data – 700M
rabngs 46
  • 60. About
the
Data






 • Real
rabng
data – Y!
Music Y!
Data – 700M
rabngs True
Distribubon 46
  • 61. About
the
Data






 • Real
rabng
data – Y!
Music Y!
Data – 700M
rabngs 
 d 
of o True
Distribubon l iho ike ing L y pla 46
  • 62. Neilix
Compebbon • Create
new
recommendabon
algorithm – 10%
beIer
than
Neilix
algorithm • Data – 100M
rabngs – 480k
users,
17k
movies • Winner – BellKorPragmabcChaos
  • 63. Movie
rabng
data Training
data Test
data • Training
data score movie user movie user – 100
million
rabngs 1 21 1 ? 62 1 5 213 1 ? 96 1 – 480,000
users 4 345 2 ? 7 2 – 17,770
movies 4 123 2 ? 3 2 – 6
years
of
data:
 3 768 2 ? 47 3 2000‐2005 5 76 3 ? 15 3 • Test
data 4 45 4 ? 41 4 – Last
few
rabngs
of
 1 568 5 ? 28 4 each
user
(2.8
 2 342 5 ? 93 5 million) 2 234 5 ? 74 5 • Dates
of
rabngs
are
 5 76 6 ? 69 6 given 4 56 6 ? 83 6
  • 64. Components
of
a
rabng
predictor user
bias movie
bias user‐movie
interacbon Baseline
predictor User‐movie
interacbon • Separates
users
and
movies • Characterizes
the
matching
 • Onen
overlooked
 between
users
and
movies • Benefits
from
insights
into
users’
 • AIracts
most
research
in
the
field behavior • Benefits
from
algorithmic
and
 • Among
the
main
pracbcal
 contribubons
of
the
compebbon mathemabcal
innovabons Courtesy
of
YehudaKoren
  • 65. This is kinda why we are here...
  • 67. Traditional Comments and Tags Left in Whole, Unattached.
  • 68. Quickly...let me tell you why I hate tags...
  • 74. Hey aren’t categories tags anyhow?
  • 75. Double Rainbow Pick a category
  • 76. Anyway, back on track...
  • 77. Social Conversations Happen Around Media Dolores Park, San Francisco, 2006
  • 78. Social Conversations Happen Around Media Dolores Park, San Francisco, 2006
  • 79. Social Conversations happen around videos Well – actually people join in a session and converse afterwards.
  • 80.
  • 81. What to Collect to measure • Type of event (Zync player command or a normal chat message) • Anonymous hash (uniquely identifies the sender and the receiver, without exposing personal account data) • URL to the shared video • Timestamp for the event • The player time (with respect to the specific video) at the point the event occurred • The number of characters and the number words typed (for chat messages) • Emoticons used in the chat message
  • 83. Percent of actions over time.
  • 84. Chat follows the video! CHAT
  • 86. Reciprocity • 43.6% of the sessions the invitee played at least one video back to the session’s initiator. • 77.7% sharing reciprocation • Pairs of people often exchanged more than one set of videos in a session. • In the categories of Nonprofit, Technology and Shows, the invitees shared more videos
  • 87. How do we know what people are watching? How can we give them better things to watch? CLASSIFICATION
  • 88. Types of features on YouTube
  • 89. 5 star ratings has been the golden egg for recommendation systems so far; implicit human cooperative sharing activity works better. CLASSIFICATION BASED ON IMPLICIT CONNECTED SOCIAL
  • 90. 20 random videos sent to 43 people. 60.3% identified the category correctly. 52.3% identified the comedies correctly. PEOPLE REALLY STINK AT THIS
  • 91. Used and Unused Data You Tube Zync Duration (video) Duration (session)* Views (video) Duration # of Play/Pause* Duration (session)* Rating* Views # of Scrubs* # of Play/Pause* Rating* # of Chats* # of Scrubs* You Tube (not used) Zync (not used) Tags Emoticons Comments User ID data Favorites # of Sessions # of Loads
  • 92. Phone in your favorite ML technique. FIRST ORDER DATA WASN’T PRETTY
  • 93. Naïve Bayes Classification Type Accuracy Random Chance 23.0% You Tube Features 14.6% You Tube Top 5 Categories 32.4% Zync Features 53.9% Humans 60.9%
  • 94. What about these three videos? Which one you like? Nominal Factorization
  • 95. Ratings doen’t particularly specify order. Nominal Factorization
  • 96. Classification with Factoring Type Accuracy Random Chance 23.0% You Tube Features 14.6% You Tube Top 5 Categories 32.4% YT Top 5 Factoring Duration 51.8% Humans 60.9% YT Top 5 Factoring Views 66.9% YT Top 5 Factoring Ratings 75.5% YT Top 5 Factoring All Features 75.9% psst, yes we know that more training will do the same thing eventually, I just don’t like waiting.
  • 97. Classification w/ Zync features Type Accuracy Random Chance 23.0% You Tube Features 14.6% You Tube Top 5 Categories 32.4% YT Top 5 Factoring Duration 51.8% Humans 60.9% YT Top 5 Factoring Views 66.9% YT Top 5 Factoring Ratings 75.5% YT Top 5 Factoring All Features 75.9% Zync Factored All Features 87.8% psst, we are looking at using Gradient Boosted Decision Trees in our future work.
  • 98. Finding the viral. Can we predict if a video has over 10M views? More so, can we do so with say 10 people across 5 sessions?
  • 99. Remember this is what we have for data
  • 100. Viral Classification w/ Zync features Does the video have over 10 M views? Accuracy Guessing Yes 6.3% Guessing No 93.7% Guessing Randomly 88.3% Naive Bayes (25% training set) 89.2% Naive Bayes (50% training set) 95.5% Naive Bayes (80% training set) 96.6%
  • 101. Three pieces Classifier Survey Data Interviews
  • 102. Audience Perception Just ask Homer is Key
  • 104. 3 areas prime for social recommendation for disrupt:
  • 105. 1: Understanding the temporal and the recent.
  • 106. Social Conversations Happen Around Media Dolores Park, San Francisco, 2006
  • 107. Social Conversations Happen Around Media Dolores Park, San Francisco, 2006
  • 108. Come see my talk!
  • 109. Lets find a moment Here’s an example.
  • 110. All Tweets Inauguration Tweets Left: All tweet sample. Right: Tweets with Inauguration keywords.
  • 111. All Tweets Inauguration Tweets All Tweets with @ Left: All tweet sample. Right: Tweets with Inauguration keywords.
  • 112. 12:04 is what you want to watch.
  • 113. 2: Q & A
  • 114.
  • 115. Likes Generalization Questioning the Question Clarification One Answer Finding answers... ...kinda like Watson.
  • 117. Me: You’re in China, go to the night market for !!
  • 118. Me: You’re in China, go to the night market for !! My friend: Street food? Are you kidding? I’ll get sick!
  • 119. Me: You’re in China, go to the night market for !! My friend: Street food? Are you kidding? I’ll get sick! Me: I dare you not to!!
  • 120. Me: You’re in China, go to the night market for !! You: Street food? Are you kidding? I’ll get sick! Me: I dare you not to! (It’s delicious!)
  • 121. Man vs. Food http://www.travelchannel.com/TV_Shows/ Man_V_Food
  • 122. Why try to understand engagement? Better advertising. Better understanding of the relationship between users and the sharing/ consumption of media content. Better organization and classification of media for efficient navigation and content retrieval. Better recommendations!
  • 123. Find me: @ayman • aymans@acm.org Fin & Thanks! Thanks to D. DuBois, M. Slaney, E. Churchill, L. Kennedy, J.Yew, S. Pentland, A. Brooks, J. Dunning, B. Pardo, M. Cooper. Knowing Funny: Genre Perception and Categorization in Social Video Sharing Jude Yew; David A. Shamma; Elizabeth F. Churchill, CHI 2011, ACM, 2011 Peaks and Persistence: Modeling the Shape of Microblog Conversations David A. Shamma; Lyndon Kennedy; Elizabeth F. Churchill, CSCW 2011, ACM, 2011 In the Limelight Over Time: Temporalities of Network Centrality David A. Shamma; Lyndon Kennedy; Elizabeth F. Churchill, CSCW 2011, ACM, 2011 Tweet the Debates: Understanding Community Annotation of Uncollected Sources David A. Shamma; Lyndon Kennedy; Elizabeth F. Churchill, ACM Multimedia, ACM, 2009 Understanding the Creative Conversation: Modeling to Engagement David A. Shamma; Dan Perkel; Kurt Luther, Creativity and Cognition, ACM, 2009 Spinning Online: A Case Study of Internet Broadcasting by DJs David A. Shamma; Elizabeth Churchill; Nikhil Bobb; Matt Fukuda, Communities & Technology, ACM, 2009 Zync with Me: Synchronized Sharing of Video through Instant Messaging David A. Shamma; Yiming Liu; Pablo Cesar, David Geerts, Konstantinos Chorianopoulos, Social Interactive Television: Immersive Shared Experiences and Perspectives, Information Science Reference, IGI Global, 2009 Enhancing online personal connections through the synchronized sharing of online video Shamma, D. A.; Bastéa-Forte, M.; Joubert, N.; Liu, Y., Human Factors in Computing Systems (CHI), ACM, 2008 Supporting creative acts beyond dissemination David A. Shamma; Ryan Shaw, Creativity and Cognition, ACM, 2007 Watch what I watch: using community activity to understand content David A. Shamma; Ryan Shaw; Peter Shafton; Yiming Liu, ACM Multimedia Workshop on Multimedia Information Retrival (MIR), ACM, 2007 Zync: the design of synchronized video sharing Yiming Liu; David A. Shamma; Peter Shafton; Jeannie Yang, Designing for User eXperiences, ACM, 2007

Notes de l'éditeur

  1. here are my notes\n\n
  2. There are many of us, but this is the work of three.\n
  3. \n
  4. \n
  5. \n
  6. \n
  7. \n
  8. \n
  9. \n
  10. \n
  11. \n
  12. \n
  13. \n
  14. \n
  15. \n
  16. are we that bad at this?\n
  17. are we that bad at this?\n
  18. These verbs have us trapped in 1998&amp;#x2026;oh ya and the anti-flash silliness doesn&amp;#x2019;t help.\n
  19. These verbs have us trapped in 1998&amp;#x2026;oh ya and the anti-flash silliness doesn&amp;#x2019;t help.\n
  20. Recommendation buys us the ability to discover (search) without text.\n
  21. \n
  22. \n
  23. \n
  24. Adapted from &amp;#x201C;A Dynamic Bayesian Network Click Model for Web Search Ranking,&amp;#x201D; by Olivier Chapelle, Ya Zhang, WWW&amp;#x2019;09.\n
  25. \n
  26. \n
  27. Side bar of related people\n
  28. \n
  29. \n
  30. \n
  31. Adapted from &amp;#x201C;A Dynamic Bayesian Network Click Model for Web Search Ranking,&amp;#x201D; by Olivier Chapelle, Ya Zhang, WWW&amp;#x2019;09.\n
  32. Adapted from &amp;#x201C;A Dynamic Bayesian Network Click Model for Web Search Ranking,&amp;#x201D; by Olivier Chapelle, Ya Zhang, WWW&amp;#x2019;09.\n
  33. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  34. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  35. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  36. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  37. In a study I performed a few years ago we compared two different approaches for judging music similarity [Slaney and White]. &amp;#xA0;In the classic approach we use music features, often used to judge genre. &amp;#xA0;The assumption is that if these features are good for making genre judgements, then they will also tell us something about similarity. &amp;#xA0;This feature is known as a genregram [Tsanatakis]. &amp;#xA0;The content is rich---it tells us everything we need to know about the music. &amp;#xA0;In fact, listeners can tell whether they like a radio station within seconds of changing the dial.\n\n
  38. In a study I performed a few years ago we compared two different approaches for judging music similarity [Slaney and White]. &amp;#xA0;In the classic approach we use music features, often used to judge genre. &amp;#xA0;The assumption is that if these features are good for making genre judgements, then they will also tell us something about similarity. &amp;#xA0;This feature is known as a genregram [Tsanatakis]. &amp;#xA0;The content is rich---it tells us everything we need to know about the music. &amp;#xA0;In fact, listeners can tell whether they like a radio station within seconds of changing the dial.\n\n
  39. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  40. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  41. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  42. Bagpipes from: http://www.weddingbagpipes.com/ \nBeethoven Orchestral Ode to Joy from Various (Walt Disney Records)/\nClassical Silly Songs\nAlong with the Mozart (Symphony No. 40)\n
  43. \n
  44. The alternative is an item-to-item judgement based on user ratings. &amp;#xA0;The idea considers each song as a point in a multidimensional space defined by a user&apos;s rating of the song. &amp;#xA0;On a 5-point scale, this is just 2.2 bits of information! &amp;#xA0;If a jazz lover, a rock lover, and a hip-hop lover all give two songs the same rating, then the two songs are probably quite similar.\n\n
  45. The alternative is an item-to-item judgement based on user ratings. &amp;#xA0;The idea considers each song as a point in a multidimensional space defined by a user&apos;s rating of the song. &amp;#xA0;On a 5-point scale, this is just 2.2 bits of information! &amp;#xA0;If a jazz lover, a rock lover, and a hip-hop lover all give two songs the same rating, then the two songs are probably quite similar.\n\n
  46. In our study, we used the ratings by XXX listeners of 1000 different songs. After adjusting for missing data, we formed a vector of all user ratings for each song. &amp;#xA0;Song similarity was defined as the correlation between the user-rating vectors for the two songs.\n\n
  47. We initially expected that a bias of 50% would be best. This means that strong likes and dislikes would be equally important. \n\nBut user&amp;#x2019;s don&amp;#x2019;t rate everything. Left, summary of 717M user ratings. Right 35k users, rating 10 songs at random.\n
  48. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  49. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  50. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  51. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  52. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  53. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  54. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  55. We tested the two song-similarity approaches by starting with a seed song and forming playlists. &amp;#xA0;In a blind test, user&apos;s overwhelmingly said that the songs on the playlist based on rating data were more similar to each other than those based on the genre space, or a random selection of songs. &amp;#xA0;How can this be? &amp;#xA0;Just 2.2 bits beat out a state-of-the-art system based on content.\n\nProblem: How do we figure out the semantics of media signals? We can do simple problems like ASR and OCR. This is the holy grail of image analysis. We want to solve the problem when we have some information about the signal (like a caption).\n\nProblem: How do we describe the time course of a podcast, a musical signal, or a movie? What parts are similar to each other? How do we pick out the most salient portions? How do we segment?\n
  56. Netflix recently hosted a one million dollar competition to find a better recommendation system for their movies. &amp;#xA0;It is not an understatement to say that it captured the entire machine-learning community&apos;s interest. &amp;#xA0;Thousands of hours of research, in all different directions, were directed at this problem.\n\nWhile the identity of the users was unknown, the movie titles were not. &amp;#xA0;Researchers quickly identified each movie and analyzed their content. &amp;#xA0;It only makes sense that Alice, who loves romance movies, will like very different content from Bob, who wants action films. &amp;#xA0;We should be able to use this information to build a better recommendation system.\n\n
  57. But alas, content didn&apos;t help! &amp;#xA0;The winning systems included every possible signal [Koren, Y]. &amp;#xA0;One that surprised me was that the amount of time between the movie&apos;s release and the user&apos;s rating. &amp;#xA0;Evidently there is a strong correlation, with older movies getting a higher rating. &amp;#xA0;All available signals were combined using a boosting. &amp;#xA0;In boosting various (weak) classifiers are combined to make a prediction (the movie&apos;s rating by a new user) &amp;#xA0;if they reduce the error on an unseen test data set. &amp;#xA0;Dozens of different features were included.\n\n\n
  58. Not a single feature was derived from the movie&apos;s content! &amp;#xA0;These were well-motivated researchers, with access to the best of the algorithms in the multimedia literature. But we couldn&apos;t help them. &amp;#xA0;Arguably, the movie&apos;s genre was reflected in the rating data. &amp;#xA0;But in the end the FFT lost to *****&apos;s.\n\n
  59. \n
  60. Transactional. There is MORE to tagging and comments in social media than how we think of it currently as the single browser/site/startup.\n
  61. These tags and comments are regulated to anchored explicit annotation. This is the problem. Temporally, there is a gap &amp;#x2013; we cannot leverage these components like we have with photos. Some tags and notes are added as deep annotation, but that&amp;#x2019;s rare.\n
  62. \n
  63. Notre Dame!\n
  64. Augsburg Cathedral\n
  65. Australia\n
  66. All tagged Christmas\n
  67. Likewise, the context of an image tells us a LOT about what might be in the image. &amp;#xA0;We like to treat multimedia classification as a simple problem---here is an image, does it show a telephone box? &amp;#xA0;But in the real world every piece of content has a history. &amp;#xA0;At the very least we know it was shot by a real person (or a real person owned the camera.) &amp;#xA0;The image was uploaded to a web site, and each web site has a flavor. &amp;#xA0;Photos on the ESPN web site are very different from those at TMZ. &amp;#xA0;Photos uploaded to Flickr (tm) are often more artistic than the people shots typical on Facebook. &amp;#xA0;Even more finely, the friends of a person who takes lots of pictures of cats, will probably have friends who like and take pictures of cats.\n\n
  68. \n
  69. \n
  70. \n
  71. \n
  72. \n
  73. \n
  74. \n
  75. \n
  76. \n
  77. \n
  78. \n
  79. http://www.flickr.com/photos/wvs/3833148925/\n\nThis is a three part talk where I&amp;#x2019;ll discuss IM, Chatrooms, and Twitter.\n
  80. Gift giving at its finest\n
  81. \n
  82. \n
  83. \n
  84. \n
  85. So we started looking at classification based on two datasets YouTube and Zync. Each is about 5000 videos (or sessions).\n
  86. I come from a strong AI family&amp;#x2026;so I don&amp;#x2019;t wanna get too into it&amp;#x2026;\n
  87. \n
  88. So we started to think about what the data was saying to us&amp;#x2026;\n
  89. \n
  90. \n
  91. \n
  92. \n
  93. \n
  94. \n
  95. Triangulate between the classifier results, the survey results and the interviews:\n Determine whether the Na&amp;#xEF;ve Bayes classifier or humans are better at determining whether a video belongs to the &amp;#x201C;comedy&amp;#x201D; genre.\n Determine if the &amp;#x201C;ground truth&amp;#x201D; genre categories provided by the original uploader is reliable.\n
  96. \n
  97. \n
  98. \n
  99. \n
  100. \n
  101. \n
  102. \n
  103. \n
  104. \n
  105. \n
  106. \n
  107. \n
  108. \n
  109. \n
  110. \n
  111. A dare is my favorite type of social recommandation\n
  112. A dare is my favorite type of social recommandation\n
  113. A dare is my favorite type of social recommandation\n
  114. A dare is my favorite type of social recommandation\n
  115. 72-oz. steak\n
  116. \n
  117. Conversational \n