Sidelines:
An Algorithm for Increasing Diversity in News
and Opinion Aggregators

Sean Munson, Daniel Zhou, Paul Resnick
School of Information, University of Michigan
“front page stories from the last seven days shows that liberal
sites… have had multiple articles a day on the front page while
weeks will go by without a single major conservative blog
achieving popular status.”
                                 – Simon Owens, Mediashift Blog
September 2008
today

• Diversity goals
• Sidelines algorithm, based on votes and voters
• Diversity measures, based on votes, voters, and
  affiliations
• Pilot test
  – metrics
  – user response
• Future work
diversity goals

• Make people feel represented
• Proportional representation of viewpoints
• Expose everyone to challenging viewpoints
approval voting
• Each voter can vote for
  an unlimited number of
  items, up to once each
• Select the k items with
  the most votes




For news aggregator, votes weighted according to age
approval voting: risk of tipping?

With approval voting, a small majority may be able to claim
all the top k spots.
sidelines
• Each voter can vote for an unlimited number of
  items, up to once each
• Selection: repeat k times
  1) Select the item with the most votes
  2) Voters for that item are sidelined for the next t turns
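The two selection rules above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `approval_top_k`, `sidelines`, and `votes` are hypothetical, ties are broken alphabetically, the age weighting of votes is left out, and every voter for the selected item is sidelined whether or not they were active on that turn.

```python
from collections import Counter


def approval_top_k(votes, k):
    """Pure popularity: the k items with the most votes.

    votes maps each voter to the set of items they voted for.
    """
    totals = Counter(item for voted in votes.values() for item in voted)
    # Assumption: ties are broken alphabetically by item name.
    return sorted(totals, key=lambda item: (-totals[item], item))[:k]


def sidelines(votes, k, t):
    """Sidelines: pick one item per turn; its voters sit out the next t turns."""
    items = sorted({item for voted in votes.values() for item in voted})
    sidelined_through = {}  # voter -> last turn index in which they are ignored
    selected = []
    for turn in range(min(k, len(items))):
        # Count votes only from voters who are not currently sidelined.
        totals = {item: 0 for item in items if item not in selected}
        for voter, voted in votes.items():
            if sidelined_through.get(voter, -1) >= turn:
                continue
            for item in voted:
                if item in totals:
                    totals[item] += 1
        # Assumption: ties are broken alphabetically by item name.
        winner = min(totals, key=lambda item: (-totals[item], item))
        selected.append(winner)
        # Assumption: every voter for the winner is sidelined, active or not.
        for voter, voted in votes.items():
            if winner in voted:
                sidelined_through[voter] = turn + t
    return selected


# Hypothetical data: a 3-voter majority and a 2-voter minority, disjoint tastes.
votes = {
    "m1": {"X1", "X2"}, "m2": {"X1", "X2"}, "m3": {"X1", "X2"},
    "n1": {"Y1", "Y2"}, "n2": {"Y1", "Y2"},
}
print(approval_top_k(votes, k=2))  # ['X1', 'X2']
print(sidelines(votes, k=2, t=1))  # ['X1', 'Y1']
```

In this toy data the majority takes both slots under approval voting, while sidelines gives the second slot to the minority. With `t=0` nobody is ever sidelined, so the sketch reduces to approval voting.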
documents
voter   A   B   C   D   E   F
  1     ✔   ✔       ✔       ✔
  2         ✔       ✔       ✔
  3     ✔   ✔       ✔
  4     ✔   ✔               ✔
  5             ✔       ✔
  6             ✔       ✔
total   3   4   2   3   2   3

Approval voting selects: B, A, D, F
Sidelines selects:       B, C, A, E   (wait of just 1 turn)

Sidelines, totals among voters not sidelined at each turn:
turn    A   B   C   D   E   F   selected
  1     3   4   2   3   2   3   B
  2     0   0   2   0   2   0   C
  3     3   4   0   3   0   3   A
  4     0   1   2   1   2   1   E
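The six-voter example can be replayed with a short Python sketch that records the vote totals among active voters at each turn. The names `VOTES`, `DOCS`, and `sidelines_trace` are illustrative, and two details are assumptions rather than something the slides state: ties go to the earlier document letter, and every voter for the selected document is sidelined for the next `t` turns.

```python
# Votes read off the example table: voter -> documents voted for.
VOTES = {
    1: {"A", "B", "D", "F"},
    2: {"B", "D", "F"},
    3: {"A", "B", "D"},
    4: {"A", "B", "F"},
    5: {"C", "E"},
    6: {"C", "E"},
}
DOCS = "ABCDEF"


def sidelines_trace(votes, docs, k, t):
    """Return (selected documents, per-turn totals among active voters)."""
    sidelined_through = {}  # voter -> last turn index in which they are ignored
    selected, trace = [], []
    for turn in range(min(k, len(docs))):
        active = [v for v in votes if sidelined_through.get(v, -1) < turn]
        totals = {d: sum(d in votes[v] for v in active) for d in docs}
        trace.append([totals[d] for d in docs])
        candidates = [d for d in docs if d not in selected]
        # max() returns the first maximal element, so ties go to the earlier letter.
        winner = max(candidates, key=lambda d: totals[d])
        selected.append(winner)
        for voter, voted in votes.items():
            if winner in voted:
                sidelined_through[voter] = turn + t
    return selected, trace


# Approval voting baseline: stable sort keeps letter order among ties.
approval = sorted(DOCS, key=lambda d: -sum(d in v for v in VOTES.values()))[:4]
selected, trace = sidelines_trace(VOTES, DOCS, k=4, t=1)

print(approval)  # ['B', 'A', 'D', 'F']
print(selected)  # ['B', 'C', 'A', 'E']
for row in trace:
    print(row)
# [3, 4, 2, 3, 2, 3]
# [0, 0, 2, 0, 2, 0]
# [3, 4, 0, 3, 0, 3]
# [0, 1, 2, 1, 2, 1]
```

The printed rows match the "total" rows shown on the slides for each step.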
Measures
Inclusion / Exclusion :: Alienation :: Proportionality
inclusion / exclusion

Inclusion: portion of voters who had something
  they voted for in the result set

Exclusion: portion who didn’t.
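As a sketch, the two definitions translate directly into Python. The function names and the `VOTES` data are illustrative; the data reuses the six-voter example from the algorithm walkthrough, with the result sets B, A, D, F (approval voting) and B, C, A, E (sidelines).

```python
def inclusion(votes, result_set):
    """Fraction of voters with at least one voted-for item in the result set."""
    result = set(result_set)
    included = sum(1 for voted in votes.values() if voted & result)
    return included / len(votes)


def exclusion(votes, result_set):
    """Fraction of voters with nothing they voted for in the result set."""
    return 1.0 - inclusion(votes, result_set)


# Six-voter example: voter -> documents voted for.
VOTES = {
    1: {"A", "B", "D", "F"},
    2: {"B", "D", "F"},
    3: {"A", "B", "D"},
    4: {"A", "B", "F"},
    5: {"C", "E"},
    6: {"C", "E"},
}

print(inclusion(VOTES, ["B", "A", "D", "F"]))  # 4 of 6 voters included
print(inclusion(VOTES, ["B", "C", "A", "E"]))  # 1.0
```

In this example approval voting leaves voters 5 and 6 with nothing they voted for, while the sidelines result includes every voter.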
alienation (S_alienation)

How far down the result list to find a voted-for item.
For user u, result set K:

[equation image not preserved]

so for result set K:

[equation image not preserved]
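The slide's formulas are not available here, so the following Python sketch uses one plausible reading and should be treated as an assumption throughout: a user's score is the 1-based position of the first voted-for item in the ranked result list, or one past the end if there is none, and the aggregate is the mean user score divided by that worst-case value so that it falls in (0, 1]. The function names and the `VOTES` data (the six-voter example) are illustrative.

```python
def alienation_for_user(voted, ranked_results):
    """Assumed per-user score: position of the first voted-for item,
    or len(ranked_results) + 1 if the user voted for none of them."""
    for position, item in enumerate(ranked_results, start=1):
        if item in voted:
            return position
    return len(ranked_results) + 1


def alienation(votes, ranked_results):
    """Assumed aggregate: mean per-user score, normalized by the worst case."""
    worst = len(ranked_results) + 1
    total = sum(alienation_for_user(voted, ranked_results)
                for voted in votes.values())
    return total / (len(votes) * worst)


# Six-voter example: voter -> documents voted for.
VOTES = {
    1: {"A", "B", "D", "F"},
    2: {"B", "D", "F"},
    3: {"A", "B", "D"},
    4: {"A", "B", "F"},
    5: {"C", "E"},
    6: {"C", "E"},
}

print(alienation(VOTES, ["B", "A", "D", "F"]))  # approval voting order
print(alienation(VOTES, ["B", "C", "A", "E"]))  # sidelines order
```

Under this assumed definition the sidelines ordering scores lower (less alienation), because voters 5 and 6 find a voted-for item in second position instead of not at all.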
proportional representation
Groups G = (g1, g2, g3), and each voter has membership
in these groups

[equation image not preserved]

For set of users U, representation vector U_G:

[equation image not preserved]
proportional representation (continued)
Items’ representativeness defined according to voters’
affiliations:




So for set K:
proportional representation (continued)

Compare vectors U_G and K_G using Kullback-Leibler
divergence:

[equation image not preserved]
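A hedged Python sketch of the comparison follows. Only the Kullback-Leibler formula itself, D_KL(p || q) = Σ p_i log(p_i / q_i), is standard; everything else is an assumption made for illustration: `user_vector` takes U_G to be the share of users in each group, `result_vector` takes K_G to be the per-item group shares among an item's voters averaged over the result set, the divergence is taken in the direction D_KL(K_G || U_G), and the two-group `AFFILIATION` labels are invented for the six-voter example.

```python
import math
from collections import Counter


def user_vector(affiliation, groups):
    """Assumed U_G: share of users belonging to each group."""
    counts = Counter(affiliation.values())
    return [counts[g] / len(affiliation) for g in groups]


def result_vector(votes, affiliation, result_set, groups):
    """Assumed K_G: group shares among each item's voters, averaged over items.

    Assumes every item in result_set has at least one voter.
    """
    sums = [0.0] * len(groups)
    for item in result_set:
        voters = [v for v, voted in votes.items() if item in voted]
        counts = Counter(affiliation[v] for v in voters)
        for i, g in enumerate(groups):
            sums[i] += counts[g] / len(voters)
    return [s / len(result_set) for s in sums]


def kl_divergence(p, q):
    """D_KL(p || q) = sum_i p_i * log(p_i / q_i), using the natural log."""
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0:
            continue         # a zero-probability term contributes nothing
        if q_i == 0:
            return math.inf  # p puts mass where q has none
        total += p_i * math.log(p_i / q_i)
    return total


# Six-voter example with hypothetical group labels.
VOTES = {
    1: {"A", "B", "D", "F"}, 2: {"B", "D", "F"}, 3: {"A", "B", "D"},
    4: {"A", "B", "F"}, 5: {"C", "E"}, 6: {"C", "E"},
}
AFFILIATION = {1: "g1", 2: "g1", 3: "g1", 4: "g1", 5: "g2", 6: "g2"}
GROUPS = ["g1", "g2"]

u_g = user_vector(AFFILIATION, GROUPS)
div_approval = kl_divergence(
    result_vector(VOTES, AFFILIATION, ["B", "A", "D", "F"], GROUPS), u_g)
div_sidelines = kl_divergence(
    result_vector(VOTES, AFFILIATION, ["B", "C", "A", "E"], GROUPS), u_g)
print(div_approval, div_sidelines)
```

With these assumptions the approval-voting result set, drawn entirely from one group's votes, diverges more from the user population than the sidelines result set does.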
sidelines vs. approval voting (pure popularity)
Digg World and Business Category

Data from 11 October 2008 to 30 November 2008.

Daily average:
                 New stories      4600
                 Diggs (votes)   85000
                 Voters          24000
Digg World and Business Category

               Pure Popularity   Sidelines   p
Inclusion      0.651             0.668       <0.001
Alienation     0.476             0.463       <0.001

No user groups, so we couldn’t calculate
Proportional Representation score.
Data source: Links from 500 Political Blogs
• Links treated as votes, blogs as voters
• 24 Oct – 25 Nov
• Blogs coded as liberal (52%), conservative (35%), or
  independent (13%)
Edges indicate Jaccard similarity above average.
Multidimensional scaling layout according to Jaccard similarity.
proportional representation

[Figure: divKL by day, 25-Oct to 24-Nov, for Pure Popularity and Sidelines]

Pure popularity showed some evidence of tipping.
Some tipping in Sidelines as well, but significantly less
(paired t-test, p<0.001).
inclusion, alienation

Higher inclusion score for sidelines (0.445) than pure
popularity (0.419) (paired t-test, p<0.001).

[Figure: S_alienation by day, 25-Oct to 22-Nov, for Pure Popularity and Sidelines]

Significantly reduced S_alienation for sidelines
(paired t-test, p<0.001).
noticeable differences?

Asked 40 subjects to view
12-item result sets for
sidelines or pure popularity.

(Not told there were two
possibilities)
noticeable differences


      Somewhat liberally-biased set of
      readers had an 89% chance of
      finding something challenging in
      the sidelines result set (compared
      with 50% for pure popularity).
mixed preferences for diversity

“I make a point of visiting websites with viewpoints
different than my own, so I would have been happy
with this.” (Sidelines)


            “it’s good to know diverse opinions, but, on the other
            hand, I can’t take too much of the opinions that
            disagree with mine.” (Pure Popularity)


“I wouldn't use a news aggregator, but because it's
liberally biased [in agreement with subject’s views], I'm ok
with it.” (Pure Popularity)
applications

• News aggregators based on user votes.
• Other voting systems where diversity matters
  (e.g. Google Moderator)

• Don’t need to know anything about content, user
  groups, or long-term voting behavior
future work

• Enhancements to sidelines algorithm

• Alternative algorithms

• Actual preferences & behavior for challenging
  vs. affirming content

• Presentation to make people feel represented
  (while still viewing challenging items!)
thanks!

Sean Munson samunson@umich.edu
Daniel Zhou    mrzhou@umich.edu
Paul Resnick   presnick@umich.edu

Sidelines: An Algorithm for Increasing Diversity in News and Opinion Aggregators

  • 1. Sidelines: An Algorithm for Increasing Diversity in News and Opinion Aggregators Sean Munson, Daniel Zhou, Paul Resnick School of Information, University of Michigan
  • 2.
  • 3. “front page stories from the last seven days shows that liberal sites… have had multiple articles a day on the front page while weeks will go by without a single major conservative blog achieving popular status.” – Simon Owens, Mediashift Blog September 2008
  • 4.
  • 5. today • Diversity goals • Sidelines algorithm, based on votes and voters • Diversity measures, based on votes, voters, and affiliations • Pilot test – metrics – user response • Future work
  • 6. diversity goals • Make people feel represented • Proportional representation of viewpoints • Expose everyone to challenging viewpoints
  • 7. approval voting • Each voter can vote for an unlimited number of items, up to once each • Select the kitems with the most votes For news aggregator, votes weighted according to age
  • 8. approval voting • Each voter can vote for an unlimited number of items, up to once each • Select the kitems with the most votes Risk of tipping? With approval voting, a small majority may be able to claim all the top kspots. For news aggregator, votes weighted according to age
  • 9. approval voting sidelines • Each voter can vote for • Each voter can vote for an an unlimited number of unlimited number of items, up to once each items, up to once each • Select the kitems with • Selection: repeat k times the most votes 1) Select item with the most votes 2) Voters for that item Risk of tipping? sidelined for next t turns With approval voting, a small majority may be able to claim all the top kspots. For news aggregator, votes weighted according to age
  • 10. documents A B C D E F Approval Sidelines voting 1 ✔ ✔ ✔ ✔ 2 ✔ ✔ ✔ 3 ✔ ✔ ✔ 4 ✔ ✔ ✔ 5 ✔ ✔ 6 ✔ ✔ total 3 4 2 3 2 3
  • 11. documents A B C D E F Approval Sidelines voting 1 ✔ ✔ ✔ ✔ B 2 ✔ ✔ ✔ A 3 ✔ ✔ ✔ D 4 ✔ ✔ ✔ F 5 ✔ ✔ 6 ✔ ✔ total 3 4 2 3 2 3
  • 12. documents A B C D E F Approval Sidelines voting 1 ✔ ✔ ✔ ✔ B B 2 ✔ ✔ ✔ A 3 ✔ ✔ ✔ D 4 ✔ ✔ ✔ F 5 ✔ ✔ Wait of just 1 turn 6 ✔ ✔ total 3 4 2 3 2 3
  • 13. documents A B C D E F Approval Sidelines voting 1 ✔ ✔ ✔ ✔ B B 2 ✔ ✔ ✔ A C 3 ✔ ✔ ✔ D 4 ✔ ✔ ✔ F 5 ✔ ✔ Wait of just 1 turn 6 ✔ ✔ total 0 0 2 0 2 0
  • 14. documents A B C D E F Approval Sidelines voting 1 ✔ ✔ ✔ ✔ B B 2 ✔ ✔ ✔ A C 3 ✔ ✔ ✔ D A 4 ✔ ✔ ✔ F 5 ✔ ✔ Wait of just 1 turn 6 ✔ ✔ total 3 4 0 3 0 3
  • 15. documents A B C D E F Approval Sidelines voting 1 ✔ ✔ ✔ ✔ B B 2 ✔ ✔ ✔ A C 3 ✔ ✔ ✔ D A 4 ✔ ✔ ✔ F E 5 ✔ ✔ Wait of just 1 turn 6 ✔ ✔ total 0 1 2 1 2 1
  • 16. Measures Inclusion /Exclusion :: Alienation :: Proportionality
  • 17. inclusion / exclusion Inclusion: portion of voters who had something they voted for in the result set Exclusion: portion who didn’t.
  • 18. Salienation How far down the result list to find a voted-for item. For user u, result set K: so for result set K:
  • 19. proportional representation Groups G=(g1, g2, g3), and each voter has membership in these groups For set of users U, representation vector: UG
  • 20. proportional representation (continued) Items’ representativeness defined according to voters’ affiliations: So for set K:
  • 21. proportional representation (continued) Compare vectors UG and KG using Kullback- Leibler divergence:
  • 22. proportional representation (continued) Compare vectors UG and KG using Kullback- Leibler divergence:
  • 23. sidelines vs. approval voting (pure popularity)
  • 24. Digg World and Business Category Data from 11 October 2008 to 30 November 2008. Daily average: New stories 4600 Diggs (votes) 85000 Voters 24000
  • 25. Digg World and Business Category Pure Popularity Sidelines p Inclusion 0.651 0.668 <0.001 Alienation 0.476 0.463 <0.001 No user groups, so we couldn’t calculate Proportional Representation score.
  • 26. Data source: Links from 500 Political Blogs • Links treated as votes, blogs as voters • 24 Oct – 25 Nov • Blogs coded as liberal (52%), conservative (35%), or independent (13%)
  • 27. Edges indicate Jaccard similarity above average. Multidimensional scaling layout according to Jaccard similarity.
  • 28. proportional representation. Pure popularity showed some evidence of tipping. Some tipping in sidelines as well, but significantly less (paired t-test, p < 0.001). [Figure: daily div_KL for pure popularity and sidelines, 25 Oct to 24 Nov; y-axis from 0 to 0.07.]
  • 29. inclusion, alienation. Higher inclusion score for sidelines (0.445) than pure popularity (0.419) (paired t-test, p < 0.001). Significantly reduced S_alienation for sidelines (paired t-test, p < 0.001). [Figure: daily S_alienation for pure popularity and sidelines, 25 Oct to 22 Nov; y-axis from 0.7 to 0.85.]
  • 30. noticeable differences? Asked 40 subjects to view 12-item result sets for sidelines or pure popularity. (Subjects were not told there were two possibilities.)
  • 31. noticeable differences Somewhat liberally-biased set of readers had an 89% chance of finding something challenging in the sidelines result set (compared with 50% for pure popularity).
  • 32. mixed preferences for diversity “I make a point of visiting websites with viewpoints different than my own, so I would have been happy with this.” (Sidelines) “it’s good to know diverse opinions, but, on the other hand, I can’t take too much of the opinions that disagree with mine.” (Pure Popularity) “I wouldn't use a news aggregator, but because it's liberally biased [in agreement with subject’s views], I'm ok with it.” (Pure Popularity)
  • 33. applications • News aggregators based on user votes. • Other voting systems where diversity matters (e.g. Google Moderator) • Don’t need to know anything about content, user groups, or long-term voting behavior
  • 36. future work • Enhancements to sidelines algorithm • Alternative algorithms • Actual preferences & behavior for challenging vs. affirming content • Presentation to make people feel represented (while still viewing challenging items!)
  • 37. thanks! Sean Munson samunson@umich.edu Daniel Zhou mrzhou@umich.edu Paul Resnick presnick@umich.edu
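
The selection rules compared in slides 13 to 15 can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' implementation: the vote matrix `VOTES` is hypothetical (the ✔ positions are lost in this transcript), chosen only because it reproduces the totals rows and both selection orders shown on the slides. Ties are broken alphabetically, every voter for a selected item is sidelined (whether or not already sidelined), and age weighting of votes is left out; all three are assumptions. The function names are illustrative.

```python
from collections import Counter

# Hypothetical vote matrix: voter -> set of documents voted for.
# Assumed, not taken from the slides; it matches their totals rows.
VOTES = {
    1: {"A", "B", "D", "F"},
    2: {"A", "B", "D"},
    3: {"A", "B", "F"},
    4: {"B", "D", "F"},
    5: {"C", "E"},
    6: {"C", "E"},
}

def approval_select(votes, k):
    """Pick the k items with the most votes (ties broken by item name)."""
    tally = Counter(item for items in votes.values() for item in items)
    return sorted(tally, key=lambda item: (-tally[item], item))[:k]

def sidelines_select(votes, k, t=1):
    """Repeat k times: pick the top item among active voters,
    then sideline that item's voters for the next t turns."""
    all_items = sorted({item for items in votes.values() for item in items})
    selected = []
    sidelined_until = {}  # voter -> last turn on which the voter is sidelined
    for turn in range(k):
        remaining = [item for item in all_items if item not in selected]
        if not remaining:
            break
        # Count votes from voters who are not currently sidelined.
        tally = Counter()
        for voter, items in votes.items():
            if sidelined_until.get(voter, -1) >= turn:
                continue
            tally.update(items)
        # Most votes first; alphabetical tie-break (an assumption).
        winner = min(remaining, key=lambda item: (-tally[item], item))
        selected.append(winner)
        for voter, items in votes.items():
            if winner in items:
                sidelined_until[voter] = turn + t
    return selected

print(approval_select(VOTES, 4))        # ['B', 'A', 'D', 'F']
print(sidelines_select(VOTES, 4, t=1))  # ['B', 'C', 'A', 'E']
```

With `t=0` nobody is ever sidelined, so `sidelines_select` falls back to plain approval voting, which makes the wait parameter easy to experiment with.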

Editor's notes

  1. If people do not feel represented (they feel they have not been heard, and they don’t see content that supports them), they may exit to places where they do. This can create balkanization and polarization. Sunstein and others have warned about the problems this may cause for democracy and society.
  2. Making people feel represented can encourage people to speak up (who would have otherwise remained silent to promote social harmony). People may also be more open to hearing other views after they feel they have been heard. Proportional representation of viewpoints: as Duncan mentioned yesterday, people are not very good at knowing when others agree or disagree with them. They tend to think that support for their point of view is broader than it is. Those in the minority may think they are in the majority, and when their candidate does not win or their idea is not selected, they may feel disenfranchised or concoct conspiracy theories about how the election was “rigged” or “stolen.” Proportionally representing ideas can help people realize when they are in the minority. This can increase legitimacy of public decisions. It also may encourage the majority to stop and listen to dissenting views. Finally, exposing everyone to challenging viewpoints can lead to better problem solving as more ideas and viewpoints are included in the conversation. It can also help reduce polarization.
  3. Not saying that approval voting is exactly what anyone uses!
  4. Exclusion is just 1-inclusion.
  5. S_alienation is normalized by the maximum alienation, so it always falls in the range [1/(|K|+1), 1].
  6. Actually do need to know something about user groups and affiliations for this metric.
  7. Blogs of one bias more likely to link to items linked by blogs with the same bias (Jaccard similarity).
  8. Remember that the alienation score is how far down the list people had to go on average. So, a little over half the time, they didn't get an item at all, which counted as an alienation of 1. The other half of the time, on average, they had to go about 30-40% of the way down the list.
  9. Subjects were recruited primarily from the University of Michigan and were somewhat liberally biased.
  10. Need to do more work in this area of actual user preferences and behavior with respect to diversity.
  11. The Obama administration used Google Moderator on Change.gov during the transition to collect questions. In one category, most of the top questions were about the legalization of marijuana. Sidelines may have let the top question still be on this topic while letting other questions also make it into the first few.
  12. Complement to content analysis approaches.
  13. So we have an algorithm for increasing diversity without knowledge about content or voters’ political affiliations, and some potential applications for this algorithm. We also have some metrics for measuring diversity in result sets where people have voted on the candidate items. What’s next? Enhancement: suppress votes based on users’ voting history, optimize parameters. Other algorithms: clustering based on votes.
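
The three measures discussed in notes 4 to 8 can be illustrated with a small Python sketch. The slide formulas are not preserved in this transcript, so everything here is an assumption consistent with the notes rather than the authors' definitions. A voter's alienation is taken as the rank of the first voted-for item divided by |K|+1, or 1 when nothing voted for appears, which gives the range [1/(|K|+1), 1] from note 5. K_G is taken as the average, over result items, of the group shares of each item's voters, and the comparison is D_KL(U_G || K_G). The vote matrix, the two groups `g1` and `g2`, and all function names are hypothetical.

```python
import math

# Hypothetical data: voter -> items voted for, and voter -> single group.
VOTES = {1: {"A", "B", "D", "F"}, 2: {"A", "B", "D"}, 3: {"A", "B", "F"},
         4: {"B", "D", "F"}, 5: {"C", "E"}, 6: {"C", "E"}}
AFFILIATION = {1: "g1", 2: "g1", 3: "g1", 4: "g1", 5: "g2", 6: "g2"}
GROUPS = ["g1", "g2"]
APPROVAL_RESULT = ["B", "A", "D", "F"]   # pure popularity order
SIDELINES_RESULT = ["B", "C", "A", "E"]  # sidelines order

def inclusion(votes, result):
    """Share of voters with at least one voted-for item in the result set."""
    chosen = set(result)
    return sum(1 for items in votes.values() if items & chosen) / len(votes)

def alienation(votes, result):
    """Mean normalized depth of each voter's first voted-for item (1.0 if none)."""
    worst = len(result) + 1
    total = 0.0
    for items in votes.values():
        rank = next((pos for pos, item in enumerate(result, start=1)
                     if item in items), worst)
        total += rank / worst
    return total / len(votes)

def group_shares(voters, groups, affiliation):
    """Fraction of the given voters that belongs to each group."""
    counts = [sum(1 for v in voters if affiliation[v] == g) for g in groups]
    return [c / sum(counts) for c in counts]

def result_shares(votes, result, groups, affiliation):
    """Assumed K_G: average over items of each item's voter group shares."""
    per_item = []
    for item in result:
        item_voters = [v for v, items in votes.items() if item in items]
        if item_voters:  # skip items nobody voted for
            per_item.append(group_shares(item_voters, groups, affiliation))
    return [sum(col) / len(per_item) for col in zip(*per_item)]

def kl_divergence(p, q):
    """D_KL(p || q) in nats; infinite if q misses a group present in p."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0:
            continue
        if qi == 0:
            return math.inf
        total += pi * math.log(pi / qi)
    return total

u_g = group_shares(list(VOTES), GROUPS, AFFILIATION)
for name, result in [("pure popularity", APPROVAL_RESULT),
                     ("sidelines", SIDELINES_RESULT)]:
    k_g = result_shares(VOTES, result, GROUPS, AFFILIATION)
    print(name, inclusion(VOTES, result), alienation(VOTES, result),
          kl_divergence(u_g, k_g))
```

On this toy data the sidelines result set includes every voter and has lower alienation, while the pure popularity set leaves out group `g2` entirely, so its divergence is infinite. A real evaluation would probably need smoothing or a different handling of empty groups, which this sketch does not attempt.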