Tag & Tag-based Recommenders


IBM Research – China

Presenter: Xiatian Zhang (张夏天)

Team: Shiwan Zhao (赵石顽), Xiatian Zhang (张夏天), Quan Yuan (袁泉)
About Me
   2000-2004, B.S. Math, Central South University

   2004-2007, M.S. Computer Science, BUPT

   2007-Present, Researcher, Working on Recommender Systems and
    Data Mining
Agenda

  Social Tagging System and Its Features

  Tag Recommender

  Tag-based Recommender
Social Tagging


   A folksonomy is a system of classification derived from the practice
    and method of collaboratively creating and managing tags to annotate
    and categorize content; this practice is also known as collaborative
    tagging, social classification, social indexing, and social tagging.
    Folksonomy is a portmanteau of folk and taxonomy.

   Social Tagging boomed from 2004, with the wave of Web 2.0.
      –   Delicious
      –   CiteULike
      –   BibSonomy
      –   YouTube
      –   Flickr
      –   Dogear – an internal social bookmarking system at IBM
      –   …
Some Insights of Tagging System

 Shilad Sen et al., "tagging, communities, vocabulary, evolution",
  CSCW ’06
    – Modeling vocabulary evolution
    – Tagging system features
    – Based on the MovieLens recommender system
    – Personal tendency and community influence
    – Tag displaying strategies and their effects
    – Tag utility
Modeling vocabulary evolution
Tagging System Features

  Design Features
     – Tag Sharing
     – Tag Selection
     – Item Ownership
     – Tag Scope
         – Broad
         – Narrow
  Tag Class
     – Factual Tag
     – Subjective Tag
     – Personal Tag
Tagging System in MovieLens
Personal Tendency

 How strongly do investment and
  habit affect personal tagging
  behavior?
    – 1. Habit and investment
      influence a user’s tag applications.
    – 2. The influence of habit and
      investment grows stronger as
      users apply more tags.
    – 3. Habit and investment cannot
      be the only factors that contribute
      to vocabulary evolution.
Community Influence
 How does the tagging
  community influence
  personal vocabulary?
    – 1. Community influence
      affects a user’s personal
      vocabulary.
    – 2. Community influence
      on a user’s first tag is
      stronger for users who
      have seen more tags.
Tag Displaying Strategies Effects
Tag Utility
Tag Recommender

   Purpose
     – Encourage users to tag more frequently, apply more tags to an
       individual resource, reuse common tags
     – Prompt users to apply tags they had not previously considered
     – Eliminate redundant tags
         – Promote a core tag vocabulary steering the user toward adopting
           certain tags while not imposing any strict rules.
         – Avoid ambiguous tags in favor of tags that offer greater information
           value.
Tag Recommender – Technologies

   Naive Methods
      – Most Popular Tags on Resources
      – Most Popular Tags on Users
      – Most Popular Tags on Resources and Users
   Classical Collaborative Filtering
      – User-KNN
      – Item-KNN
   Adapted KNN Methods
      – Extend User-Item Matrix
      – Degrade User-Item-Tag Relationship
   Content-based Method
   Tensor Method
      – Tensor Factorization
   Graph Based
      – FolkRank
   Our Work
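The naive "most popular tags" baselines listed above can be sketched in a few lines. This is a minimal illustration, not the implementation evaluated in the talk: the `posts` triples, the function names, and the equal-weight blend in `most_popular_mix` are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical tag assignments as (user, resource, tag) triples.
posts = [
    ("alice", "r1", "python"),
    ("alice", "r2", "python"),
    ("alice", "r2", "web"),
    ("bob", "r1", "python"),
    ("bob", "r1", "tutorial"),
    ("carol", "r1", "tutorial"),
    ("carol", "r1", "code"),
]


def top_k(scores, k):
    """Return the k best-scoring tags, breaking ties alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


def most_popular_by_resource(posts, resource, k=3):
    """Tags most often applied to this resource by anyone."""
    return top_k(Counter(t for _, r, t in posts if r == resource), k)


def most_popular_by_user(posts, user, k=3):
    """Tags this user has applied most often to any resource."""
    return top_k(Counter(t for u, _, t in posts if u == user), k)


def most_popular_mix(posts, user, resource, k=3, weight=0.5):
    """Blend the two relative tag frequencies (the weighting is an assumption)."""
    res = Counter(t for _, r, t in posts if r == resource)
    usr = Counter(t for u, _, t in posts if u == user)
    res_total = sum(res.values()) or 1
    usr_total = sum(usr.values()) or 1
    scores = {
        t: weight * res[t] / res_total + (1 - weight) * usr[t] / usr_total
        for t in set(res) | set(usr)
    }
    return top_k(scores, k)
```

Calling `most_popular_mix(posts, "alice", "r1")` blends what the community uses on `r1` with what Alice tends to use herself; `weight` shifts the balance between the two.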
Adapted KNN – Extend UI Matrix
Adapted KNN – Degrade User-Item-Tag relationship




  Process
    – TF/IDF on UI, UT, IT
    – P-Core Processing
        – Remove noise data
    – Extract User Model by
      Hebbian Deflation
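The P-core step can be sketched as an iterative filter. This is a simplified sketch under stated assumptions: it counts individual tag assignments (triples) rather than posts, and the function name `p_core` and the sample data are hypothetical. The TF/IDF weighting and the Hebbian deflation step are not covered here.

```python
from collections import Counter


def p_core(triples, p):
    """Keep only (user, item, tag) triples whose user, item and tag each
    occur in at least p of the remaining triples. Removing one triple can
    push another element below the threshold, so repeat until stable."""
    current = set(triples)
    while True:
        users = Counter(u for u, _, _ in current)
        items = Counter(i for _, i, _ in current)
        tags = Counter(t for _, _, t in current)
        kept = {
            (u, i, t)
            for u, i, t in current
            if users[u] >= p and items[i] >= p and tags[t] >= p
        }
        if kept == current:
            return kept
        current = kept


# Hypothetical data: the last three triples are sparse "noise".
triples = [
    ("u1", "i1", "t1"), ("u1", "i2", "t1"),
    ("u2", "i1", "t1"), ("u2", "i2", "t1"),
    ("u3", "i3", "t2"),
    ("u4", "i1", "t3"), ("u4", "i5", "t3"),
]
core = p_core(triples, 2)
```

In this example `("u4", "i1", "t3")` survives the first pass but is dropped in the second, once its companion triple is gone. That cascade is why the filter has to loop.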
Tensor Factorization
FolkRank
   PageRank

      PR(p_i) = \frac{1 - d}{N} + d \sum_{p_j \in M(p_i)} \frac{PR(p_j)}{L(p_j)}    (1)

   Personalized PageRank

      PR(p_i) = (1 - d)\, p[i] + d \sum_{p_j \in M(p_i)} \frac{PR(p_j)}{L(p_j)}    (2)

   FolkRank
       1. Compute the global PageRank with (1)
       2. For each <user, item> pair, compute the personalized PageRank with (2),
          using a preference vector p that favors the query user u and resource r
                –    p[i] = 1 for every node i, except p[u] = 1 + |U| and p[r] = 1 + |R|
       3. FolkRank = Personalized PageRank - PageRank
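The three FolkRank steps can be sketched on a toy folksonomy as follows. Several choices here are assumptions rather than details from the slides: the graph is undirected with co-occurrence counts as edge weights, the preference vector is normalised to sum to 1 so that the uniform case reduces to (1), the damping factor d = 0.7 is arbitrary, and all names and data are hypothetical.

```python
from collections import defaultdict

# Hypothetical tag assignments as (user, resource, tag) triples.
posts = [
    ("alice", "r1", "python"),
    ("alice", "r2", "web"),
    ("bob", "r1", "python"),
    ("bob", "r3", "music"),
    ("carol", "r3", "music"),
]


def build_graph(posts):
    """Undirected weighted graph over user, resource and tag nodes."""
    w = defaultdict(lambda: defaultdict(float))
    for u, r, t in posts:
        nodes = (("u", u), ("r", r), ("t", t))
        for a in nodes:
            for b in nodes:
                if a != b:
                    w[a][b] += 1.0
    return {a: dict(nb) for a, nb in w.items()}


def pagerank(w, pref, d=0.7, iters=100):
    """Iterate PR(i) = (1 - d) * pref[i] + d * sum_j PR(j) * w[j][i] / L(j)."""
    total = sum(pref.values())
    pref = {n: v / total for n, v in pref.items()}  # normalise to sum to 1
    out = {n: sum(nb.values()) for n, nb in w.items()}  # weighted degree L(j)
    pr = dict(pref)
    for _ in range(iters):
        pr = {
            i: (1 - d) * pref[i]
            + d * sum(pr[j] * w[j][i] / out[j] for j in w[i])
            for i in w
        }
    return pr


def folkrank(posts, user, resource, d=0.7):
    """Tag scores: personalized PageRank minus global PageRank."""
    w = build_graph(posts)
    uniform = {n: 1.0 for n in w}
    pref = dict(uniform)
    pref[("u", user)] = 1.0 + sum(1 for kind, _ in w if kind == "u")
    pref[("r", resource)] = 1.0 + sum(1 for kind, _ in w if kind == "r")
    base = pagerank(w, uniform, d)
    pers = pagerank(w, pref, d)
    return {name: pers[(k, name)] - base[(k, name)] for k, name in w if k == "t"}


scores = folkrank(posts, "alice", "r1")
ranked = sorted(scores, key=scores.get, reverse=True)
```

Subtracting the global ranking removes tags that are merely popular everywhere, so `ranked` favors tags that gain weight specifically for this user and resource.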
Our Work

 Explored and Exploring Methods
    – Non-classical Tensor Fusion Factorization
    – Multi-label Classification by Random Decision Trees, High Speed
    – Both methods perform close to FolkRank
 Current Progress
    – Shiwan developed a simple graph model
    – Best precision and recall on several datasets compared to other
      methods
    – We are writing a paper targeting ACM RecSys 2010
Tag-based Recommender

   Our Work
     – IUI 2008 Paper, Improved Recommendation based on Collaborative
       Tagging Behaviors
     – Explored Methods
         – Tensor Factorization
         – Non-classical Tensor and Matrix Fusion Factorization
   Other Works
     – Shilad Sen, Jesse Vig, and John Riedl, Tagommenders: Connecting
       Users to Items through Tags, WWW 2009
IUI 2008 Paper Overview


    We propose TBCF (Tag-based Collaborative Filtering), a new collaborative filtering
     approach that uses the semantic distance among tags assigned by different users
     to improve the effectiveness of neighbor selection.
    That is, two users are considered similar not only if they rate items
     similarly, but also if they perceive those items in similar ways.
    Example
        – Bob and Tom may both rate the movie Avatar with 5 stars, which indicates that
          both like the movie very much.
        – Nevertheless, Bob, a 3D fan, appreciates the movie for its high-quality 3D
          animation, while Tom may regard it as a wonderful action movie.
Tag-based Collaborative Filtering
 Tag-based User-Item Matrix

                  Item1                  Item2            Item3             Item4
       Alice      Art, photo             Home, Products   Writing, Design   Learning, Education
       Daniel     Photo, Album, Image    Ø                Typewriter        Tutorial, Training
       Sherry     Ø                      Cleaning         Ø                 Language, Study
       Maggie     Photography            Ø                Ovens             Ø

 Steps

 1.   Calculate the semantic similarity of tags based on WordNet (for the tags not
      included in WordNet, calculate the edit-distance instead)
 2.   Calculate the similarity between tag sets
 3.   Calculate the similarity between user u and v by summing up the similarity of tag
      sets on common pages (tagged by both u & v)
 4.   Find the top-N nearest neighbors of the active user to make the prediction
 5.   Return the top-M predicted items to the active user
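Steps 3 and 4 can be sketched on the tag-based user-item matrix above. To keep the example self-contained, tag-set similarity is replaced by plain Jaccard overlap on lower-cased tags. That is a stand-in for the WordNet-based measure, not the method of the paper, and the function names are hypothetical.

```python
# Tag-based user-item matrix from the slide; tags lower-cased, Ø cells omitted.
tagged = {
    "Alice": {
        "Item1": {"art", "photo"},
        "Item2": {"home", "products"},
        "Item3": {"writing", "design"},
        "Item4": {"learning", "education"},
    },
    "Daniel": {
        "Item1": {"photo", "album", "image"},
        "Item3": {"typewriter"},
        "Item4": {"tutorial", "training"},
    },
    "Sherry": {"Item2": {"cleaning"}, "Item4": {"language", "study"}},
    "Maggie": {"Item1": {"photography"}, "Item3": {"ovens"}},
}


def jaccard(a, b):
    """Stand-in tag-set similarity: exact-match overlap (an assumption)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def user_similarity(u, v, tagged, set_sim=jaccard):
    """Step 3: sum tag-set similarity over the items both users tagged."""
    common = tagged[u].keys() & tagged[v].keys()
    return sum(set_sim(tagged[u][i], tagged[v][i]) for i in common)


def nearest_neighbors(active, tagged, n=2, set_sim=jaccard):
    """Step 4 (neighbor selection only): the n most similar other users."""
    scores = {
        v: user_similarity(active, v, tagged, set_sim)
        for v in tagged
        if v != active
    }
    return sorted(scores, key=lambda v: (-scores[v], v))[:n]
```

With exact matching, Alice and Maggie score 0 on Item1 even though "photo" and "photography" are clearly related. Closing that gap is the motivation for using a semantic tag similarity in step 1; any such measure can be passed in through `set_sim`.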
Tag Similarity Calculation


  Tag similarity
      –   WordNet
      –   LSA/PLSA
  Tag set similarity
      – Hungarian method
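Tag-set similarity as an assignment problem can be sketched as below. The example is hedged in three ways: pairwise tag similarity uses normalised edit distance (the fallback the deck mentions for tags outside WordNet), the best one-to-one matching is found by brute force over permutations rather than by the Hungarian method (which solves the same problem in polynomial time), and dividing by the size of the larger set is an assumption.

```python
from itertools import permutations


def edit_distance(a, b):
    """Levenshtein distance via the standard row-by-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or match at no cost
            ))
        prev = cur
    return prev[-1]


def tag_sim(a, b):
    """Similarity in [0, 1]: 1 minus edit distance over the longer length."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - edit_distance(a, b) / longest


def tag_set_sim(xs, ys):
    """Best one-to-one matching between two tag sets, found by brute force
    (fine for the handful of tags a user puts on one item)."""
    small, large = sorted((list(xs), list(ys)), key=len)
    if not small:
        return 0.0
    best = max(
        sum(tag_sim(s, l) for s, l in zip(small, perm))
        for perm in permutations(large, len(small))
    )
    return best / len(large)
```

For example, `tag_set_sim({"art", "photo"}, {"photography"})` matches "photography" with "photo" rather than "art", which gives a small but non-zero similarity where exact matching would give none.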




                                                WordNet Concept Tree
 Word similarity in WordNet




 If x and y are contained in WordNet, dis(x,y) is the shortest path length between x and y.
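The shortest-path definition of dis(x, y) can be illustrated with a breadth-first search over a small concept tree. The tree below is made up for the example and is not taken from WordNet. Real WordNet words also have several senses, which this sketch ignores.

```python
from collections import deque

# Hypothetical is-a edges (child, parent); not actual WordNet data.
edges = [
    ("photo", "image"),
    ("painting", "image"),
    ("image", "representation"),
    ("representation", "entity"),
    ("tutorial", "lesson"),
    ("lesson", "education"),
    ("education", "entity"),
]


def build_adjacency(edges):
    """Treat is-a links as undirected so paths can go up and down the tree."""
    adj = {}
    for child, parent in edges:
        adj.setdefault(child, set()).add(parent)
        adj.setdefault(parent, set()).add(child)
    return adj


def dis(x, y, adj):
    """Shortest path length between x and y, or None if no path exists."""
    if x == y:
        return 0
    seen = {x}
    queue = deque([(x, 0)])
    while queue:
        node, dist = queue.popleft()
        for nxt in adj.get(node, ()):
            if nxt == y:
                return dist + 1
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    return None


adj = build_adjacency(edges)
```

Here "photo" and "painting" are two steps apart through their shared parent, while "photo" and "tutorial" only meet at the root. A `None` result marks the case where a tag is missing from the tree and the edit-distance fallback would apply.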
Experimental Evaluation
   Data Set
     Extracted a total of 8000 users, 5315 pages, and 7670 tags from web logs.
       Algorithm       Average Precision       Average Ranking
       TBCF            0.27                    2.8
       cosine          0.13                    1.5




       Randomly generated subset    Average Precision (TBCF)    Average Precision (cosine)
       500                          0.208                       0.121
       2000                         0.182                       0.118
       4000                         0.202                       0.173
       6000                         0.209                       0.180
Tagommenders: Connecting Users to Items through Tags
Q&A

Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 

Tag And Tag Based Recommender

  • 1. Tag & Tag-based Recommenders. IBM Research – China. Presenter: Xiatian Zhang (张夏天). Team: 赵石顽, 张夏天, 袁泉
  • 2. About Me: 2000-2004, B.S. Math, Central South University; 2004-2007, M.S. Computer Science, BUPT; 2007-Present, Researcher, working on Recommender Systems and Data Mining
  • 3. Agenda: Social Tagging System and Its Features; Tag Recommender; Tag-based Recommender
  • 4. Social Tagging: A folksonomy is a system of classification derived from the practice and method of collaboratively creating and managing tags to annotate and categorize content; this practice is also known as collaborative tagging, social classification, social indexing, and social tagging. Folksonomy is a portmanteau of folk and taxonomy. Social tagging has boomed since 2004, with the wave of Web 2.0. Examples: Delicious, CiteULike, BibSonomy, YouTube, Flickr, Dogear (an internal social bookmarking system at IBM), …
  • 5. Some Insights of Tagging System: Shilad Sen et al., "tagging, communities, vocabulary, evolution", CSCW '06. Topics: modeling vocabulary evolution; tagging system features; based on the MovieLens recommender system; personal tendency and community influence; tag displaying strategies and their effects; tag utility
  • 7. Tagging System Features: Design Features (Tag Sharing; Tag Selection; Item Ownership; Tag Scope: Broad or Narrow); Tag Class (Factual Tag; Subjective Tag; Personal Tag)
  • 8. Tagging System in Movielens
  • 9. Personal Tendency: How strongly do investment and habit affect personal tagging behavior? 1. Habit and investment influence a user's tag applications. 2. The influence of habit and investment grows stronger as users apply more tags. 3. Habit and investment cannot be the only factors that contribute to vocabulary evolution.
  • 10. Community Influence: How does the tagging community influence personal vocabulary? 1. Community influence affects a user's personal vocabulary. 2. Community influence on a user's first tag is stronger for users who have seen more tags.
  • 13. Tag Recommender, Purpose: encourage users to tag more frequently, apply more tags to an individual resource, and reuse common tags; prompt users to use tags they had not previously considered; eliminate redundant tags; promote a core tag vocabulary, steering users toward adopting certain tags without imposing strict rules; avoid ambiguous tags in favor of tags that offer greater information value.
  • 14. Tag Recommender – Technologies: Naive Methods (Most Popular Tags on Resources; Most Popular Tags on Users; Most Popular Tags on Resources and Users); Classical Collaborative Filtering (User-KNN; Item-KNN); Adapted KNN Methods (Extend User-Item Matrix; Degrade User-Item-Tag Relationship); Content-based Method; Tensor Method (Tensor Factorization); Graph Based (FolkRank); Our Work
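The naive "most popular tags" baselines listed on this slide can be sketched in a few lines. This is only an illustration, not the presenters' implementation: the function name `most_popular_tags`, the `(user, resource, tag)` triple format, and the `mix` weight used to blend the two popularity signals are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def most_popular_tags(assignments, user, resource, k=3, mix=0.5):
    """Rank candidate tags by a blend of resource and user tag popularity.

    assignments: iterable of (user, resource, tag) triples (assumed format).
    mix: hypothetical blending weight; 1.0 uses only the resource's tags,
         0.0 uses only the user's tags.
    """
    by_resource = defaultdict(Counter)
    by_user = defaultdict(Counter)
    for u, r, t in assignments:
        by_resource[r][t] += 1
        by_user[u][t] += 1

    def normalised(counter):
        # Turn raw counts into relative frequencies so both signals are comparable.
        total = sum(counter.values())
        return {t: c / total for t, c in counter.items()} if total else {}

    res = normalised(by_resource[resource])
    usr = normalised(by_user[user])
    scores = {
        t: mix * res.get(t, 0.0) + (1 - mix) * usr.get(t, 0.0)
        for t in set(res) | set(usr)
    }
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


toy = [
    ("alice", "r1", "python"),
    ("bob", "r1", "python"),
    ("bob", "r1", "code"),
    ("alice", "r2", "web"),
    ("carol", "r1", "tutorial"),
]
print(most_popular_tags(toy, "alice", "r1", k=3))
```

Setting `mix` to 1.0 or 0.0 gives the pure "tags on resources" and "tags on users" variants; any value in between corresponds to the combined variant.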
  • 15. Adapted KNN – Extend UI Matrix
  • 16. Adapted KNN – Degrade User-Item-Tag Relationship. Process: TF/IDF on UI, UT, IT; P-Core processing; remove noise data; extract user model by Hebbian Deflation
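The slide does not spell out how the p-core step is carried out. One common reading, assumed here, is that a post is one user's tag set on one item, and that users, items, and tags occurring in fewer than `k` posts are removed repeatedly until nothing changes. The function name `p_core` and the triple input format are hypothetical.

```python
from collections import Counter


def p_core(assignments, k):
    """Return the posts that survive p-core pruning at level k.

    assignments: iterable of (user, item, tag) triples (assumed format).
    Result: dict mapping (user, item) to the set of surviving tags.
    """
    posts = {}
    for u, i, t in assignments:
        posts.setdefault((u, i), set()).add(t)

    while True:
        users, items, tags = Counter(), Counter(), Counter()
        for (u, i), ts in posts.items():
            users[u] += 1
            items[i] += 1
            for t in ts:
                tags[t] += 1

        pruned = {}
        for (u, i), ts in posts.items():
            if users[u] < k or items[i] < k:
                continue  # drop posts of rare users or rare items
            kept = {t for t in ts if tags[t] >= k}
            if kept:
                pruned[(u, i)] = kept

        if pruned == posts:  # fixed point reached: nothing left to remove
            return pruned
        posts = pruned
```

Each pass can only shrink the data, so the loop always terminates; removing one sparse user can push an item or tag below the threshold, which is why a single pass is not enough.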
  • 18. FolkRank
      PageRank:
        $PR(p_i) = \frac{1-d}{N} + d \sum_{p_j \in M(p_i)} \frac{PR(p_j)}{L(p_j)}$   (1)
      Personalized PageRank, with preference vector $\mathbf{p}$:
        $PR(p_i) = (1-d)\,\mathbf{p}[i] + d \sum_{p_j \in M(p_i)} \frac{PR(p_j)}{L(p_j)}$   (2)
      FolkRank:
        1. Compute the global PageRank by (1).
        2. Then, for each <user, item> pair, compute the personalized PageRank by (2), with p[i] = 1, but p[u] = 1 + |U| and p[r] = 1 + |R|.
        3. FolkRank = Personalized PageRank - PageRank
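The three FolkRank steps can be illustrated with a small pure-Python sketch. Several things here are assumptions rather than part of the slide: the folksonomy is turned into an unweighted, undirected graph over user, resource, and tag nodes; the preference vector is normalised to sum to 1 so that the two rankings are on the same scale; and the names `pagerank` and `folkrank` are hypothetical.

```python
def pagerank(graph, pref, d=0.85, iters=100):
    """Iterate PR(i) = (1 - d) * pref[i] + d * sum over neighbours j of PR(j) / L(j)."""
    nodes = list(graph)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iters):
        new = {n: (1 - d) * pref[n] for n in nodes}
        for j in nodes:
            share = d * rank[j] / len(graph[j])  # L(j) = degree of node j
            for i in graph[j]:
                new[i] += share
        rank = new
    return rank


def folkrank(assignments, user, resource, d=0.85):
    """Score tags for one (user, resource) pair from (user, resource, tag) triples."""
    graph = {}
    for u, r, t in assignments:
        nu, nr, nt = ("u", u), ("r", r), ("t", t)
        for a, b in ((nu, nr), (nu, nt), (nr, nt)):
            graph.setdefault(a, set()).add(b)
            graph.setdefault(b, set()).add(a)
    if ("u", user) not in graph or ("r", resource) not in graph:
        raise KeyError("user or resource does not occur in the assignments")

    # Step 1: global PageRank with a uniform preference, as in equation (1).
    uniform = {v: 1.0 / len(graph) for v in graph}
    base = pagerank(graph, uniform, d)

    # Step 2: personalized PageRank, boosting the query user and resource.
    weights = {v: 1.0 for v in graph}
    weights[("u", user)] = 1.0 + sum(1 for v in graph if v[0] == "u")
    weights[("r", resource)] = 1.0 + sum(1 for v in graph if v[0] == "r")
    total = sum(weights.values())  # normalisation is an assumption of this sketch
    personal = {v: w / total for v, w in weights.items()}
    biased = pagerank(graph, personal, d)

    # Step 3: FolkRank is the difference between the two rankings, kept for tags only.
    return {v[1]: biased[v] - base[v] for v in graph if v[0] == "t"}
```

Tags with the largest positive difference are the ones that gain most from the bias toward the given user and resource, and would be recommended first.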
  • 19. Our Work. Explored and Exploring Methods: Non-classical Tensor Fusion Factorization; Multi-label Classification by Random Decision Trees (high speed); the performance of both methods is close to FolkRank. Current Progress: Shiwan developed a simple graph model; best precision and recall on several datasets compared to other methods; we are writing a paper targeting ACM RecSys 2010
  • 20. Tag-based Recommender. Our Work: IUI 2008 paper, "Improved Recommendation based on Collaborative Tagging Behaviors"; explored methods: Tensor Factorization, Non-classical Tensor and Matrix Fusion Factorization. Other Works: Shilad Sen, Jesse Vig, and John Riedl, "Tagommenders: Connecting Users to Items through Tags", WWW 2009
  • 21. IUI 2008 Paper Overview: We propose a new collaborative filtering approach, TBCF (Tag-based Collaborative Filtering), based on the semantic distance among tags assigned by different users, to improve the effectiveness of neighbor selection. That is, two users can be considered similar not only if they rate items similarly, but also if they have similar cognitions of these items. Example: Both Bob and Tom may rate the movie Avatar with 5 stars, which indicates that they both like this movie very much. Nevertheless, as a 3D fan, Bob appreciates the movie for its high-quality 3D animation, while Tom may think that it is a wonderful action movie.
  • 22. Tag-based Collaborative Filtering
      Tag-based User-Item Matrix:

                Item1                 Item2            Item3            Item4
      Alice     Art, photo            Home, Products   Writing, Design  Learning, Education
      Daniel    Photo, Album, Image   Ø                Typewriter       Tutorial, Training
      Sherry    Ø                     Cleaning         Ø                Language, Study
      Maggie    Photography           Ø                Ovens            Ø

      Steps:
        1. Calculate the semantic similarity of tags based on WordNet (for tags not included in WordNet, calculate the edit distance instead)
        2. Calculate the similarity between tag sets
        3. Calculate the similarity between users u and v by summing up the similarity of tag sets on common pages (tagged by both u and v)
        4. Find the top-N nearest neighbors of the active user to make the prediction
        5. Return the top-M predicted items to the active user
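Steps 3 to 5 can be sketched on the slide's own matrix. This is a minimal illustration under stated assumptions, not the TBCF implementation: `tag_set_similarity` uses a plain Jaccard overlap of lowercased tags as a stand-in for the WordNet-based measure of steps 1 and 2, the prediction rule (sum of neighbor similarities per unseen item) is assumed because the slide does not specify one, and all function names are hypothetical.

```python
# Tag-based user-item matrix from the slide; empty (Ø) cells are simply omitted.
MATRIX = {
    "Alice": {
        "Item1": {"Art", "photo"},
        "Item2": {"Home", "Products"},
        "Item3": {"Writing", "Design"},
        "Item4": {"Learning", "Education"},
    },
    "Daniel": {
        "Item1": {"Photo", "Album", "Image"},
        "Item3": {"Typewriter"},
        "Item4": {"Tutorial", "Training"},
    },
    "Sherry": {"Item2": {"Cleaning"}, "Item4": {"Language", "Study"}},
    "Maggie": {"Item1": {"Photography"}, "Item3": {"Ovens"}},
}


def tag_set_similarity(a, b):
    """Stand-in for steps 1-2: Jaccard overlap of lowercased tags."""
    a = {t.lower() for t in a}
    b = {t.lower() for t in b}
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def user_similarity(matrix, u, v):
    """Step 3: sum tag-set similarities over items tagged by both users."""
    common = matrix[u].keys() & matrix[v].keys()
    return sum(tag_set_similarity(matrix[u][i], matrix[v][i]) for i in common)


def nearest_neighbors(matrix, active, n):
    """Step 4: the n users most similar to the active user."""
    others = [v for v in matrix if v != active]
    others.sort(key=lambda v: (-user_similarity(matrix, active, v), v))
    return others[:n]


def recommend(matrix, active, n, m):
    """Step 5 (assumed rule): score unseen items by summed neighbor similarity."""
    scores = {}
    for v in nearest_neighbors(matrix, active, n):
        sim = user_similarity(matrix, active, v)
        for item in matrix[v]:
            if item not in matrix[active]:
                scores[item] = scores.get(item, 0.0) + sim
    return sorted(scores, key=lambda i: (-scores[i], i))[:m]


print(nearest_neighbors(MATRIX, "Alice", 1), recommend(MATRIX, "Daniel", 2, 1))
```

With the exact-match stand-in, Maggie's "Photography" has no overlap with Alice's "photo", which is the kind of miss the semantic similarity in step 1 is meant to avoid.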
  • 23. Tag Similarity Calculation: Tag similarity (WordNet; LSA/PLSA); Tag set similarity (Hungarian method). [Figure: WordNet Concept Tree] Word similarity in WordNet: if x and y are contained in WordNet, dis(x,y) is the shortest path length between x and y.
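A tag set similarity built on a one-to-one matching of tags can be sketched as follows. Two substitutions are made to keep the example dependency-free, and both are assumptions: tag similarity uses a normalised edit distance (the fallback mentioned for tags outside WordNet) instead of WordNet path length, and the optimal assignment is found by brute force over permutations rather than by the Hungarian method, which reaches the same optimum but scales to larger sets. Dividing by the size of the larger set is also an assumed normalisation.

```python
from itertools import permutations


def edit_distance(x, y):
    """Levenshtein distance computed with a rolling dynamic-programming row."""
    prev = list(range(len(y) + 1))
    for i, cx in enumerate(x, 1):
        cur = [i]
        for j, cy in enumerate(y, 1):
            cur.append(min(prev[j] + 1,                # delete cx
                           cur[j - 1] + 1,             # insert cy
                           prev[j - 1] + (cx != cy)))  # substitute or match
        prev = cur
    return prev[-1]


def tag_similarity(x, y):
    """Similarity in [0, 1]: 1 minus the edit distance over the longer length."""
    x, y = x.lower(), y.lower()
    longest = max(len(x), len(y))
    return 1.0 - edit_distance(x, y) / longest if longest else 1.0


def tag_set_similarity(a, b):
    """Best one-to-one matching between two tag sets, averaged over the larger set."""
    small, large = sorted((sorted(a), sorted(b)), key=len)
    if not small:
        return 0.0
    # Brute-force stand-in for the Hungarian method; fine for a handful of tags.
    best = max(
        sum(tag_similarity(s, l) for s, l in zip(small, perm))
        for perm in permutations(large, len(small))
    )
    return best / len(large)


print(tag_set_similarity({"Art", "photo"}, {"Photo", "Album", "Image"}))
```

Unlike an exact-match overlap, this gives partial credit to near-identical tags such as "photo" and "Photography".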
  • 24. Experimental Evaluation
      Data set: a total of 8000 users, 5315 pages and 7670 tags extracted from web logs.

      Algorithm   Average Precision   Average Ranking
      TBCF        0.27                2.8
      cosine      0.13                1.5

      Randomly generated subset   Average Precision (TBCF)   Average Precision (cosine)
      500                         0.208                      0.121
      2000                        0.182                      0.118
      4000                        0.202                      0.173
      6000                        0.209                      0.180
  • 25. Tagommenders: Connecting Users to Items through Tags
  • 26. Q&A