SlideShare une entreprise Scribd logo
1  sur  22
Scientific Recommender Systems

             Jan Petertonkoker


            January 12th, 2012




   Scientific Recommender Systems   1
Contents


Contents


  1. Motivation (Examples)
  2. Recommender Systems
  3. Categories of Recommender Systems
     3.1 Content-based Recommender: TF-IDF
     3.2 Collaborative Recommender: Apache Mahout
     3.3 Hybrid Recommender: SciPlore
  4. Visualizations (Prototype)
  5. Conclusion




                  Scientific Recommender Systems           2
Motivation


Motivation




                       Example: Amazon




             Scientific Recommender Systems           3
Motivation


Motivation




                       Example: Twitter



             Scientific Recommender Systems           4
Recommender Systems


Recommender Systems



                              u :C ×S →R

    C - set of all users
    S - set of all items
    R - totally ordered set, which describes the usefulness of the
    items to the respective user




                  Scientific Recommender Systems                             5
Categories of Recommender Systems


Categories of Recommender Systems



    content-based: items are recommended that are similar to
    items the user liked in the past
    collaborative: items are recommended that people liked that
    are similar to the user (similar taste/preferences)
    hybrid: a combination of content-based and collaborative
    recommendation approaches




                Scientific Recommender Systems                                  6
Categories of Recommender Systems


Content-based Recommender Systems


    utility u(c, s) of an item s is estimated with the help of the
    utilities u(c, si ) of all items si ∈ S that user c already rated
    that are similar to item s
    similarity between items is calculated according to their
    attributes
    user and item profiles
    common problems
         limited content analysis
         overspecialization
         new user problem




                  Scientific Recommender Systems                                    7
Categories of Recommender Systems


Content-based Recommender: TF-IDF
    N - total number of documents in the system
    keyword ki appears in ni of the documents
    fi,j denotes the number of times a certain keyword ki appears
    in a document dj




                Scientific Recommender Systems                                  8
Categories of Recommender Systems


Content-based Recommender: TF-IDF
     N - total number of documents in the system
     keyword ki appears in ni of the documents
     fi,j denotes the number of times a certain keyword ki appears
     in a document dj
 Term Frequency
                 fi,j
     TFi,j =   maxz fz,j
     maximum in the denominator calculated over the frequencies
     of all keywords kz that appear in document dj




                     Scientific Recommender Systems                                  8
Categories of Recommender Systems


Content-based Recommender: TF-IDF
     N - total number of documents in the system
     keyword ki appears in ni of the documents
     fi,j denotes the number of times a certain keyword ki appears
     in a document dj
 Term Frequency
                 fi,j
     TFi,j =   maxz fz,j
     maximum in the denominator calculated over the frequencies
     of all keywords kz that appear in document dj
 Inverse Document Frequency
                                   N
     for a keyword ki : IDFi = log ni



                     Scientific Recommender Systems                                  8
Categories of Recommender Systems


Content-based Recommender: TF-IDF
     N - total number of documents in the system
     keyword ki appears in ni of the documents
     fi,j denotes the number of times a certain keyword ki appears
     in a document dj
 Term Frequency
                 fi,j
     TFi,j =   maxz fz,j
     maximum in the denominator calculated over the frequencies
     of all keywords kz that appear in document dj
 Inverse Document Frequency
                                   N
     for a keyword ki : IDFi = log ni
 TF-IDF
     wi,j = TFi,j × IDFi
                     Scientific Recommender Systems                                  8
Categories of Recommender Systems


Collaborative Recommender Systems



    utility u(c, s) of an item s is estimated with the help of the
    utilities u(ci , s) assigned by users ci ∈ C that are similar to
    user c.
    common problems
         new user/item problem
         cold start
         sparsity
         scalability




                  Scientific Recommender Systems                                    9
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (1)



    provides a ”toolbox” to create collaborative recommender
    systems
    input
        user (long), item (long), preference (double)
        1, 111, 2.5
    data model
        input from different file formats, database
        increase performance with specific data structures




                 Scientific Recommender Systems                                    10
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (2)
    user-based recommender




               Scientific Recommender Systems                                 11
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (2)
    user-based recommender




    item-based recommender




               Scientific Recommender Systems                                 11
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (3)


    similarity measures
        pearson correlation (cosine similarity)
        euclidean distance
        spearman correlation
        log-likelihood
        ...
    slope-one recommender
    other experimental recommender implementations
        e.g. cluster-based




                 Scientific Recommender Systems                                  12
Categories of Recommender Systems


Hybrid Recommender Systems


    combination of content-based and collaborative methods
        seperate content-based and collaborative recommender
        systems; results get combined somehow
        collaborative recommender system with some added aspects of
        content-based methods
        content-based recommender system with some added aspects
        of collaborative methods
        a single recommender system which unifies content-based and
        collaborative methods from the beginning




                Scientific Recommender Systems                                 13
Categories of Recommender Systems


Hybrid Recommender: SciPlore




                       SciPlore Overview


             Scientific Recommender Systems                                 14
Visualizations (Prototype)


Visualizations (Prototype)



     several recommenders based on given database
     visualizations for explaining recommendations




                  Live Presentation


                 Scientific Recommender Systems                              15
Conclusion


Summary



   utility function
   categories of recommender systems
        content-based
        collaborative
        hybrid
   implementation with Apache Mahout
   possible visualizations




                Scientific Recommender Systems          16
Conclusion




      Questions?




Scientific Recommender Systems          17
References


References

    Apache Mahout: Scalable machine learning and data mining.
    http://mahout.apache.org/ - accessed on 6th January 2012
    SciPlore: Exploring Science. http://www.sciplore.org -
    accessed on 6th January 2012
    G Adomavicius and A Tuzhilin. Toward the next generation of
    recommender systems: a survey of the state-of-the-art and
    possible extensions. IEEE Transactions on Knowledge and
    Data Engineering, 17(6):734-749, 2005
    B Gipp, J Beel and C Hentschel. Scienstein: A research paper
    recommender system, volume 301, pages 309-315. IEEE, 2009
    Sean Owen, Robin Anil, Ted Dunning and Ellen Friedman.
    Mahout in Action, 2011

                Scientific Recommender Systems                       18

Contenu connexe

Similaire à Scientific Recommender Systems - PG PUSHPIN

An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
PyData
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
Dave King
 
Expert systems in artificial intelegence
Expert systems in artificial intelegenceExpert systems in artificial intelegence
Expert systems in artificial intelegence
Anna Aquarian
 

Similaire à Scientific Recommender Systems - PG PUSHPIN (20)

COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
 
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
 
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptx
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
FHIR intro and background at HL7 Germany 2014
FHIR intro and background at HL7 Germany 2014FHIR intro and background at HL7 Germany 2014
FHIR intro and background at HL7 Germany 2014
 
Major_Project_Presentaion_B14.pptx
Major_Project_Presentaion_B14.pptxMajor_Project_Presentaion_B14.pptx
Major_Project_Presentaion_B14.pptx
 
On the way of listening to the crowd for supporting modeling activities
On the way of listening to the crowd for supporting modeling activitiesOn the way of listening to the crowd for supporting modeling activities
On the way of listening to the crowd for supporting modeling activities
 
Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação
 
WORD
WORDWORD
WORD
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
 
CS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_RecommendationCS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_Recommendation
 
lms final ppt.pptx
lms final ppt.pptxlms final ppt.pptx
lms final ppt.pptx
 
Typicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering RecommendationTypicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering Recommendation
 
Learning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyLearning Content and Usage Factors Simultaneously
Learning Content and Usage Factors Simultaneously
 
Mis
MisMis
Mis
 
intro.ppt
intro.pptintro.ppt
intro.ppt
 
Mendeley: crowdsourcing and recommending research on a large scale
Mendeley: crowdsourcing and recommending research on a large scaleMendeley: crowdsourcing and recommending research on a large scale
Mendeley: crowdsourcing and recommending research on a large scale
 
Expert systems in artificial intelegence
Expert systems in artificial intelegenceExpert systems in artificial intelegence
Expert systems in artificial intelegence
 

Dernier

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
PECB
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
heathfieldcps1
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 

Dernier (20)

Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 

Scientific Recommender Systems - PG PUSHPIN

  • 1. Scientific Recommender Systems Jan Petertonkoker January 12th, 2012 Scientific Recommender Systems 1
  • 2. Contents Contents 1. Motivation (Examples) 2. Recommender Systems 3. Categories of Recommender Systems 3.1 Content-based Recommender: TF-IDF 3.2 Collaborative Recommender: Apache Mahout 3.3 Hybrid Recommender: SciPlore 4. Visualizations (Prototype) 5. Conclusion Scientific Recommender Systems 2
  • 3. Motivation Motivation Example: Amazon Scientific Recommender Systems 3
  • 4. Motivation Motivation Example: Twitter Scientific Recommender Systems 4
  • 5. Recommender Systems Recommender Systems u :C ×S →R C - set of all users S - set of all items R - totally ordered set, which describes the usefulness of the items to the respective user Scientific Recommender Systems 5
  • 6. Categories of Recommender Systems Categories of Recommender Systems content-based: items are recommended that are similar to items the user liked in the past collaborative: items are recommended that people liked that are similar to the user (similar taste/preferences) hybrid: a combination of content-based and collaborative recommendation approaches Scientific Recommender Systems 6
  • 7. Categories of Recommender Systems Content-based Recommender Systems utility u(c, s) of an item s is estimated with the help of the utilities u(c, si ) of all items si ∈ S that user c already rated that are similar to item s similarity between items is calculated according to their attributes user and item profiles common problems limited content analysis overspecialization new user problem Scientific Recommender Systems 7
  • 8. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Scientific Recommender Systems 8
  • 9. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Term Frequency fi,j TFi,j = maxz fz,j maximum in the denominator calculated over the frequencies of all keywords kz that appear in document dj Scientific Recommender Systems 8
  • 10. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Term Frequency fi,j TFi,j = maxz fz,j maximum in the denominator calculated over the frequencies of all keywords kz that appear in document dj Inverse Document Frequency N for a keyword ki : IDFi = log ni Scientific Recommender Systems 8
  • 11. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Term Frequency fi,j TFi,j = maxz fz,j maximum in the denominator calculated over the frequencies of all keywords kz that appear in document dj Inverse Document Frequency N for a keyword ki : IDFi = log ni TF-IDF wi,j = TFi,j × IDFi Scientific Recommender Systems 8
  • 12. Categories of Recommender Systems Collaborative Recommender Systems utility u(c, s) of an item s is estimated with the help of the utilities u(ci , s) assigned by users ci ∈ C that are similar to user c. common problems new user/item problem cold start sparsity scalability Scientific Recommender Systems 9
  • 13. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (1) provides a ”toolbox” to create collaborative recommender systems input user (long), item (long), preference (double) 1, 111, 2.5 data model input from different file formats, database increase performance with specific data structures Scientific Recommender Systems 10
  • 14. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (2) user-based recommender Scientific Recommender Systems 11
  • 15. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (2) user-based recommender item-based recommender Scientific Recommender Systems 11
  • 16. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (3) similarity measures pearson correlation (cosine similarity) euclidean distance spearman correlation log-likelihood ... slope-one recommender other experimental recommender implementations e.g. cluster-based Scientific Recommender Systems 12
  • 17. Categories of Recommender Systems Hybrid Recommender Systems combination of content-based and collaborative methods seperate content-based and collaborative recommender systems; results get combined somehow collaborative recommender system with some added aspects of content-based methods content-based recommender system with some added aspects of collaborative methods a single recommender system which unifies content-based and collaborative methods from the beginning Scientific Recommender Systems 13
  • 18. Categories of Recommender Systems Hybrid Recommender: SciPlore SciPlore Overview Scientific Recommender Systems 14
  • 19. Visualizations (Prototype) Visualizations (Prototype) several recommenders based on given database visualizations for explaining recommendations Live Presentation Scientific Recommender Systems 15
  • 20. Conclusion Summary utility function categories of recommender systems content-based collaborative hybrid implementation with Apache Mahout possible visualizations Scientific Recommender Systems 16
  • 21. Conclusion Questions? Scientific Recommender Systems 17
  • 22. References References Apache Mahout: Scalable machine learning and data mining. http://mahout.apache.org/ - accessed on 6th January 2012 SciPlore: Exploring Science. http://www.sciplore.org - accessed on 6th January 2012 G Adomavicius and A Tuzhilin. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6):734-749, 2005 B Gipp, J Beel and C Hentschel. Scienstein: A research paper recommender system, volume 301, pages 309-315. IEEE, 2009 Sean Owen, Robin Anil, Ted Dunning and Ellen Friedman. Mahout in Action, 2011 Scientific Recommender Systems 18