SlideShare une entreprise Scribd logo
1  sur  22
Scientific Recommender Systems

             Jan Petertonkoker


            January 12th, 2012




   Scientific Recommender Systems   1
Contents


Contents


  1. Motivation (Examples)
  2. Recommender Systems
  3. Categories of Recommender Systems
     3.1 Content-based Recommender: TF-IDF
     3.2 Collaborative Recommender: Apache Mahout
     3.3 Hybrid Recommender: SciPlore
  4. Visualizations (Prototype)
  5. Conclusion




                  Scientific Recommender Systems           2
Motivation


Motivation




                       Example: Amazon




             Scientific Recommender Systems           3
Motivation


Motivation




                       Example: Twitter



             Scientific Recommender Systems           4
Recommender Systems


Recommender Systems



                              u :C ×S →R

    C - set of all users
    S - set of all items
    R - totally ordered set, which describes the usefulness of the
    items to the respective user




                  Scientific Recommender Systems                             5
Categories of Recommender Systems


Categories of Recommender Systems



    content-based: items are recommended that are similar to
    items the user liked in the past
    collaborative: items are recommended that people liked that
    are similar to the user (similar taste/preferences)
    hybrid: a combination of content-based and collaborative
    recommendation approaches




                Scientific Recommender Systems                                  6
Categories of Recommender Systems


Content-based Recommender Systems


    utility u(c, s) of an item s is estimated with the help of the
    utilities u(c, si ) of all items si ∈ S that user c already rated
    that are similar to item s
    similarity between items is calculated according to their
    attributes
    user and item profiles
    common problems
         limited content analysis
         overspecialization
         new user problem




                  Scientific Recommender Systems                                    7
Categories of Recommender Systems


Content-based Recommender: TF-IDF
    N - total number of documents in the system
    keyword ki appears in ni of the documents
    fi,j denotes the number of times a certain keyword ki appears
    in a document dj




                Scientific Recommender Systems                                  8
Categories of Recommender Systems


Content-based Recommender: TF-IDF
     N - total number of documents in the system
     keyword ki appears in ni of the documents
     fi,j denotes the number of times a certain keyword ki appears
     in a document dj
 Term Frequency
                 fi,j
     TFi,j =   maxz fz,j
     maximum in the denominator calculated over the frequencies
     of all keywords kz that appear in document dj




                     Scientific Recommender Systems                                  8
Categories of Recommender Systems


Content-based Recommender: TF-IDF
     N - total number of documents in the system
     keyword ki appears in ni of the documents
     fi,j denotes the number of times a certain keyword ki appears
     in a document dj
 Term Frequency
                 fi,j
     TFi,j =   maxz fz,j
     maximum in the denominator calculated over the frequencies
     of all keywords kz that appear in document dj
 Inverse Document Frequency
                                   N
     for a keyword ki : IDFi = log ni



                     Scientific Recommender Systems                                  8
Categories of Recommender Systems


Content-based Recommender: TF-IDF
     N - total number of documents in the system
     keyword ki appears in ni of the documents
     fi,j denotes the number of times a certain keyword ki appears
     in a document dj
 Term Frequency
                 fi,j
     TFi,j =   maxz fz,j
     maximum in the denominator calculated over the frequencies
     of all keywords kz that appear in document dj
 Inverse Document Frequency
                                   N
     for a keyword ki : IDFi = log ni
 TF-IDF
     wi,j = TFi,j × IDFi
                     Scientific Recommender Systems                                  8
Categories of Recommender Systems


Collaborative Recommender Systems



    utility u(c, s) of an item s is estimated with the help of the
    utilities u(ci , s) assigned by users ci ∈ C that are similar to
    user c.
    common problems
         new user/item problem
         cold start
         sparsity
         scalability




                  Scientific Recommender Systems                                    9
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (1)



    provides a ”toolbox” to create collaborative recommender
    systems
    input
        user (long), item (long), preference (double)
        1, 111, 2.5
    data model
        input from different file formats, database
        increase performance with specific data structures




                 Scientific Recommender Systems                                    10
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (2)
    user-based recommender




               Scientific Recommender Systems                                 11
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (2)
    user-based recommender




    item-based recommender




               Scientific Recommender Systems                                 11
Categories of Recommender Systems


Collaborative Recommender: Apache Mahout (3)


    similarity measures
        pearson correlation (cosine similarity)
        euclidean distance
        spearman correlation
        log-likelihood
        ...
    slope-one recommender
    other experimental recommender implementations
        e.g. cluster-based




                 Scientific Recommender Systems                                  12
Categories of Recommender Systems


Hybrid Recommender Systems


    combination of content-based and collaborative methods
        seperate content-based and collaborative recommender
        systems; results get combined somehow
        collaborative recommender system with some added aspects of
        content-based methods
        content-based recommender system with some added aspects
        of collaborative methods
        a single recommender system which unifies content-based and
        collaborative methods from the beginning




                Scientific Recommender Systems                                 13
Categories of Recommender Systems


Hybrid Recommender: SciPlore




                       SciPlore Overview


             Scientific Recommender Systems                                 14
Visualizations (Prototype)


Visualizations (Prototype)



     several recommenders based on given database
     visualizations for explaining recommendations




                  Live Presentation


                 Scientific Recommender Systems                              15
Conclusion


Summary



   utility function
   categories of recommender systems
        content-based
        collaborative
        hybrid
   implementation with Apache Mahout
   possible visualizations




                Scientific Recommender Systems          16
Conclusion




      Questions?




Scientific Recommender Systems          17
References


References

    Apache Mahout: Scalable machine learning and data mining.
    http://mahout.apache.org/ - accessed on 6th January 2012
    SciPlore: Exploring Science. http://www.sciplore.org -
    accessed on 6th January 2012
    G Adomavicius and A Tuzhilin. Toward the next generation of
    recommender systems: a survey of the state-of-the-art and
    possible extensions. IEEE Transactions on Knowledge and
    Data Engineering, 17(6):734-749, 2005
    B Gipp, J Beel and C Hentschel. Scienstein: A research paper
    recommender system, volume 301, pages 309-315. IEEE, 2009
    Sean Owen, Robin Anil, Ted Dunning and Ellen Friedman.
    Mahout in Action, 2011

                Scientific Recommender Systems                       18

Contenu connexe

Similaire à Scientific Recommender Systems - PG PUSHPIN

COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...IJCSEA Journal
 
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...IJCSEA Journal
 
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...PyData
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptxprathammishra28
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf13DikshaDatir
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
FHIR intro and background at HL7 Germany 2014
FHIR intro and background at HL7 Germany 2014FHIR intro and background at HL7 Germany 2014
FHIR intro and background at HL7 Germany 2014Ewout Kramer
 
Major_Project_Presentaion_B14.pptx
Major_Project_Presentaion_B14.pptxMajor_Project_Presentaion_B14.pptx
Major_Project_Presentaion_B14.pptxLokeshKumarReddy8
 
On the way of listening to the crowd for supporting modeling activities
On the way of listening to the crowd for supporting modeling activitiesOn the way of listening to the crowd for supporting modeling activities
On the way of listening to the crowd for supporting modeling activitiesDavide Ruscio
 
Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação Gabriel Moreira
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3Dave King
 
CS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_RecommendationCS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_RecommendationPalani Kumar
 
Typicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering RecommendationTypicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering RecommendationPapitha Velumani
 
Learning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyLearning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyArnab Bhadury
 
Mendeley: crowdsourcing and recommending research on a large scale
Mendeley: crowdsourcing and recommending research on a large scaleMendeley: crowdsourcing and recommending research on a large scale
Mendeley: crowdsourcing and recommending research on a large scaleKris Jack
 
Expert systems in artificial intelegence
Expert systems in artificial intelegenceExpert systems in artificial intelegence
Expert systems in artificial intelegenceAnna Aquarian
 

Similaire à Scientific Recommender Systems - PG PUSHPIN (20)

COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
 
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
COMPARISON OF COLLABORATIVE FILTERING ALGORITHMS WITH VARIOUS SIMILARITY MEAS...
 
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
An Example of Predictive Analytics: Building a Recommendation Engine Using Py...
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptx
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
FHIR intro and background at HL7 Germany 2014
FHIR intro and background at HL7 Germany 2014FHIR intro and background at HL7 Germany 2014
FHIR intro and background at HL7 Germany 2014
 
Major_Project_Presentaion_B14.pptx
Major_Project_Presentaion_B14.pptxMajor_Project_Presentaion_B14.pptx
Major_Project_Presentaion_B14.pptx
 
On the way of listening to the crowd for supporting modeling activities
On the way of listening to the crowd for supporting modeling activitiesOn the way of listening to the crowd for supporting modeling activities
On the way of listening to the crowd for supporting modeling activities
 
Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação Sistemas de Recomendação sem Enrolação
Sistemas de Recomendação sem Enrolação
 
WORD
WORDWORD
WORD
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
 
CS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_RecommendationCS8091_BDA_Unit_III_Content_Based_Recommendation
CS8091_BDA_Unit_III_Content_Based_Recommendation
 
lms final ppt.pptx
lms final ppt.pptxlms final ppt.pptx
lms final ppt.pptx
 
Typicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering RecommendationTypicality-Based Collaborative Filtering Recommendation
Typicality-Based Collaborative Filtering Recommendation
 
Learning Content and Usage Factors Simultaneously
Learning Content and Usage Factors SimultaneouslyLearning Content and Usage Factors Simultaneously
Learning Content and Usage Factors Simultaneously
 
Mis
MisMis
Mis
 
intro.ppt
intro.pptintro.ppt
intro.ppt
 
Mendeley: crowdsourcing and recommending research on a large scale
Mendeley: crowdsourcing and recommending research on a large scaleMendeley: crowdsourcing and recommending research on a large scale
Mendeley: crowdsourcing and recommending research on a large scale
 
Expert systems in artificial intelegence
Expert systems in artificial intelegenceExpert systems in artificial intelegence
Expert systems in artificial intelegence
 

Dernier

Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfAyushMahapatra5
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...Sapna Thakur
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhikauryashika82
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 

Dernier (20)

Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
BAG TECHNIQUE Bag technique-a tool making use of public health bag through wh...
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in DelhiRussian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
Russian Escort Service in Delhi 11k Hotel Foreigner Russian Call Girls in Delhi
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 

Scientific Recommender Systems - PG PUSHPIN

  • 1. Scientific Recommender Systems Jan Petertonkoker January 12th, 2012 Scientific Recommender Systems 1
  • 2. Contents Contents 1. Motivation (Examples) 2. Recommender Systems 3. Categories of Recommender Systems 3.1 Content-based Recommender: TF-IDF 3.2 Collaborative Recommender: Apache Mahout 3.3 Hybrid Recommender: SciPlore 4. Visualizations (Prototype) 5. Conclusion Scientific Recommender Systems 2
  • 3. Motivation Motivation Example: Amazon Scientific Recommender Systems 3
  • 4. Motivation Motivation Example: Twitter Scientific Recommender Systems 4
  • 5. Recommender Systems Recommender Systems u :C ×S →R C - set of all users S - set of all items R - totally ordered set, which describes the usefulness of the items to the respective user Scientific Recommender Systems 5
  • 6. Categories of Recommender Systems Categories of Recommender Systems content-based: items are recommended that are similar to items the user liked in the past collaborative: items are recommended that people liked that are similar to the user (similar taste/preferences) hybrid: a combination of content-based and collaborative recommendation approaches Scientific Recommender Systems 6
  • 7. Categories of Recommender Systems Content-based Recommender Systems utility u(c, s) of an item s is estimated with the help of the utilities u(c, si ) of all items si ∈ S that user c already rated that are similar to item s similarity between items is calculated according to their attributes user and item profiles common problems limited content analysis overspecialization new user problem Scientific Recommender Systems 7
  • 8. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Scientific Recommender Systems 8
  • 9. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Term Frequency fi,j TFi,j = maxz fz,j maximum in the denominator calculated over the frequencies of all keywords kz that appear in document dj Scientific Recommender Systems 8
  • 10. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Term Frequency fi,j TFi,j = maxz fz,j maximum in the denominator calculated over the frequencies of all keywords kz that appear in document dj Inverse Document Frequency N for a keyword ki : IDFi = log ni Scientific Recommender Systems 8
  • 11. Categories of Recommender Systems Content-based Recommender: TF-IDF N - total number of documents in the system keyword ki appears in ni of the documents fi,j denotes the number of times a certain keyword ki appears in a document dj Term Frequency fi,j TFi,j = maxz fz,j maximum in the denominator calculated over the frequencies of all keywords kz that appear in document dj Inverse Document Frequency N for a keyword ki : IDFi = log ni TF-IDF wi,j = TFi,j × IDFi Scientific Recommender Systems 8
  • 12. Categories of Recommender Systems Collaborative Recommender Systems utility u(c, s) of an item s is estimated with the help of the utilities u(ci , s) assigned by users ci ∈ C that are similar to user c. common problems new user/item problem cold start sparsity scalability Scientific Recommender Systems 9
  • 13. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (1) provides a ”toolbox” to create collaborative recommender systems input user (long), item (long), preference (double) 1, 111, 2.5 data model input from different file formats, database increase performance with specific data structures Scientific Recommender Systems 10
  • 14. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (2) user-based recommender Scientific Recommender Systems 11
  • 15. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (2) user-based recommender item-based recommender Scientific Recommender Systems 11
  • 16. Categories of Recommender Systems Collaborative Recommender: Apache Mahout (3) similarity measures pearson correlation (cosine similarity) euclidean distance spearman correlation log-likelihood ... slope-one recommender other experimental recommender implementations e.g. cluster-based Scientific Recommender Systems 12
  • 17. Categories of Recommender Systems Hybrid Recommender Systems combination of content-based and collaborative methods seperate content-based and collaborative recommender systems; results get combined somehow collaborative recommender system with some added aspects of content-based methods content-based recommender system with some added aspects of collaborative methods a single recommender system which unifies content-based and collaborative methods from the beginning Scientific Recommender Systems 13
  • 18. Categories of Recommender Systems Hybrid Recommender: SciPlore SciPlore Overview Scientific Recommender Systems 14
  • 19. Visualizations (Prototype) Visualizations (Prototype) several recommenders based on given database visualizations for explaining recommendations Live Presentation Scientific Recommender Systems 15
  • 20. Conclusion Summary utility function categories of recommender systems content-based collaborative hybrid implementation with Apache Mahout possible visualizations Scientific Recommender Systems 16
  • 21. Conclusion Questions? Scientific Recommender Systems 17
  • 22. References References Apache Mahout: Scalable machine learning and data mining. http://mahout.apache.org/ - accessed on 6th January 2012 SciPlore: Exploring Science. http://www.sciplore.org - accessed on 6th January 2012 G Adomavicius and A Tuzhilin. Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6):734-749, 2005 B Gipp, J Beel and C Hentschel. Scienstein: A research paper recommender system, volume 301, pages 309-315. IEEE, 2009 Sean Owen, Robin Anil, Ted Dunning and Ellen Friedman. Mahout in Action, 2011 Scientific Recommender Systems 18