SlideShare une entreprise Scribd logo
1  sur  26
AFFECT ANALYSIS OF DUTCH SOCIAL
            MEDIA
                      AND
RANKING OF QUERY RESULTS OVER
         LINKED DATA


       Laurens Rietveld
Master Project Background
   Affect analysis of Dutch social media
   Finished July 2010
   VU (Stefan)
   GfK Daphne
       Marketing Research
       Online dashboard
              Data          Data
                                      Analysis
            collection   Processing



       Not involved yet in webmining
       Business case: National Railway Company (NS)
Project Background
Affect Analysis


   Affect: experience of feeling or emotion[1]
   Multiple measurements
       Physiological
       Behavioral
       Vocal
       Linguistic




                        [1] W. Huitt, The Affective System
Project Background
Affect Analysis


   What is online affect analysis
       Detect emotions on web pages
       Types of emotions[2]:
         Love
         Joy
         Surprise
         Anger
         Sadness
         Fear



                  [2] W. Parrott, Emotions in Social Psychology
Project Background
Affect Analysis


   Main problems
       Unstructured data
         Internet   (html)
         Text

       Domain dependencies
         “Goread the book” positive in book reviews, negative
         in movie reviews
       Ambivalence
         Text
         Emotion
Project Background
Dutch Social Media


   Used Social Media Types:
       Blogs (www.blogspot.com)
       Online news item reactions (www.fok.nl)
       Micro-blogs (www.twitter.com)
Project Background
Crowd Sourcing


   Problems:
       Affect analysis needs training data
       Annotating data is time-consuming
       Annotate every domain
       Normally done by researcher
   Solution: Crowd Sourcing
       Mechanical Turks
       Outsourcing simple tasks to large community
          +                       -
          Many tasks              English only
          Quick                   Risk of lower quality
          Cheap                   Unethical (debatably)
Affect Analysis Approach
Research Questions
    Is it possible to apply crowd-sourcing to affect
     analysis of Dutch social media

    Are there differences between social media types
     in affect analysis
Results
    Inter annotator agreement: low
    Neutral outvotes emotion
    Possible causes:
        Missing sentence context
        Too few annotators
        Noise introduced by translation
Results
                                    Period                                 Event
                            July 2007        Problems in the payment system of ticket automats
                            January 2009     Required chip card payment method for students
                            December 2009    Train and railway malfunctions due to snow
                            February 2010    Filthy train stations due to cleaning crew strikes

                                                          All social media
                      9%
                      8%
                      7%
 % of all documents




                      6%
                      5%
                      4%
                      3%
                      2%
                      1%
                      0%
                      -1%



                                                                         Period

                                                    Joy       Surprise      Anger        Sadness
Future work


   Other list of emotions
   Improve annotation process
      More voting

      Use other strategies for annotation tasks

   Not sentence annotation but paragraph/document
   Different social media types, different feature-
    extraction/classifier/annotation strategies
AFFECT ANALYSIS OF DUTCH SOCIAL
            MEDIA
                      AND
RANKING OF QUERY RESULTS OVER
         LINKED DATA


       Laurens Rietveld
Data2Semantics
Data2Semantics
Data2Semantics




Wicherts JM, Bakker M, Molenaar D, 2011
Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results.
PLoS ONE 6(11)
Data2Semantics
   Provide semantic infrastructure for e-Science
   How to
    share, publish, access, analyze, interpret and
    reuse data?
     Querying

     Ranking

     Information utility
     Enriched publications

     Provenance

     Annotation/interpretation
Census
Clinical Decision Support
                                             Linked Data



                                                       Clinical evidence
                                                       e.g. CT report

Hospital
                                AERS




             CDS tools
              CDS tools



            Patient Profile



           EMR            LIS          Elsevier-published
                                       Clinical Guideline
My Research

                           Ranking




 http://dbpedia.org/fct/             http://google.com
My Research

   Ranking
    1.       Relevance
             No proper „PageRank‟ equivalent for semantic web
             Heterogeneous and imprecise data
    2.       Ordering
             Performance
Relevance

   What query results are most relevant?

   Semantic web comes with implicit orderings.
    Possible indicators:
     Which ontologies are used more often?
     What can we say about these ontologies?

     Which query results are semantically similar?

     Which query results can I trust?
Ordering

SELECT ?price ?offer ?product ?vendor ((?rating + ?popularity) AS
?score)
{
   ?product :hasRating ?rating .
   ?product :producer ?producer .
   ?producer :hasPopularity ?popularity .
   ?offer :product ?product .
   ?offer :price ?price .
}
ORDER BY DESC(?score)
LIMIT 1




                                Berlin SPARQL Benchmark
Ordering
                Related work: Sara Magliacane




                                                      SPARQL-Rank
                                                                                        1
traditional




                                 1
                                 Slice                                                      Slice
                                   1                                                          1
                                                                                       1
                         13205
                                 Order                                                      Join

                         13205                                                 95                     1
                                                                               Ran                   BG
                                  Join                                         kJoi
                                                                                                      P
                                                                                n                    ?product
               438634                      13205                30                    29             ?producer
                     Join                  BG                         Ran                  Ran       ?offer
                                            P                          k                    k        ?price

       646                        679     ?product       646                          679
                                          ?producer                   BG                BG
                 BG              BG       ?offer
                  P               P       ?price
                                                                       P                 P
                                                                    ?product           ?producer
              ?product      ?producer
                                                                    ?rating            ?popularity
              ?rating       ?popularity
Current Question

   What if reasoning is required to materialize
    information?

   Top-k Closure (Stefan Schlobach)
     Avoid   full materialization while still being complete

     Vb   materialisatie
Thank You

Contenu connexe

En vedette

Google I/O 2013 - Android Graphics Performance
Google I/O 2013 - Android Graphics PerformanceGoogle I/O 2013 - Android Graphics Performance
Google I/O 2013 - Android Graphics PerformanceDouO
 
Digital Drawing Assignment Directions & Tutorial
Digital Drawing Assignment Directions & TutorialDigital Drawing Assignment Directions & Tutorial
Digital Drawing Assignment Directions & TutorialMrsM2014
 
Der 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren Code
Der 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren CodeDer 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren Code
Der 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren CodeBenjamin Schmid
 
BUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREY
BUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREYBUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREY
BUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREYCodeCore
 
Survival of the Continuist
Survival of the ContinuistSurvival of the Continuist
Survival of the ContinuistPaul Blundell
 

En vedette (9)

Google I/O 2013 - Android Graphics Performance
Google I/O 2013 - Android Graphics PerformanceGoogle I/O 2013 - Android Graphics Performance
Google I/O 2013 - Android Graphics Performance
 
Digital Drawing Assignment Directions & Tutorial
Digital Drawing Assignment Directions & TutorialDigital Drawing Assignment Directions & Tutorial
Digital Drawing Assignment Directions & Tutorial
 
Failfast
FailfastFailfast
Failfast
 
Apache hadoop
Apache hadoopApache hadoop
Apache hadoop
 
Der 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren Code
Der 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren CodeDer 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren Code
Der 10-Punkte Plan für den sicheren Weg zum nicht-wartbaren Code
 
Wlan
WlanWlan
Wlan
 
Dosing systems GWTP
Dosing systems GWTPDosing systems GWTP
Dosing systems GWTP
 
BUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREY
BUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREYBUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREY
BUILDING MODERN PYTHON WEB FRAMEWORKS USING FLASK WITH NEIL GREY
 
Survival of the Continuist
Survival of the ContinuistSurvival of the Continuist
Survival of the Continuist
 

Similaire à Presentation on Affect Analysis and Ranking

The Pistoia Alliance Information Ecosystem Workshop
The Pistoia Alliance Information Ecosystem WorkshopThe Pistoia Alliance Information Ecosystem Workshop
The Pistoia Alliance Information Ecosystem WorkshopPistoia Alliance
 
Cnw preso v7 no video no animation slide share
Cnw preso v7 no video no animation slide shareCnw preso v7 no video no animation slide share
Cnw preso v7 no video no animation slide shareDirectionFirst
 
Right Tool for the Job
Right Tool for the JobRight Tool for the Job
Right Tool for the JobBeth Kanter
 
EmPower PRSA, 2012 - Analytics & Influencers
EmPower PRSA, 2012 - Analytics & InfluencersEmPower PRSA, 2012 - Analytics & Influencers
EmPower PRSA, 2012 - Analytics & Influencersjoerhoton
 
Match management
Match managementMatch management
Match managementmaggie4667
 
Hkd2 london issue breakout_paolo guadagni_h&k
Hkd2 london issue breakout_paolo guadagni_h&kHkd2 london issue breakout_paolo guadagni_h&k
Hkd2 london issue breakout_paolo guadagni_h&kH&K Demystifying Digital
 
Social Media for Plastics Executives
Social Media for Plastics ExecutivesSocial Media for Plastics Executives
Social Media for Plastics ExecutivesEric Schwartzman
 
Executive summary banking syndicated report
Executive summary   banking syndicated reportExecutive summary   banking syndicated report
Executive summary banking syndicated reportMichalis A. Michael
 
Enabling smaller independent publishers to participate in Open Access agreeme...
Enabling smaller independent publishers to participate in Open Access agreeme...Enabling smaller independent publishers to participate in Open Access agreeme...
Enabling smaller independent publishers to participate in Open Access agreeme...Alicia Wise
 
Social Media Measurement, ROI and Business Outcomes
Social Media Measurement, ROI and Business OutcomesSocial Media Measurement, ROI and Business Outcomes
Social Media Measurement, ROI and Business OutcomesTim Marklein
 
MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...
MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...
MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...multimediaeval
 
Aalto stanford webinar-on collaborative working ennvironments 2013
Aalto stanford webinar-on collaborative working ennvironments 2013Aalto stanford webinar-on collaborative working ennvironments 2013
Aalto stanford webinar-on collaborative working ennvironments 2013Matti Vartiainen
 
Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...
Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...
Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...Wolfgang_Polt
 
Vocus Webinar: P.R.ove Yourself with Angie Jeffrey
Vocus Webinar: P.R.ove Yourself with Angie JeffreyVocus Webinar: P.R.ove Yourself with Angie Jeffrey
Vocus Webinar: P.R.ove Yourself with Angie JeffreyCision
 
OAA12 - What difference to the sustainability of open access can (a donor lik...
OAA12 - What difference to the sustainability of open access can (a donor lik...OAA12 - What difference to the sustainability of open access can (a donor lik...
OAA12 - What difference to the sustainability of open access can (a donor lik...BioMedCentral
 

Similaire à Presentation on Affect Analysis and Ranking (20)

The Pistoia Alliance Information Ecosystem Workshop
The Pistoia Alliance Information Ecosystem WorkshopThe Pistoia Alliance Information Ecosystem Workshop
The Pistoia Alliance Information Ecosystem Workshop
 
Cnw preso v7 no video no animation slide share
Cnw preso v7 no video no animation slide shareCnw preso v7 no video no animation slide share
Cnw preso v7 no video no animation slide share
 
The future of scientific information & communication
The future of scientific information & communicationThe future of scientific information & communication
The future of scientific information & communication
 
Right Tool for the Job
Right Tool for the JobRight Tool for the Job
Right Tool for the Job
 
EmPower PRSA, 2012 - Analytics & Influencers
EmPower PRSA, 2012 - Analytics & InfluencersEmPower PRSA, 2012 - Analytics & Influencers
EmPower PRSA, 2012 - Analytics & Influencers
 
Introduction to the next generation of GRI Guidelines (G4): How does a global...
Introduction to the next generation of GRI Guidelines (G4): How does a global...Introduction to the next generation of GRI Guidelines (G4): How does a global...
Introduction to the next generation of GRI Guidelines (G4): How does a global...
 
Match management
Match managementMatch management
Match management
 
Hkd2 london issue breakout_paolo guadagni_h&k
Hkd2 london issue breakout_paolo guadagni_h&kHkd2 london issue breakout_paolo guadagni_h&k
Hkd2 london issue breakout_paolo guadagni_h&k
 
Social Media for Plastics Executives
Social Media for Plastics ExecutivesSocial Media for Plastics Executives
Social Media for Plastics Executives
 
Executive summary banking syndicated report
Executive summary   banking syndicated reportExecutive summary   banking syndicated report
Executive summary banking syndicated report
 
Enabling smaller independent publishers to participate in Open Access agreeme...
Enabling smaller independent publishers to participate in Open Access agreeme...Enabling smaller independent publishers to participate in Open Access agreeme...
Enabling smaller independent publishers to participate in Open Access agreeme...
 
Ama Webcast 5.22.08
Ama Webcast 5.22.08Ama Webcast 5.22.08
Ama Webcast 5.22.08
 
Social Media Measurement, ROI and Business Outcomes
Social Media Measurement, ROI and Business OutcomesSocial Media Measurement, ROI and Business Outcomes
Social Media Measurement, ROI and Business Outcomes
 
MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...
MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...
MediaEval 2018: NewsREEL Multimedia at MediaEval 2018: News Recommendation wi...
 
Aalto stanford webinar-on collaborative working ennvironments 2013
Aalto stanford webinar-on collaborative working ennvironments 2013Aalto stanford webinar-on collaborative working ennvironments 2013
Aalto stanford webinar-on collaborative working ennvironments 2013
 
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platformsChemSpider – disseminating data and enabling an abundance of chemistry platforms
ChemSpider – disseminating data and enabling an abundance of chemistry platforms
 
Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...
Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...
Wolfgang Polt Performance Of The Czech National Research And Innovation Syste...
 
Vocus Webinar: P.R.ove Yourself with Angie Jeffrey
Vocus Webinar: P.R.ove Yourself with Angie JeffreyVocus Webinar: P.R.ove Yourself with Angie Jeffrey
Vocus Webinar: P.R.ove Yourself with Angie Jeffrey
 
OAA12 - What difference to the sustainability of open access can (a donor lik...
OAA12 - What difference to the sustainability of open access can (a donor lik...OAA12 - What difference to the sustainability of open access can (a donor lik...
OAA12 - What difference to the sustainability of open access can (a donor lik...
 
LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kalte...
LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kalte...LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kalte...
LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kalte...
 

Dernier

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAndikSusilo4
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 

Dernier (20)

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
Azure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & ApplicationAzure Monitor & Application Insight to monitor Infrastructure & Application
Azure Monitor & Application Insight to monitor Infrastructure & Application
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 

Presentation on Affect Analysis and Ranking

  • 1. AFFECT ANALYSIS OF DUTCH SOCIAL MEDIA AND RANKING OF QUERY RESULTS OVER LINKED DATA Laurens Rietveld
  • 2. Master Project Background  Affect analysis of Dutch social media  Finished July 2010  VU (Stefan)  GfK Daphne  Marketing Research  Online dashboard Data Data Analysis collection Processing  Not involved yet in webmining  Business case: National Railway Company (NS)
  • 3. Project Background Affect Analysis  Affect: experience of feeling or emotion[1]  Multiple measurements  Physiological  Behavioral  Vocal  Linguistic [1] W. Huitt, The Affective System
  • 4. Project Background Affect Analysis  What is online affect analysis  Detect emotions on web pages  Types of emotions[2]:  Love  Joy  Surprise  Anger  Sadness  Fear [2] W. Parrott, Emotions in Social Psychology
  • 5. Project Background Affect Analysis  Main problems  Unstructured data  Internet (html)  Text  Domain dependencies  “Goread the book” positive in book reviews, negative in movie reviews  Ambivalence  Text  Emotion
  • 6. Project Background Dutch Social Media  Used Social Media Types:  Blogs (www.blogspot.com)  Online news item reactions (www.fok.nl)  Micro-blogs (www.twitter.com)
  • 7. Project Background Crowd Sourcing  Problems:  Affect analysis needs training data  Annotating data is time-consuming  Annotate every domain  Normally done by researcher  Solution: Crowd Sourcing  Mechanical Turks  Outsourcing simple tasks to large community + - Many tasks English only Quick Risk of lower quality Cheap Unethical (debatably)
  • 9. Research Questions  Is it possible to apply crowd-sourcing to affect analysis of Dutch social media  Are there differences between social media types in affect analysis
  • 10. Results  Inter annotator agreement: low  Neutral outvotes emotion  Possible causes:  Missing sentence context  Too few annotators  Noise introduced by translation
  • 11. Results Period Event July 2007 Problems in the payment system of ticket automats January 2009 Required chip card payment method for students December 2009 Train and railway malfunctions due to snow February 2010 Filthy train stations due to cleaning crew strikes All social media 9% 8% 7% % of all documents 6% 5% 4% 3% 2% 1% 0% -1% Period Joy Surprise Anger Sadness
  • 12. Future work  Other list of emotions  Improve annotation process  More voting  Use other strategies for annotation tasks  Not sentence annotation but paragraph/document  Different social media types, different feature- extraction/classifier/annotation strategies
  • 13. AFFECT ANALYSIS OF DUTCH SOCIAL MEDIA AND RANKING OF QUERY RESULTS OVER LINKED DATA Laurens Rietveld
  • 16. Data2Semantics Wicherts JM, Bakker M, Molenaar D, 2011 Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results. PLoS ONE 6(11)
  • 17. Data2Semantics  Provide semantic infrastructure for e-Science  How to share, publish, access, analyze, interpret and reuse data?  Querying  Ranking  Information utility  Enriched publications  Provenance  Annotation/interpretation
  • 19. Clinical Decision Support Linked Data Clinical evidence e.g. CT report Hospital AERS CDS tools CDS tools Patient Profile EMR LIS Elsevier-published Clinical Guideline
  • 20. My Research Ranking http://dbpedia.org/fct/ http://google.com
  • 21. My Research  Ranking 1. Relevance  No proper „PageRank‟ equivalent for semantic web  Heterogeneous and imprecise data 2. Ordering  Performance
  • 22. Relevance  What query results are most relevant?  Semantic web comes with implicit orderings. Possible indicators:  Which ontologies are used more often?  What can we say about these ontologies?  Which query results are semantically similar?  Which query results can I trust?
  • 23. Ordering SELECT ?price ?offer ?product ?vendor ((?rating + ?popularity) AS ?score) { ?product :hasRating ?rating . ?product :producer ?producer . ?producer :hasPopularity ?popularity . ?offer :product ?product . ?offer :price ?price . } ORDER BY DESC(?score) LIMIT 1 Berlin SPARQL Benchmark
  • 24. Ordering  Related work: Sara Magliacane SPARQL-Rank 1 traditional 1 Slice Slice 1 1 1 13205 Order Join 13205 95 1 Ran BG Join kJoi P n ?product 438634 13205 30 29 ?producer Join BG Ran Ran ?offer P k k ?price 646 679 ?product 646 679 ?producer BG BG BG BG ?offer P P ?price P P ?product ?producer ?product ?producer ?rating ?popularity ?rating ?popularity
  • 25. Current Question  What if reasoning is required to materialize information?  Top-k Closure (Stefan Schlobach)  Avoid full materialization while still being complete  Vb materialisatie

Notes de l'éditeur

  1. Physiological:hartslagBehavioral: dmv questionnaires Vocal: stemhoogteLinguistic: analyseren van text
  2. Master project based on rankings, made explicit for a certain application (affect analysis)Thesis written more from application view. PhD research more from data
  3. Information utility: suitability of dataset in answering a query (based on complexity measures)
  4. 1795 onwardsInformation on municipalities, occupations, housing, etc
  5. AERS: adverse event reporting systemCT: clinical trialCDS: clinical decision supportWhere are the enrichted publications of elsevier?
  6. Mention implicit vs. explicit
  7. Example: online storeSimplified example
  8. Avoid full joins
  9. In other words: what if we need to apply rules, to get all the values to rank on