SlideShare a Scribd company logo
1 of 10
Topical Keyphrase Extraction
from Twitter
Abstract
In this paper, we propose to extract topical
keyphrases as one way to summarize Twitter.
1、A context-sensitive topical PageRank method for
keyword ranking
2、A probabilistic scoring function that considers
both relevance and interestingness of keyphrases for
keyphrase ranking
Introduction
Keyphrases are defined as a short list of terms to
summarize the topics of a document.
Challenges:
1. Tweets are much shorter than traditional
articles and not all tweets contain useful
information;
2. Topics tend to be more diverse in Twitter than
informal articles such as news reports.
Keyword ranking; Candidate keyphrase generation;
Keyphrase ranking.
Method
Method

Topic discovery

Keywords ranking
and candidate
generation

Keyphrases
ranking
Topic discovery
A modified author-topic model called Twitter-LDA
introduced by Zhao etal.(2011)
Topical PageRank for Keyword
Ranking
The topic-specific PageRank scores: TPR

A topic context sensitive PageRank method: cTPR
Topical PageRank for Keyword
Ranking
a common candidate keyphrase generation method
proposed by Mihalcea and Tarau(2004) as follows:
1. Select the top S keywords for each topic
2. look for combinations of these keywords that occur
as frequent phrases in the text collection
Probabilistic Models for Topical Keyphrase
Ranking
Relevance : A good keyphrase should be closely
related to the given topic and also discriminative.
Interestingness : A good keyphrase should be
interesting and can attract users’ attention.
Probabilistic Models for Topical Keyphrase
Ranking
In general we can assume that P ( R = 0)>> P ( R = 1)
because there are much more non-relevant
keyphrases than relevant ones,that is, δ>> 1 .

More Related Content

What's hot

Module 2: Information Sources and Tools
Module 2: Information Sources and ToolsModule 2: Information Sources and Tools
Module 2: Information Sources and ToolsLaraLibrarian
 
Effective Searching: Part 2 - Broaden your search (Generic Web)
Effective Searching: Part 2 - Broaden your search (Generic Web)Effective Searching: Part 2 - Broaden your search (Generic Web)
Effective Searching: Part 2 - Broaden your search (Generic Web)Jamie Bisset
 
Module 1: Search Strategy
Module 1:  Search StrategyModule 1:  Search Strategy
Module 1: Search StrategyLaraLibrarian
 
Learning project muse, Spr 09 gaxiola, ca
Learning project muse, Spr 09 gaxiola, caLearning project muse, Spr 09 gaxiola, ca
Learning project muse, Spr 09 gaxiola, cacarriegaxiola
 
Module 3: Evaluating Information
Module 3: Evaluating InformationModule 3: Evaluating Information
Module 3: Evaluating InformationLaraLibrarian
 
Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...Nadeem Sohail
 
NIH Public Access Policy
NIH Public Access PolicyNIH Public Access Policy
NIH Public Access PolicyEblingHSLibrary
 
NIH Public Access Policy
NIH Public Access PolicyNIH Public Access Policy
NIH Public Access Policyguestd7cc4a
 
Reference List Citations - APA 6th Edition
Reference List Citations - APA 6th EditionReference List Citations - APA 6th Edition
Reference List Citations - APA 6th EditionJanice Orcutt
 
Social Work Masters Student Introduction
Social Work Masters Student IntroductionSocial Work Masters Student Introduction
Social Work Masters Student IntroductionLucia Ravi
 
The.research.process pr4 5-6
The.research.process pr4 5-6The.research.process pr4 5-6
The.research.process pr4 5-6Fatma Saadawi
 
Link your ORCID to your Scopus Author ID (Durham Authors)
Link your ORCID to your Scopus Author ID (Durham Authors)Link your ORCID to your Scopus Author ID (Durham Authors)
Link your ORCID to your Scopus Author ID (Durham Authors)Jamie Bisset
 
Phrase Based Indexing
Phrase Based IndexingPhrase Based Indexing
Phrase Based Indexingbalaabirami
 
Literature searching
Literature searchingLiterature searching
Literature searchingazjackson
 

What's hot (20)

Module 2: Information Sources and Tools
Module 2: Information Sources and ToolsModule 2: Information Sources and Tools
Module 2: Information Sources and Tools
 
Effective Searching: Part 2 - Broaden your search (Generic Web)
Effective Searching: Part 2 - Broaden your search (Generic Web)Effective Searching: Part 2 - Broaden your search (Generic Web)
Effective Searching: Part 2 - Broaden your search (Generic Web)
 
Module 1: Search Strategy
Module 1:  Search StrategyModule 1:  Search Strategy
Module 1: Search Strategy
 
Learning project muse, Spr 09 gaxiola, ca
Learning project muse, Spr 09 gaxiola, caLearning project muse, Spr 09 gaxiola, ca
Learning project muse, Spr 09 gaxiola, ca
 
Module 3: Evaluating Information
Module 3: Evaluating InformationModule 3: Evaluating Information
Module 3: Evaluating Information
 
Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...Literature searching techniques and free online resources for scholars by Nad...
Literature searching techniques and free online resources for scholars by Nad...
 
NIH Public Access Policy
NIH Public Access PolicyNIH Public Access Policy
NIH Public Access Policy
 
NIH Public Access Policy
NIH Public Access PolicyNIH Public Access Policy
NIH Public Access Policy
 
Reference List Citations - APA 6th Edition
Reference List Citations - APA 6th EditionReference List Citations - APA 6th Edition
Reference List Citations - APA 6th Edition
 
Automatic indexing
Automatic indexingAutomatic indexing
Automatic indexing
 
Precis
PrecisPrecis
Precis
 
Social Work Masters Student Introduction
Social Work Masters Student IntroductionSocial Work Masters Student Introduction
Social Work Masters Student Introduction
 
The.research.process pr4 5-6
The.research.process pr4 5-6The.research.process pr4 5-6
The.research.process pr4 5-6
 
SEGAP-General resource information
SEGAP-General resource informationSEGAP-General resource information
SEGAP-General resource information
 
Link your ORCID to your Scopus Author ID (Durham Authors)
Link your ORCID to your Scopus Author ID (Durham Authors)Link your ORCID to your Scopus Author ID (Durham Authors)
Link your ORCID to your Scopus Author ID (Durham Authors)
 
Thesaurus 2101
Thesaurus 2101Thesaurus 2101
Thesaurus 2101
 
Phrase Based Indexing
Phrase Based IndexingPhrase Based Indexing
Phrase Based Indexing
 
Keyword Vs. Subject Searching
Keyword Vs. Subject SearchingKeyword Vs. Subject Searching
Keyword Vs. Subject Searching
 
Paper 1
Paper 1Paper 1
Paper 1
 
Literature searching
Literature searchingLiterature searching
Literature searching
 

Similar to Topical keyphrase extraction from twitter

Copyright © 2014 EMC Corporation. All rights reserved.Copy.docx
Copyright © 2014 EMC Corporation. All rights reserved.Copy.docxCopyright © 2014 EMC Corporation. All rights reserved.Copy.docx
Copyright © 2014 EMC Corporation. All rights reserved.Copy.docxmelvinjrobinson2199
 
Analyzing Qualitative Data PR1
Analyzing Qualitative Data PR1 Analyzing Qualitative Data PR1
Analyzing Qualitative Data PR1 BobbyPabores1
 
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...IRJET Journal
 
TitleFinder Extracting the Headline of News Web Pages
TitleFinder Extracting the Headline of News Web PagesTitleFinder Extracting the Headline of News Web Pages
TitleFinder Extracting the Headline of News Web PagesHadi Mohammadzadeh
 
Recommending content from social information streams
Recommending content from social information streamsRecommending content from social information streams
Recommending content from social information streamsPARC, a Xerox company
 
Algorithms for the thematic analysis of twitter datasets
Algorithms for the thematic analysis of twitter datasetsAlgorithms for the thematic analysis of twitter datasets
Algorithms for the thematic analysis of twitter datasetsaneeshabakharia
 
Sheet1695 grading rubric assignment 7 note all topics must be rel
Sheet1695 grading rubric assignment 7 note all topics must be relSheet1695 grading rubric assignment 7 note all topics must be rel
Sheet1695 grading rubric assignment 7 note all topics must be relrock73
 
Selection of Tags for Tag Clouds
Selection of Tags for Tag CloudsSelection of Tags for Tag Clouds
Selection of Tags for Tag CloudsAakash Gupta
 
Mining Scientific Papers
Mining Scientific PapersMining Scientific Papers
Mining Scientific PapersAmal Alnouri
 
Tutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisTutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisYun Hao
 
Analyzing the Meaning of the Qualitative Data.pptx
Analyzing the Meaning of the Qualitative Data.pptxAnalyzing the Meaning of the Qualitative Data.pptx
Analyzing the Meaning of the Qualitative Data.pptxBobbyPabores1
 
How to write international journal
How to write international journalHow to write international journal
How to write international journalsanacusso
 
NLP Techniques for Text Summarization.docx
NLP Techniques for Text Summarization.docxNLP Techniques for Text Summarization.docx
NLP Techniques for Text Summarization.docxKevinSims18
 
SEO Content Contributors by Andrea H. Berberich @webpresenceopti
SEO Content Contributors by Andrea H. Berberich @webpresenceoptiSEO Content Contributors by Andrea H. Berberich @webpresenceopti
SEO Content Contributors by Andrea H. Berberich @webpresenceoptiAndrea Berberich
 
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...dannyijwest
 

Similar to Topical keyphrase extraction from twitter (20)

Ire major project
Ire major projectIre major project
Ire major project
 
Copyright © 2014 EMC Corporation. All rights reserved.Copy.docx
Copyright © 2014 EMC Corporation. All rights reserved.Copy.docxCopyright © 2014 EMC Corporation. All rights reserved.Copy.docx
Copyright © 2014 EMC Corporation. All rights reserved.Copy.docx
 
Ranking Twitter Conversations
Ranking Twitter ConversationsRanking Twitter Conversations
Ranking Twitter Conversations
 
sdv.pptx
sdv.pptxsdv.pptx
sdv.pptx
 
Slideshow ire
Slideshow ireSlideshow ire
Slideshow ire
 
Analyzing Qualitative Data PR1
Analyzing Qualitative Data PR1 Analyzing Qualitative Data PR1
Analyzing Qualitative Data PR1
 
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
IRJET- A Survey on Trend Analysis on Twitter for Predicting Public Opinion on...
 
Slideshow ire
Slideshow ireSlideshow ire
Slideshow ire
 
TitleFinder Extracting the Headline of News Web Pages
TitleFinder Extracting the Headline of News Web PagesTitleFinder Extracting the Headline of News Web Pages
TitleFinder Extracting the Headline of News Web Pages
 
Recommending content from social information streams
Recommending content from social information streamsRecommending content from social information streams
Recommending content from social information streams
 
Algorithms for the thematic analysis of twitter datasets
Algorithms for the thematic analysis of twitter datasetsAlgorithms for the thematic analysis of twitter datasets
Algorithms for the thematic analysis of twitter datasets
 
Sheet1695 grading rubric assignment 7 note all topics must be rel
Sheet1695 grading rubric assignment 7 note all topics must be relSheet1695 grading rubric assignment 7 note all topics must be rel
Sheet1695 grading rubric assignment 7 note all topics must be rel
 
Selection of Tags for Tag Clouds
Selection of Tags for Tag CloudsSelection of Tags for Tag Clouds
Selection of Tags for Tag Clouds
 
Mining Scientific Papers
Mining Scientific PapersMining Scientific Papers
Mining Scientific Papers
 
Tutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisTutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment Analysis
 
Analyzing the Meaning of the Qualitative Data.pptx
Analyzing the Meaning of the Qualitative Data.pptxAnalyzing the Meaning of the Qualitative Data.pptx
Analyzing the Meaning of the Qualitative Data.pptx
 
How to write international journal
How to write international journalHow to write international journal
How to write international journal
 
NLP Techniques for Text Summarization.docx
NLP Techniques for Text Summarization.docxNLP Techniques for Text Summarization.docx
NLP Techniques for Text Summarization.docx
 
SEO Content Contributors by Andrea H. Berberich @webpresenceopti
SEO Content Contributors by Andrea H. Berberich @webpresenceoptiSEO Content Contributors by Andrea H. Berberich @webpresenceopti
SEO Content Contributors by Andrea H. Berberich @webpresenceopti
 
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
An Efficient Approach for Keyword Selection ; Improving Accessibility of Web ...
 

More from moresmile

When relevance is not enough
When relevance is not enoughWhen relevance is not enough
When relevance is not enoughmoresmile
 
Using content and interactions for discovering communities in
Using content and interactions for discovering communities inUsing content and interactions for discovering communities in
Using content and interactions for discovering communities inmoresmile
 
Questions about questions
Questions about questionsQuestions about questions
Questions about questionsmoresmile
 
Magnet community identification on social networks
Magnet community identification on social networksMagnet community identification on social networks
Magnet community identification on social networksmoresmile
 
Is it time for a career switch
Is it time for a career switchIs it time for a career switch
Is it time for a career switchmoresmile
 
Generating event storylines from microblogs
Generating event storylines from microblogsGenerating event storylines from microblogs
Generating event storylines from microblogsmoresmile
 
Finding bursty topics from microblogs
Finding bursty topics from microblogsFinding bursty topics from microblogs
Finding bursty topics from microblogsmoresmile
 
Exploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthExploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthmoresmile
 
Event summarization using tweets
Event summarization using tweetsEvent summarization using tweets
Event summarization using tweetsmoresmile
 

More from moresmile (9)

When relevance is not enough
When relevance is not enoughWhen relevance is not enough
When relevance is not enough
 
Using content and interactions for discovering communities in
Using content and interactions for discovering communities inUsing content and interactions for discovering communities in
Using content and interactions for discovering communities in
 
Questions about questions
Questions about questionsQuestions about questions
Questions about questions
 
Magnet community identification on social networks
Magnet community identification on social networksMagnet community identification on social networks
Magnet community identification on social networks
 
Is it time for a career switch
Is it time for a career switchIs it time for a career switch
Is it time for a career switch
 
Generating event storylines from microblogs
Generating event storylines from microblogsGenerating event storylines from microblogs
Generating event storylines from microblogs
 
Finding bursty topics from microblogs
Finding bursty topics from microblogsFinding bursty topics from microblogs
Finding bursty topics from microblogs
 
Exploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouthExploring social influence via posterior effect of word of-mouth
Exploring social influence via posterior effect of word of-mouth
 
Event summarization using tweets
Event summarization using tweetsEvent summarization using tweets
Event summarization using tweets
 

Recently uploaded

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 

Recently uploaded (20)

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 

Topical keyphrase extraction from twitter

  • 2. Abstract In this paper, we propose to extract topical keyphrases as one way to summarize Twitter. 1、A context-sensitive topical PageRank method for keyword ranking 2、A probabilistic scoring function that considers both relevance and interestingness of keyphrases for keyphrase ranking
  • 3. Introduction Keyphrases are defined as a short list of terms to summarize the topics of a document. Challenges: 1. Tweets are much shorter than traditional articles and not all tweets contain useful information; 2. Topics tend to be more diverse in Twitter than informal articles such as news reports. Keyword ranking; Candidate keyphrase generation; Keyphrase ranking.
  • 5. Method Topic discovery Keywords ranking and candidate generation Keyphrases ranking
  • 6. Topic discovery A modified author-topic model called Twitter-LDA introduced by Zhao etal.(2011)
  • 7. Topical PageRank for Keyword Ranking The topic-specific PageRank scores: TPR A topic context sensitive PageRank method: cTPR
  • 8. Topical PageRank for Keyword Ranking a common candidate keyphrase generation method proposed by Mihalcea and Tarau(2004) as follows: 1. Select the top S keywords for each topic 2. look for combinations of these keywords that occur as frequent phrases in the text collection
  • 9. Probabilistic Models for Topical Keyphrase Ranking Relevance : A good keyphrase should be closely related to the given topic and also discriminative. Interestingness : A good keyphrase should be interesting and can attract users’ attention.
  • 10. Probabilistic Models for Topical Keyphrase Ranking In general we can assume that P ( R = 0)>> P ( R = 1) because there are much more non-relevant keyphrases than relevant ones,that is, δ>> 1 .