SlideShare une entreprise Scribd logo
1  sur  97
Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions Kavita Ganesan, ChengXiang Zhai, Jiawei Han University of Illinois @ Urbana Champaign
Opinion Summarization Today… Opinion Summary for iPod Existing methods: Generate structured ratings for an entity[Lu et al., 2009; Lerman et al., 2009;..]
Opinion Summarization Today… Opinion Summary for iPod To know more: read many redundant sentences structured format useful, but not enough!
Ideally, we need….Supporting textual summary!
A good textual Opinion Summary should… ,[object Object]
What are the major complaints/praise in an aspect?
Concise
Easily digestible
Viewable on smaller screens
Readable
Easily understood,[object Object]
How to generate such summaries?
Widely studied for years[Radev et al.2000; Erkan & Radev, 2004; Mihalcea & Tarau, 2004…] ,[object Object],Generating concise summaries  Summarizing highly redundant text ,[object Object]
Bias:with limit on summary size
selected sentence may have missed critical info
Verbose:May contain irrelevant information
not suitable for smaller devicesExtractive Summarization
Widely studied for years[Radev et al.2000; Erkan & Radev, 2004; Mihalcea & Tarau, 2004…] ,[object Object],Generating concise summaries  Summarizing highly redundant text ,[object Object]
Bias:with limit on summary size
selected sentence may have missed critical info
Verbose:May contain irrelevant information
not suitable for smaller devicesExtractive Summarization Extractive Abstractive
Existing methods: Some methods require manual effort [DeJong1982] [Radev and McKeown1998] [Finley and Harabagiu2002] ,[object Object],Some methods rely heavily on NL understanding [Saggion and Lapalme2002] [Jing and McKeown2000] ,[object Object]
Impractical – high computational costsAbstractive Summarization - HARD!!
[object Object]
Generates concise summaries using:
existing text
inherent redundancies
Uses minimal external knowledge
 lightweightOur Method: Opinosis
Opinosis: High Level Overview
Opinosis: High Level Overview Set of sentences: ,[object Object]
POS annotatedInput
Opinosis: High Level Overview too Set of sentences: ,[object Object]
POS annotatedphone  with my  calls  frequently  drop  the  iphone is  a  Input  . great  Step 1: Generate graph representation of sentences (Opinosis-Graph) device
Opinosis: High Level Overview too Set of sentences: ,[object Object]
POS annotatedphone  with my  calls  frequently  drop  the  iphone is  a  Input  . calls  frequently  drop  great  device  great  device  3.2 candidate sum1 2.5 candidate sum2 Step 2:Find promising paths (candidate summaries) & score these candidates Step 1: Generate graph representation of sentences (Opinosis-Graph)
Set of sentences: ,[object Object]
POS annotatedOpinosis: High Level Overview too phone  with The iPhone is a great device, but calls drop frequently.  my  calls  frequently  drop  the  iphone is  a  Input  Step 3: Select top scoring candidates as final summary  . calls  frequently  drop  great  device  3.2 candidate sum1 2.5 candidate sum2 great  Step 2:Find promising paths (candidate summaries) & score these candidates Step 1: Generate graph representation of sentences (Opinosis-Graph) device
Step 1: Building the Opinosis-Graph
Assume: 2 sentences about “call quality of iphone” ,[object Object],Building Opinosis-Graph My phone calls drop frequently with the iPhone. Great device, but the calls drop too frequently.
Building Opinosis-Graph My phone calls drop frequently with the iPhone.
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my my unique(WORD + POS)  = node  1:1 1:1 Positional Reference Information SID PID
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  co-occurrence my phone 1:1 1:2 Positional Reference Information SID PID
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my phone calls 1:1 1:2 1:3
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my phone calls drop 1:1 1:2 1:3 1:4
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my phone calls drop frequently 1:1 1:2 1:3 1:4 1:5
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my phone calls drop frequently with 1:1 1:2 1:3 1:4 1:5 1:6
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my phone calls drop the frequently with 1:1 1:2 1:3 1:4 1:7 1:5 1:6
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my phone calls drop the iphone frequently with 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
Building Opinosis-Graph My phone calls drop frequently with the iPhone.  . my phone calls drop the iphone frequently with 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
. Building Opinosis-Graph My phone calls drop frequently with the iPhone.  my phone calls drop the iphone frequently with 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
Building Opinosis-Graph Great device, but the calls drop too frequently. . my phone calls drop the iphone frequently with 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
Building Opinosis-Graph Great device, but the calls drop too frequently. . my phone calls drop the iphone great device , but frequently with 2:1 2:2 2:3 2:4 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
Building Opinosis-Graph Great device, but the calls drop too frequently. . my phone calls drop iphone great device , but the frequently with 2:1 2:2 2:3 2:4 1:9 1:1 1:2 1:3 1:4 1:8 1:7 1:5 1:6
Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone calls drop the iphone frequently with 2:1 2:2 2:3 2:4 1:7,2:5 1:9 1:1 1:2 1:3 1:4 1:8 1:5 1:6
Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls frequently with 2:1 2:2 2:3 2:4 1:7,2:5 1:9 1:1 1:2 1:4 1:8 1:3,2:6 1:6 1:5
Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls frequently with 2:1 2:2 2:3 2:4 1:7,2:5 1:4, 2:7 1:9 1:1 1:2 1:8 1:3, 2:6 1:6 1:5
Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls too frequently with 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:9 1:1 1:2 1:8 1:3, 2:6 1:6 1:5
Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls too frequently with 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:9 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9 1:6
Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls too frequently with 1:9, 2:10 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9  1:6
Graph is now ready for Step 2! . great device , but my phone drop the iphone calls too frequently with 1:9, 2:10 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9  1:6 Building Opinosis-Graph
3 Important Properties of the Opinosis-Graph
Property 1 Naturally captures redundancies . great device , but my phone drop the iphone calls too frequently with 1:9, 2:10 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9  1:6
Naturally captures redundancies . the iphone my phone great device , but drop calls too with frequently 1:9, 2:10 1:7, 2:5 Property 1 2:1 2:2 2:3 2:4 2:8 1:8 1:1 1:2 1:4, 2:7 1:3, 2:6 1:6 1:5, 2:9  Path shared by 2 sentences naturally captured by nodes
Naturally captures redundancies . the iphone my phone great device , but drop calls too with frequently 1:9, 2:10 1:7, 2:5 Property 1 2:1 2:2 2:3 2:4 2:8 1:8 1:1 1:2 1:4, 2:7 1:3, 2:6 1:6 1:5, 2:9  Easily discover redundanciesfor high confidence summaries
Property 2 Captures gapped subsequences 1. My phone calls drop frequently with the iPhone.  2. Great device, but the calls drop toofrequently.
1. My phone calls drop frequently with the iPhone.  2. Great device, but the calls drop toofrequently. Captures gapped subsequences . the iphone my phone Property 2 great device , but drop calls too with frequently 2:8 1:4, 2:7 1:9, 2:10 1:7, 2:5 2:1 2:2 2:3 2:4 1:8 1:1 1:2 1:3, 2:6 1:5, 2:9  1:6 Gap between words = 2
Property 2 Captures gapped subsequences . great device , but the iphone my phone drop calls too with frequently 2:8 1:4, 2:7 1:9, 2:10 2:1 2:2 2:3 2:4 1:7, 2:5 1:8 1:1 1:2 1:3, 2:6 1:5, 2:9  1:6 Gapped subsequences allow: ,[object Object]
 discovery of new sentences,[object Object]
Captures collapsible structures 1. Calls drop frequently with the iPhone 2. Calls drop frequently with the Black Berry calls drop frequently with the iphone     black berry Property 3 - Can easily be discovered using OG - Ideal for collapse & compression
Step 2a: Generate Candidate Summaries
Repeatedly search the Opinosis-Graph for a Valid Path Generate Candidate Summaries
Set of connected nodes Has a Valid Start Node (VSN) Natural starting point of a sentence Opinosis uses avg. positional information Has a Valid End Node (VEN) Point that completes a sentence Opinosis uses punctuations & conjunctions Valid Path
Finding Candidate Summaries drop frequently with the iphone calls ,     . VSN VEN
Finding Candidate Summaries drop frequently with the iphone calls ,     . VSN VEN Candidate summary
Finding Candidate Summaries drop frequently with the iphone calls ,     . VSN VEN Pool of candidate summaries
Some paths are collapsible Identify such paths through a collapsible node Treat linking verbs (e.g. is, are)  as collapsible nodes Linking verbs have hub-like properties Commonly used in opinion text Collapsible Structures
A Collapsible Structure linking verb = collapsible node  is very clear the screen  big anchor - Common structure -High redundancy path collapsed candidates (CC) - Subgraphs to be merged
A Collapsible Structure  is very screen  clear the big collapse + merge and very clear big is screen the CC2 anchor CC1
CC after linking verbs: concatenate using commas “The screen isvery clear,bright,big” Better readability: “The screen is very clear,brightandbig” How to Merge Structures? CC1 CC2 CC3 Find last connector using hints from OG
Step 2b: Score Candidate Summaries
Type 1:  High confidence summaries Select candidateswithhigh redundancy # of sentences sharing same path  controlled by gap threshold, σgap Type 2: +Good coverage Select longer candidates redundancy * length of candidate paths Favor longer but redundant candidates Scoring Options
Gaps vary between sentences sharing nodes Gap Threshold (σgap) w2 w3 w1 Candidate X  Sentence  1  1  2  Sentence  2  4  4  Sentence  X m n gap gap
σgapenforces maximum allowed gap between two adjacent nodes Gap Threshold (σgap) w2 w1 Candidate X  σgap Sentence  1  1  <  σgap >  Sentence  2  4  Sentence  n m -Lower risk of ill-formed sentences  -Avoids over-estimation of redundancy     gap
After candidate scoring: Select top 2 scoring candidates  Most dissimilar candidates Step 3: Final Opinosis Summaries
Evaluation
User Reviews: Hotels: Tripadvisor.com  Products: Amazon.com Cars: Edmunds.com Data
Summarization Task ReviewsEdmunds Tripadvisor Amazon Topic 51 sentence 1….. sentence 2….. sentence 3….. sentence 4….. Topic 1 sentence 1….. sentence 2….. sentence 3….. sentence 4….. Topic 2 sentence 1….. sentence 2….. sentence 3….. sentence 4….. ~100  unordered,  topic-related, sentences review document 1 review document 51 review document 2 summarize summarize summarize
Human composed summaries Concise (<25 words) Focus on summarizing major opinions ~4 human summaries per topic Gold Standard
Hard to find ‘general’ abstractive summarizer Use MEAD - Extractive based method [Radev et al.2000] Select 2 sentences as the summary Baseline
[object Object],Standard measure for summarization tasks Readability Test Measures: How different Opinosis summaries are compared to human composed summaries? Evaluation Measures
Results
Estimate: How much one summary writer agrees with the rest Human Performance
Human Performance Human summaries - semantically similar. Slight difference in word usage.
Human vs. Opinosis vs. MEAD
Human vs. Opinosis vs. MEAD Highest recall Lowest precision Much longer sentences ROUGE Recall ROUGE Precision

Contenu connexe

Tendances

An insight into i phone user behaviour within the app store surikate_gfk
An insight into i phone user behaviour within the app store surikate_gfkAn insight into i phone user behaviour within the app store surikate_gfk
An insight into i phone user behaviour within the app store surikate_gfk
Mozoo
 

Tendances (18)

The iPhone 5
The iPhone 5The iPhone 5
The iPhone 5
 
iPhone
iPhone iPhone
iPhone
 
salesprop2_Tirado_S
salesprop2_Tirado_Ssalesprop2_Tirado_S
salesprop2_Tirado_S
 
Apple-Curious - The iPhone 5 from an Android Fan
Apple-Curious - The iPhone 5 from an Android FanApple-Curious - The iPhone 5 from an Android Fan
Apple-Curious - The iPhone 5 from an Android Fan
 
The iPhone Evolution
The iPhone EvolutionThe iPhone Evolution
The iPhone Evolution
 
Iphone 5g
Iphone 5gIphone 5g
Iphone 5g
 
power point Presentation on I Phone
power point Presentation on I Phonepower point Presentation on I Phone
power point Presentation on I Phone
 
Steve jobs iphone launch 2007
Steve jobs iphone launch 2007Steve jobs iphone launch 2007
Steve jobs iphone launch 2007
 
Evolution of the iPhone
Evolution of the iPhoneEvolution of the iPhone
Evolution of the iPhone
 
Samsung s4 vs apple iphone5s
Samsung s4 vs apple iphone5sSamsung s4 vs apple iphone5s
Samsung s4 vs apple iphone5s
 
iPhone Applications & Luxury Brands - Updated May 5, 2010
iPhone Applications & Luxury Brands - Updated May 5, 2010iPhone Applications & Luxury Brands - Updated May 5, 2010
iPhone Applications & Luxury Brands - Updated May 5, 2010
 
An insight into i phone user behaviour within the app store surikate_gfk
An insight into i phone user behaviour within the app store surikate_gfkAn insight into i phone user behaviour within the app store surikate_gfk
An insight into i phone user behaviour within the app store surikate_gfk
 
Features of i phone 12 pro max
Features of i phone 12 pro maxFeatures of i phone 12 pro max
Features of i phone 12 pro max
 
the impact of iPhone on Korean mobile communication business.
the impact of iPhone on Korean mobile communication business.the impact of iPhone on Korean mobile communication business.
the impact of iPhone on Korean mobile communication business.
 
iPhone
iPhoneiPhone
iPhone
 
I os 7
I os 7I os 7
I os 7
 
iPhone 6s
iPhone 6siPhone 6s
iPhone 6s
 
iOS7 features
iOS7 featuresiOS7 features
iOS7 features
 

En vedette

Query based summarization
Query based summarizationQuery based summarization
Query based summarization
damom77
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
Ali Habeeb
 
Tiempo presente del verbo to be en las formas
Tiempo presente del verbo to be en las formasTiempo presente del verbo to be en las formas
Tiempo presente del verbo to be en las formas
Nini Paz
 

En vedette (20)

Micropinion Generation
Micropinion GenerationMicropinion Generation
Micropinion Generation
 
Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)Opinion Mining Tutorial (Sentiment Analysis)
Opinion Mining Tutorial (Sentiment Analysis)
 
Opinion Driven Decision Support System
Opinion Driven Decision Support SystemOpinion Driven Decision Support System
Opinion Driven Decision Support System
 
Segmentation of Clinical Texts
Segmentation of Clinical TextsSegmentation of Clinical Texts
Segmentation of Clinical Texts
 
Opinion-Based Entity Ranking
Opinion-Based Entity RankingOpinion-Based Entity Ranking
Opinion-Based Entity Ranking
 
Introduction to Java Strings, By Kavita Ganesan
Introduction to Java Strings, By Kavita GanesanIntroduction to Java Strings, By Kavita Ganesan
Introduction to Java Strings, By Kavita Ganesan
 
In situ evaluation of entity retrieval and opinion summarization
In situ evaluation of entity retrieval and opinion summarizationIn situ evaluation of entity retrieval and opinion summarization
In situ evaluation of entity retrieval and opinion summarization
 
Community detection (Поиск сообществ в графах)
Community detection (Поиск сообществ в графах)Community detection (Поиск сообществ в графах)
Community detection (Поиск сообществ в графах)
 
Natural Language Processing with Graph Databases and Neo4j
Natural Language Processing with Graph Databases and Neo4jNatural Language Processing with Graph Databases and Neo4j
Natural Language Processing with Graph Databases and Neo4j
 
Argument extraction from news, blogs and social media.
Argument extraction from news, blogs and social media.Argument extraction from news, blogs and social media.
Argument extraction from news, blogs and social media.
 
TextRank: Bringing Order into Texts
TextRank: Bringing Order into TextsTextRank: Bringing Order into Texts
TextRank: Bringing Order into Texts
 
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
 SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ... SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
SummaRuNNer: A Recurrent Neural Network based Sequence Model for Extractive ...
 
Query based summarization
Query based summarizationQuery based summarization
Query based summarization
 
Tutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment AnalysisTutorial on Opinion Mining and Sentiment Analysis
Tutorial on Opinion Mining and Sentiment Analysis
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
 
AI & NLP pada @begobet
AI & NLP pada @begobetAI & NLP pada @begobet
AI & NLP pada @begobet
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
 
Introduction to py2neo
Introduction to py2neoIntroduction to py2neo
Introduction to py2neo
 
Automatic Summarizaton Tutorial
Automatic Summarizaton TutorialAutomatic Summarizaton Tutorial
Automatic Summarizaton Tutorial
 
Tiempo presente del verbo to be en las formas
Tiempo presente del verbo to be en las formasTiempo presente del verbo to be en las formas
Tiempo presente del verbo to be en las formas
 

Similaire à Opinosis Presentation @ Coling 2010: Opinosis - A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions

Team winner innovation project pres 2 x
Team winner innovation project   pres 2 xTeam winner innovation project   pres 2 x
Team winner innovation project pres 2 x
walkthis
 

Similaire à Opinosis Presentation @ Coling 2010: Opinosis - A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions (20)

Ambient computing at Lehigh University
Ambient computing at Lehigh UniversityAmbient computing at Lehigh University
Ambient computing at Lehigh University
 
Mobile application testing tutorial
Mobile application testing tutorialMobile application testing tutorial
Mobile application testing tutorial
 
iPhone Dev Camp Keynote
iPhone Dev Camp  KeynoteiPhone Dev Camp  Keynote
iPhone Dev Camp Keynote
 
iPhone Dev Camp Keynote
iPhone Dev Camp KeynoteiPhone Dev Camp Keynote
iPhone Dev Camp Keynote
 
iPhoneDevCamp Keynote
iPhoneDevCamp KeynoteiPhoneDevCamp Keynote
iPhoneDevCamp Keynote
 
iOS Ecosystem @ Fiera del Radioamatore Pordenone
iOS Ecosystem @ Fiera del Radioamatore PordenoneiOS Ecosystem @ Fiera del Radioamatore Pordenone
iOS Ecosystem @ Fiera del Radioamatore Pordenone
 
IOS multitasking is a lie, and that's totally fine.
IOS multitasking is a lie, and that's totally fine.IOS multitasking is a lie, and that's totally fine.
IOS multitasking is a lie, and that's totally fine.
 
I Phone Session Mufix
I Phone Session MufixI Phone Session Mufix
I Phone Session Mufix
 
Sugsa mobile ux_april 2013
Sugsa mobile ux_april 2013Sugsa mobile ux_april 2013
Sugsa mobile ux_april 2013
 
There's an app for that nov 2011
There's an app for that nov 2011There's an app for that nov 2011
There's an app for that nov 2011
 
iPhone Development Overview
iPhone Development OverviewiPhone Development Overview
iPhone Development Overview
 
Deepfakes - How they work and what it means for the future
Deepfakes - How they work and what it means for the futureDeepfakes - How they work and what it means for the future
Deepfakes - How they work and what it means for the future
 
How & where to start iOS development?
How & where to start iOS development?How & where to start iOS development?
How & where to start iOS development?
 
What I Learned about Mobile UX... The Hard Way
What I Learned about Mobile UX... The Hard WayWhat I Learned about Mobile UX... The Hard Way
What I Learned about Mobile UX... The Hard Way
 
What it would be like to build a system for publishing magazines on mobile de...
What it would be like to build a system for publishing magazines on mobile de...What it would be like to build a system for publishing magazines on mobile de...
What it would be like to build a system for publishing magazines on mobile de...
 
Processing Arabic Text
Processing Arabic TextProcessing Arabic Text
Processing Arabic Text
 
iOS development made easy
iOS development made easyiOS development made easy
iOS development made easy
 
Technical speaking 101
Technical speaking 101Technical speaking 101
Technical speaking 101
 
Droidcon thessaloniki 2015
Droidcon thessaloniki 2015Droidcon thessaloniki 2015
Droidcon thessaloniki 2015
 
Team winner innovation project pres 2 x
Team winner innovation project   pres 2 xTeam winner innovation project   pres 2 x
Team winner innovation project pres 2 x
 

Dernier

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Opinosis Presentation @ Coling 2010: Opinosis - A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions

  • 1. Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions Kavita Ganesan, ChengXiang Zhai, Jiawei Han University of Illinois @ Urbana Champaign
  • 2. Opinion Summarization Today… Opinion Summary for iPod Existing methods: Generate structured ratings for an entity[Lu et al., 2009; Lerman et al., 2009;..]
  • 3. Opinion Summarization Today… Opinion Summary for iPod To know more: read many redundant sentences structured format useful, but not enough!
  • 5.
  • 6. What are the major complaints/praise in an aspect?
  • 11.
  • 12. How to generate such summaries?
  • 13.
  • 14. Bias:with limit on summary size
  • 15. selected sentence may have missed critical info
  • 17. not suitable for smaller devicesExtractive Summarization
  • 18.
  • 19. Bias:with limit on summary size
  • 20. selected sentence may have missed critical info
  • 22. not suitable for smaller devicesExtractive Summarization Extractive Abstractive
  • 23.
  • 24. Impractical – high computational costsAbstractive Summarization - HARD!!
  • 25.
  • 32.
  • 34.
  • 35. POS annotatedphone with my calls frequently drop the iphone is a Input . great Step 1: Generate graph representation of sentences (Opinosis-Graph) device
  • 36.
  • 37. POS annotatedphone with my calls frequently drop the iphone is a Input . calls frequently drop great device great device 3.2 candidate sum1 2.5 candidate sum2 Step 2:Find promising paths (candidate summaries) & score these candidates Step 1: Generate graph representation of sentences (Opinosis-Graph)
  • 38.
  • 39. POS annotatedOpinosis: High Level Overview too phone with The iPhone is a great device, but calls drop frequently. my calls frequently drop the iphone is a Input Step 3: Select top scoring candidates as final summary . calls frequently drop great device 3.2 candidate sum1 2.5 candidate sum2 great Step 2:Find promising paths (candidate summaries) & score these candidates Step 1: Generate graph representation of sentences (Opinosis-Graph) device
  • 40. Step 1: Building the Opinosis-Graph
  • 41.
  • 42. Building Opinosis-Graph My phone calls drop frequently with the iPhone.
  • 43. Building Opinosis-Graph My phone calls drop frequently with the iPhone. my my unique(WORD + POS) = node 1:1 1:1 Positional Reference Information SID PID
  • 44. Building Opinosis-Graph My phone calls drop frequently with the iPhone. co-occurrence my phone 1:1 1:2 Positional Reference Information SID PID
  • 45. Building Opinosis-Graph My phone calls drop frequently with the iPhone. my phone calls 1:1 1:2 1:3
  • 46. Building Opinosis-Graph My phone calls drop frequently with the iPhone. my phone calls drop 1:1 1:2 1:3 1:4
  • 47. Building Opinosis-Graph My phone calls drop frequently with the iPhone. my phone calls drop frequently 1:1 1:2 1:3 1:4 1:5
  • 48. Building Opinosis-Graph My phone calls drop frequently with the iPhone. my phone calls drop frequently with 1:1 1:2 1:3 1:4 1:5 1:6
  • 49. Building Opinosis-Graph My phone calls drop frequently with the iPhone. my phone calls drop the frequently with 1:1 1:2 1:3 1:4 1:7 1:5 1:6
  • 50. Building Opinosis-Graph My phone calls drop frequently with the iPhone. my phone calls drop the iphone frequently with 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
  • 51. Building Opinosis-Graph My phone calls drop frequently with the iPhone. . my phone calls drop the iphone frequently with 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
  • 52. . Building Opinosis-Graph My phone calls drop frequently with the iPhone. my phone calls drop the iphone frequently with 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
  • 53. Building Opinosis-Graph Great device, but the calls drop too frequently. . my phone calls drop the iphone frequently with 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
  • 54. Building Opinosis-Graph Great device, but the calls drop too frequently. . my phone calls drop the iphone great device , but frequently with 2:1 2:2 2:3 2:4 1:9 1:1 1:2 1:3 1:4 1:7 1:8 1:5 1:6
  • 55. Building Opinosis-Graph Great device, but the calls drop too frequently. . my phone calls drop iphone great device , but the frequently with 2:1 2:2 2:3 2:4 1:9 1:1 1:2 1:3 1:4 1:8 1:7 1:5 1:6
  • 56. Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone calls drop the iphone frequently with 2:1 2:2 2:3 2:4 1:7,2:5 1:9 1:1 1:2 1:3 1:4 1:8 1:5 1:6
  • 57. Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls frequently with 2:1 2:2 2:3 2:4 1:7,2:5 1:9 1:1 1:2 1:4 1:8 1:3,2:6 1:6 1:5
  • 58. Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls frequently with 2:1 2:2 2:3 2:4 1:7,2:5 1:4, 2:7 1:9 1:1 1:2 1:8 1:3, 2:6 1:6 1:5
  • 59. Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls too frequently with 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:9 1:1 1:2 1:8 1:3, 2:6 1:6 1:5
  • 60. Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls too frequently with 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:9 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9 1:6
  • 61. Building Opinosis-Graph Great device, but the calls drop too frequently. . great device , but my phone drop the iphone calls too frequently with 1:9, 2:10 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9 1:6
  • 62. Graph is now ready for Step 2! . great device , but my phone drop the iphone calls too frequently with 1:9, 2:10 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9 1:6 Building Opinosis-Graph
  • 63. 3 Important Properties of the Opinosis-Graph
  • 64. Property 1 Naturally captures redundancies . great device , but my phone drop the iphone calls too frequently with 1:9, 2:10 2:1 2:2 2:3 2:4 1:7,2:5 2:8 1:4, 2:7 1:1 1:2 1:8 1:3, 2:6 1:5, 2:9 1:6
  • 65. Naturally captures redundancies . the iphone my phone great device , but drop calls too with frequently 1:9, 2:10 1:7, 2:5 Property 1 2:1 2:2 2:3 2:4 2:8 1:8 1:1 1:2 1:4, 2:7 1:3, 2:6 1:6 1:5, 2:9 Path shared by 2 sentences naturally captured by nodes
  • 66. Naturally captures redundancies . the iphone my phone great device , but drop calls too with frequently 1:9, 2:10 1:7, 2:5 Property 1 2:1 2:2 2:3 2:4 2:8 1:8 1:1 1:2 1:4, 2:7 1:3, 2:6 1:6 1:5, 2:9 Easily discover redundanciesfor high confidence summaries
  • 67. Property 2 Captures gapped subsequences 1. My phone calls drop frequently with the iPhone. 2. Great device, but the calls drop toofrequently.
  • 68. 1. My phone calls drop frequently with the iPhone. 2. Great device, but the calls drop toofrequently. Captures gapped subsequences . the iphone my phone Property 2 great device , but drop calls too with frequently 2:8 1:4, 2:7 1:9, 2:10 1:7, 2:5 2:1 2:2 2:3 2:4 1:8 1:1 1:2 1:3, 2:6 1:5, 2:9 1:6 Gap between words = 2
  • 69.
  • 70.
  • 71. Captures collapsible structures 1. Calls drop frequently with the iPhone 2. Calls drop frequently with the Black Berry calls drop frequently with the iphone black berry Property 3 - Can easily be discovered using OG - Ideal for collapse & compression
  • 72. Step 2a: Generate Candidate Summaries
  • 73. Repeatedly search the Opinosis-Graph for a Valid Path Generate Candidate Summaries
  • 74. Set of connected nodes Has a Valid Start Node (VSN) Natural starting point of a sentence Opinosis uses avg. positional information Has a Valid End Node (VEN) Point that completes a sentence Opinosis uses punctuations & conjunctions Valid Path
  • 75. Finding Candidate Summaries drop frequently with the iphone calls , . VSN VEN
  • 76. Finding Candidate Summaries drop frequently with the iphone calls , . VSN VEN Candidate summary
  • 77. Finding Candidate Summaries drop frequently with the iphone calls , . VSN VEN Pool of candidate summaries
  • 78. Some paths are collapsible Identify such paths through a collapsible node Treat linking verbs (e.g. is, are) as collapsible nodes Linking verbs have hub-like properties Commonly used in opinion text Collapsible Structures
  • 79. A Collapsible Structure linking verb = collapsible node is very clear the screen big anchor - Common structure -High redundancy path collapsed candidates (CC) - Subgraphs to be merged
  • 80. A Collapsible Structure is very screen clear the big collapse + merge and very clear big is screen the CC2 anchor CC1
  • 81. CC after linking verbs: concatenate using commas “The screen isvery clear,bright,big” Better readability: “The screen is very clear,brightandbig” How to Merge Structures? CC1 CC2 CC3 Find last connector using hints from OG
  • 82. Step 2b: Score Candidate Summaries
  • 83. Type 1: High confidence summaries Select candidateswithhigh redundancy # of sentences sharing same path controlled by gap threshold, σgap Type 2: +Good coverage Select longer candidates redundancy * length of candidate paths Favor longer but redundant candidates Scoring Options
  • 84. Gaps vary between sentences sharing nodes Gap Threshold (σgap) w2 w3 w1 Candidate X Sentence 1 1 2 Sentence 2 4 4 Sentence X m n gap gap
  • 85. σgapenforces maximum allowed gap between two adjacent nodes Gap Threshold (σgap) w2 w1 Candidate X σgap Sentence 1 1 < σgap > Sentence 2 4 Sentence n m -Lower risk of ill-formed sentences -Avoids over-estimation of redundancy gap
  • 86. After candidate scoring: Select top 2 scoring candidates Most dissimilar candidates Step 3: Final Opinosis Summaries
  • 88. User Reviews: Hotels: Tripadvisor.com Products: Amazon.com Cars: Edmunds.com Data
  • 89. Summarization Task ReviewsEdmunds Tripadvisor Amazon Topic 51 sentence 1….. sentence 2….. sentence 3….. sentence 4….. Topic 1 sentence 1….. sentence 2….. sentence 3….. sentence 4….. Topic 2 sentence 1….. sentence 2….. sentence 3….. sentence 4….. ~100 unordered, topic-related, sentences review document 1 review document 51 review document 2 summarize summarize summarize
  • 90. Human composed summaries Concise (<25 words) Focus on summarizing major opinions ~4 human summaries per topic Gold Standard
  • 91. Hard to find ‘general’ abstractive summarizer Use MEAD - Extractive based method [Radev et al.2000] Select 2 sentences as the summary Baseline
  • 92.
  • 94. Estimate: How much one summary writer agrees with the rest Human Performance
  • 95. Human Performance Human summaries - semantically similar. Slight difference in word usage.
  • 96. Human vs. Opinosis vs. MEAD
  • 97. Human vs. Opinosis vs. MEAD Highest recall Lowest precision Much longer sentences ROUGE Recall ROUGE Precision
  • 98. Human vs. Opinosis vs. MEAD Highest recall Lowest precision Overall: Baseline does not do well in generating concise summaries. Much longer sentences ROUGE Precision ROUGE Recall ROUGE Precision
  • 99. Human vs. Opinosis similar similar ROUGE Recall ROUGE Precision
  • 100. Human vs. Opinosis similar similar Performance of Opinosis is reasonable  similar to Human performance ROUGE Precision ROUGE Recall
  • 101. Effect of Gap Threshold (σgap)
  • 102. Effect of Gap Threshold (σgap) σgap
  • 103. Effect of Gap Threshold (σgap) small σgap words in summary close together in original text σgap
  • 104. Effect of Gap Threshold (σgap) Lowest performance Strict adjacency – disallows redundancies to be captured σgap
  • 105. Effect of Gap Threshold (σgap) Jump in performance More redundancies are captured σgap
  • 106. Effect of Gap Threshold (σgap) Small improvements afterwards σgap
  • 107. Effect of Gap Threshold (σgap) Small improvements afterwards σgap
  • 108. Effect of Gap Threshold (σgap) Small improvements afterwards σgap too large: ill formed sentences Set σgap to low value σgap
  • 109. Compare: Scoring Functions
  • 110. Compare: Scoring Functions redundancy & path length only redundancy σgap
  • 111. Compare: Scoring Functions redundancy & path length only redundancy redundancy & path length  summaries with better coverage σgap
  • 112. Readability Test Topic X Topic X Mixed Sentences sentence 1….. sentence 3….. sentence 2….. sentence 4….. sentence 8….. sentence 6….. sentence 7….. sentence 5….. ……………………… ……………………… Opinosis Generated sentence 1….. sentence 2….. MIX Human Composed 1 sentence 1….. sentence 2….. sentence Y.. Human Composed 4 sentence 1….. sentence 2….. sentence Z….. Pick at most 2 least readable sentences
  • 113. Assessor often picks: Opinosis sentences - Opinosis summaries have readability issues Non-Opinosis sentences or makes no picks - Opinosis summaries similar to human summaries Readability Test
  • 114. Assessor picked: 34/102Opinosisgeneratedsentences as least readable Our Readability Test
  • 115. Assessor picked: 34/102Opinosisgeneratedsentences as least readable Our Readability Test > 60% of Opinosis sentences are not very differentfrom humancomposed sentences
  • 116.
  • 117. Dataset and Demo Software is available http://timan.cs.uiuc.edu/downloads.html
  • 118. [Barzilay and Lee2003] Barzilay, Regina and Lillian Lee. 2003. Learning to paraphrase: an unsupervised approach using multiple-sequence alignment. In NAACL ’03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, pages 16–23, Morristown, NJ, USA. [DeJong1982] DeJong, Gerald F. 1982. An overview of the FRUMP system. In Lehnert, Wendy G. and Martin H. Ringle, editors, Strategies for Natural Language Processing, pages 149–176. Lawrence Erlbaum, Hillsdale, NJ. [Erkan and Radev2004] Erkan, G¨unes and Dragomir R. Radev. 2004. Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Int. Res.,22(1):457–479. [Finley and Harabagiu2002] Finley, SandaHarabagiu and Sanda M. Harabagiu. 2002. Generating single and multi-document summaries with gistexter. In Proceedings of the workshop on automatic summarization, pages 30–38. [Hu and Liu2004] Hu, Minqing and Bing Liu. 2004. Mining and summarizing customer reviews. In KDD, pages 168–177. [Jing and McKeown2000] Jing, Hongyan and Kathleen R. McKeown. 2000. Cut and paste based text summarization. In Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference, pages 178–185, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc. [Lerman et al.2009] Lerman, Kevin, Sasha Blair-Goldensohn, and Ryan Mcdonald. 2009. Sentiment summarization: Evaluating and learning user preferences. In 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL-09). [Lin and Hovy2003] Lin, Chin-Yew and Eduard Hovy. 2003. Automatic evaluation of summaries using ngram co-occurrence statistics. In Proc. HLT-NAACL, page 8 pages. [LIN2004a] LIN, Chin-Yew. 2004a. Looking for a few good metrics : Rouge and its evaluation. proc. of the 4th NTCIR Workshops, 2004. [Lin2004b] Lin, Chin-Yew. 2004b. Rouge: a package for automatic evaluation of summaries. In Proceedings of theWorkshop on Text Summarization Branches Out (WAS 2004), Barcelona, Spain. [Lu et al.2009] Lu, Yue, ChengXiang Zhai, and Neel Sundaresan. 2009. Rated aspect summarization of short comments. In 18th International World WideWeb Conference (WWW2009), April. References
  • 119. [Mihalcea and Tarau2004] Mihalcea, R. and P. Tarau. 2004. TextRank: Bringing order into texts. In Proceedings of EMNLP-04and the 2004 Conference on Empirical Methods in Natural Language Processing, July. [Pang and Lee2004] Pang, Bo and Lillian Lee. 2004. A sentimental education: Sentiment analysis using subjectivity summarization based on minimum cuts. In Proceedings of the ACL, pages 271–278. [Pang et al.2002] Pang, Bo, Lillian Lee, and ShivakumarVaithyanathan. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 79–86. [Radev and McKeown1998]Radev, DR and K. McKeown. 1998. Generating natural language summaries from multiple on-line sources. Computational Linguistics, 24(3):469–500. [Radev et al.2000] Radev, Dragomir, Hongyan Jing, and MalgorzataBudzikowska. 2000. Centroid-based summarization of multiple documents: Sentence extraction, utility-based evaluation, and user studies. In ANLP/NAACL Workshop on Summarization, pages 21–29. [Radev et al.2002] Radev, Dragomir R., Eduard Hovy, and Kathleen McKeown. 2002. Introduction to the special issue on summarization. [Saggion and Lapalme2002] Saggion, Horacio and Guy Lapalme. 2002. Generating indicative-informative summaries with sumum. Computational Linguistics, 28(4):497–526. [Snyder and Barzilay2007] Snyder, Benjamin and Regina Barzilay. 2007. Multiple aspect ranking using the good grief algorithm. In In Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics (HLT-NAACL), pages 300–307. [Titov and Mcdonald2008] Titov, Ivan and Ryan Mcdonald. 2008. A joint model of text and aspect ratings for sentiment summarization. In Proceedings of ACL-08: HLT, pages 308–316, Columbus, Ohio, June. Association for Computational Linguistics. ..References

Notes de l'éditeur

  1. Most work in opinion summarization focus on prediction the aspect based ratings for an entity.For example, for an ipod, you may predict that the appearance is 5 stars, ease of use is three stars..etc.
  2. But the problem is, if you wanted to know more about each of these aspects, you would actually have to read many sentences, from thethousands of reviews, to have your questions answered.
  3. For such textual summaries to be useful to users we first require that the summary actually summarizes the major opinions, is concise so that its viewable on smaller screens and of course should be reasonably readable.
  4. For textual summaries to actually be useful to users we first require that the summary actually summarizes the major opinions, is concise so that its viewable on smaller screens and of course should be reasonably readable.
  5. For such textual summaries to be useful to users we first require that the summary actually summarizes the major opinions, is concise so that its viewable on smaller screens and of course should be reasonably readable.
  6. Miss out information during sentence selection
  7. Miss out information during sentence selection
  8. -Compared to extractive summarization, the study of abstractive summarization on the whole has been quite limited. -and existing methods have some shortcomings. First, some of the techniques require considerable amounts of manual effort to define templates and scripts that can later be filled with extracted information. Then, some methods rely very heavily on natural language understanding which can be computationally quite costly and the methods are often very domain specific.
  9. Redundancies implies confidence
  10. Redundancies implies confidence
  11. Redundancies implies confidence
  12. Redundancies implies confidence
  13. Redundancies implies confidence
  14. 3rd property is that the graph captures collapsible structures. These are 2 new sentences….The first part of the 2 sentences are common, the second part is different.This is a new subgraph for 2 sentences with a common structure. Calls drop freq….and Calls drop freq…..
  15. Such structures are ideal for collapse and compression and such structures can be easily discovered using the OG
  16. Such structures can be easily discovered using the Opinosis-Graph
  17. Now, some paths may be collapsible. We need to identify such paths.
  18. This structure can be potentially collapsed to form a merged structure such as this, where we have the anchor and the merged collapsed candidates. But the question is, how do we merge such structures?
  19. So once we have the candidate summaries, we need to score the candidates, because not all are equally good.
  20. First, since we want high confidence summaries, then we select candidates that have high level of redundancy. This redundancy depends on the number of sentences that share the same path and this count is controlled by a gap threshold which is explained in the next slide.Then, if we want summaries with good coverage, then ideally longer sentences would be better. So for this, we weigh the level of redundancy by the length ofThe candidate paths. So you will favor longer sentences that are reasonably redundant.
  21. Each candidate summary is based on a set of sentences consisting of varying gaps between nodes. If we allow all gaps to be valid, then you risk generating ill formed sentences. So we restrict this, by introducing a gap threshold.
  22. If a sentence is a member of two adjacent node, but does not satisfy the gap threshold, then this entry will not contribute to the redundancy counts that we will use for scoring. This avoids over-estimation of redundancy
  23. The final step essentially involves choosing the candidates to be part of the summary.We choose the top 2 scoring candidates that are most dissimilar.
  24. Our raw data is based on user reviews from Tripadvisor, amazon &amp; edmunds.
  25. The input to the summarizer is a set of topic related sentences constructured from the raw reviews. Each of these is called a review document. And each review documentHas about 100 unordered sentences. The task is then to summarize each of these review documents.
  26. Now we will look into the effects of gap setting.A small gap requires that the words in the graph are close together in the actual text. This graph shows the ROUGE-1 scores at different levels of gap threshold. A small gap requires that the position of words are close together in the actual text.
  27. Now we will look into the effects of gap setting.A small gap requires that the words in the graph are close together in the actual text. This graph shows the ROUGE-1 scores at different levels of gap threshold. A small gap requires that the position of words are close together in the actual text.
  28. Now we will look into the effects of gap setting.A small gap requires that the words in the graph are close together in the actual text. This graph shows the ROUGE-1 scores at different levels of gap threshold. A small gap requires that the position of words are close together in the actual text.
  29. * So when the gap is one, the performance is lowest because strict adjacency does not allow redundancies to be easily discovered.
  30. When the gap is slightly relaxed notice that there is a significant jump in performance. This is because we are able to discover more redundancies, and thus we are ableto generate higher confidence summaries.
  31. As the gap increases, there is small amounts of improvement
  32. When the gap is too large however, you risk generating ill formed sentences. So the best is to set the gap threshold between 2 and 5
  33. This is the ROUGE-1 f-scores using different scoring functions. The orange line is for the scoring function that uses only redundancy, and the red and green line are for scoring functions that use both redundancy and path length
  34. It is obvious that the scoring functions that use redundancy &amp; path length perform better. This is because, the summaries have better coverage and thus better performance.
  35. For a given topic, we take the sentences from the opinosis summary and the corresponding human summaries to form a mixed set of sentences. From this set, we then ask a human assessor to pick 2 sentencesthat are least readable.