SlideShare une entreprise Scribd logo
1  sur  21
Corroborating
Information from
Affirmative Statements
Minji Wu, Rutgers University
Amélie Marian, Rutgers University
Background
• Information is often untrustworthy
• Erroneous (e.g, news site at breaking events)
• Misleading (e.g., malicious sources)
• Biased (e.g., political domains)
• Outdated (e.g., knowledge base that doesn’t update
frequently)
• This phenomenon is amplified by the widespread
information dependency (copy-paste)
• It is difficult for the user to discern the correctness of
information and the trustworthiness of the sources
2
Conflicting Information
3
Data Corroboration
• Early Corroboration
• Frequency-based approach
• Recent work on Corroboration techniques
• Trustworthiness of sources
A measure s(s) that quantify the precision of a source s
• Probability of information (facts)
A measure that s(f) quantify the probability that a fact f is true
• Starting with a default s(s), iteratively compute the
probabilities for the facts and the trustworthiness of the
sources
• Machine-Learning approaches
• Some corroboration problems can be seen as a ML
classification problem
4
What if there is no conflicts?
• Does the presence of information without
contradictions means it is correct?
5
Our Problem: Corroborating Information
with only Affirmative Statements
• We focus on scenarios in which sources have little
or no dissention
• Frequent real-world problem (rumors, hard-to-rebut
claims)
• Difficult to identify incorrect information since all
reported information is consistent
• Existing corroboration approaches do not work
well
• Rely on conflicting information to differentiate the
trustworthiness of the sources
6
Contributions
• Novel corroboration approach:
• Assigns multiple trust scores to each sources
• Considers the trustworthiness of the source for a group of
facts
• Corroboration algorithm incrementally evaluates facts
• Groups unknown facts based on the sources reporting
them
• Makes decisions based on information entropy
• Extensive real world and synthetic experiments that
demonstrate the benefits of our method
7
Evaluation Setting
• Corroboration task:
• Sources for restaurant address: Citysearch,
Foursquare, Menupages, Opentable, Yellowpages,
Yelp
• Golden set
• Selected restaurants in 3 zip codes: 601 listings
• Verified their legitimacy in person (Apr 2012)
• 340 true and 261 false
Identify legitimate restaurant listings in NYC given
the listing information from a set of sources
8
Motivating Example
Opentable Yelp Menupages Citysearch Yellowpag
es
Correct
value
M Bar T T true
Sam’s T T T T true
27 Sunshine T T T true
Crepe
Creations
T T false
El Portal T T false
Holy Basil F T false
Papatzul T T true
Wine Spot T T true
Vbar T T true
Wai Cafe T T false
Tomoe Sushi T T T true
Khushie 139 F F T false
9
State-of-the-art
Corroboration Strategies
Approaches
• TwoEstimate [Galland WSDM’10]
• Iteratively estimates the trust score of the sources
and the probability of the facts
• BayesEstimate [Zhao VLDB’12]
• Uses a Bayesian graphical model
• Considers a two-sided errors (false positives and
false negatives)Precisio
n
Recall Accurac
y
Computed trust scores
TwoEstimate .64 1 .67 (1, 1, 0.8, 0.9, 1)
BayesEstima
te
.58 1 .58 (1, 0.8, 0.6, 1, 1)
used to evaluate each fact!10
Key Observation
• Using the same trust score to judge the correctness
of all information is too coarse
• Each source may exhibit different accuracy towards
different group of facts
• The corroboration result could be greatly improved if
we could derive finer-grained trust scores for each
source
11
Multi-value trust scores for sources
Trust Scores
• Single-value trust scores (s(s))
• A single measure for each source
• Each fact is evaluated using the same value from each
source
• Multi-value trust scores
• A group of values assigned to each source
s(s) = < s1(s), s2(s), …>
• Each (group of) fact is evaluated using one of the trust
values from each source
12
Multi-Value Trust Scores
• Two major challenges
• How to calculate the trust values for each source
• How to decide which sources’ trust values to
consider for each fact
• Solution: an incremental evaluation mechanism
• Select a subset of facts to process
• Update the trust values based on the already
processed facts
• Facts are assigned a truth value when they are
processed
13
How to Select Facts?
• Model each fact f as a random variable
• Objective: compute the probability s(f) that f is true
• Information Entropy approach:
• Consider the entropy H(f) of each fact f
• The entropy of a random variable measures its uncertainty
• Our solution: select facts such that the entropy of
unknown facts are maximized
• Existing corroboration techniques normalize their results
to attain a probability of 1 (or 0) for each fact, i.e., entropy
of 0
• Reducing uncertainty leads to (too) early consensus
14
Heuristics for Selecting Facts
• Group facts based on the votes from sources
• At each step i:
• Calculate the entropy of each fact group using si(s)
• Calculate ΔH(FG) for each fact group FG
(Represents the change of entropy if FG is selected)
• Select both positive and negative fact groups with highest
ΔH(FG)
• Assign positive and negative values to the same number of
facts
15
Revisiting the running
example
Positive: {r7}, {r2}, {r3}, {r5, r8}, {r11}, {r9}, {r4, r10}, {r6}, {r1}
Negative: {r12}
Positive: {r3}, {r11}, {r5, r8}, {r2}, {r9}, {r1}
s(S)={0.9, 0.9, 0.9, 0.9,
0.9}
s(S)={1, 1, 1, 0, 0.9}
Negative: {r4, r10}, {r6}
F1={r7, r12}F2={r3, {r4, r10}}
Positive: {r9}, {r5, r8}, {r1}, {r11}, {r2}
s(S)={1, 1, 1, 0, 0.5}
Negative: {r10}, {r6}
F2={r3, r4}F3={r9, r10}
Positive: {r5, r8}, {r1}, {r11}, {r2}
s(S)={1, 1, 1, 0, 0.5}
Negative: {r6}
F4={r5, r6}
Positive: {r8}, {r3}, {r11}, {r2}
s(S)={1, 1, 1, 0, 0.5}
Negative:
True facts: r7
False facts:r12
r3
r4
r9
r10
r5
r6
r3 r8 r2 r11 Precision Recall Accurac
y
0.86 1 0.92
16
Precisio
n
Recall Accurac
y
Computed trust scores
TwoEstimate .64 1 .67 (1, 1, 0.8, 0.9, 1)
BayesEstima
te
.58 1 .58 (1, 0.8, 0.6, 1, 1)
IncEstHeu .86 1 .92 (0.9,0.9,0.9,0.9,0.9)
(1,1,1,0,0.9)
(1,1,1,0,0.5)
Experimental Setting
• Algorithms
• We implemented two strategies (IncEstPS, IncEstHeu) using
Java
• Frequency-based: Voting and Counting
• Existing Corroboration Techniques: TwoEstimate, BayesEstimate
• Machine Learning based: ML-SVM, ML-Logistic
• 36916 listings from 6 sources
• Metrics
• Precision, Recall, Accuracy
• Mean Square error (MSE) of trust score
17
Corroboration Results
Precision Recall Accuracy F-1
Voting 0.65 1.00 0.66 0.79
Counting 0.94 0.65 0.76 0.77
BayesEstimate 0.63 1.00 0.67 0.77
TwoEstimate 0.65 1.00 0.66 0.79
ML-SVM 0.98 0.74 0.77 0.84
ML-Logistic 0.86 0.85 0.82 0.82
IncEstPS 0.66 1.00 0.68 0.79
IncEstHeu 0.86 0.86 0.83 0.86
18
MSE on the sources
Yellowpag
es
Foursquar
e
Menupage
s
Opentabl
e
Citysearc
h
Yel
p
MSE
Accuracy 0.59 0.78 0.93 0.96 0.62 0.84 -
TwoEstimate 1.00 1.00 0.98 1.00 1.00 0.98 0.063
BayesEstimat
e
1.00 1.00 1.00 1.00 1.00 1.00 0.066
ML-Logistic 0.62 0.85 0.98 0.92 0.65 0.95 0.004
IncEstHeu 0.51 0.70 0.90 0.93 0.51 0.89 0.005
19
Multi-value Trust Score
• Simple Fact Selection • Entropy-based Fact
Selection
0
0.2
0.4
0.6
0.8
1
0 20 40 60 80 100
Trustscore
Time point
Yellowpages
Foursquare
Menupages
Opentable
Citysearch
Yelp
0.8
0.85
0.9
0.95
1
1.05
0 20 40 60 80 100
Trustscore
Time point
Yellowpages
Foursquare
Menupages
Opentable
Citysearch
Yelp
20
Conclusion
• Proposed techniques for corroborating facts with
mostly affirmative statements
• Designed a novel algorithm that adopts a multi-value
trust score for the sources
• Incrementally selects facts by leveraging the information
entropy of unknown facts
• Uses different sets of sources’ trust scores to evaluate ach
sets of facts
• Performed experiments using both real world and
synthetic (see paper) data
21

Contenu connexe

En vedette

Simple past ppt.
Simple past ppt.Simple past ppt.
Simple past ppt.
riham1411
 
Present simple explanation + exercises
Present simple explanation + exercisesPresent simple explanation + exercises
Present simple explanation + exercises
Teba Hernández
 
Partnership and open data as enablers of INSPIREd innovative services
Partnership and open data as enablers of INSPIREd innovative servicesPartnership and open data as enablers of INSPIREd innovative services
Partnership and open data as enablers of INSPIREd innovative services
smespire
 
Bridge the gap research strategies 2014
Bridge the gap research strategies 2014Bridge the gap research strategies 2014
Bridge the gap research strategies 2014
aoconno2
 
Moz food market_integration
Moz food market_integrationMoz food market_integration
Moz food market_integration
IFPRI-Maputo
 
Seminario biologia molecular l. donovani
Seminario biologia molecular  l. donovaniSeminario biologia molecular  l. donovani
Seminario biologia molecular l. donovani
Ana Parada
 
Textile testing-equipments-1224828041803735-9
Textile testing-equipments-1224828041803735-9Textile testing-equipments-1224828041803735-9
Textile testing-equipments-1224828041803735-9
Mazedul Islam Mazed
 
Q3 2012 Home Improvement Search Trends
Q3 2012 Home Improvement Search TrendsQ3 2012 Home Improvement Search Trends
Q3 2012 Home Improvement Search Trends
tonymaull92
 

En vedette (20)

judgment(proposition)
judgment(proposition)judgment(proposition)
judgment(proposition)
 
6. intonation
6. intonation6. intonation
6. intonation
 
Sentence transformations
Sentence transformationsSentence transformations
Sentence transformations
 
Simple past ppt.
Simple past ppt.Simple past ppt.
Simple past ppt.
 
Present simple explanation + exercises
Present simple explanation + exercisesPresent simple explanation + exercises
Present simple explanation + exercises
 
ระบบงานเฝ้าระวังโรคทางระบาดวิทยาสำนักงานสาธารณสุขอำเภอป่าบอน จังหวัดพัทลุง
ระบบงานเฝ้าระวังโรคทางระบาดวิทยาสำนักงานสาธารณสุขอำเภอป่าบอน จังหวัดพัทลุงระบบงานเฝ้าระวังโรคทางระบาดวิทยาสำนักงานสาธารณสุขอำเภอป่าบอน จังหวัดพัทลุง
ระบบงานเฝ้าระวังโรคทางระบาดวิทยาสำนักงานสาธารณสุขอำเภอป่าบอน จังหวัดพัทลุง
 
NA PERIFA
NA PERIFANA PERIFA
NA PERIFA
 
Partnership and open data as enablers of INSPIREd innovative services
Partnership and open data as enablers of INSPIREd innovative servicesPartnership and open data as enablers of INSPIREd innovative services
Partnership and open data as enablers of INSPIREd innovative services
 
Bridge the gap research strategies 2014
Bridge the gap research strategies 2014Bridge the gap research strategies 2014
Bridge the gap research strategies 2014
 
Hackathon - Mapping da National Core a INSPIRE (Hydrography)
Hackathon - Mapping da National Core a INSPIRE (Hydrography)Hackathon - Mapping da National Core a INSPIRE (Hydrography)
Hackathon - Mapping da National Core a INSPIRE (Hydrography)
 
Moz food market_integration
Moz food market_integrationMoz food market_integration
Moz food market_integration
 
Seminario biologia molecular l. donovani
Seminario biologia molecular  l. donovaniSeminario biologia molecular  l. donovani
Seminario biologia molecular l. donovani
 
Textile testing-equipments-1224828041803735-9
Textile testing-equipments-1224828041803735-9Textile testing-equipments-1224828041803735-9
Textile testing-equipments-1224828041803735-9
 
#quclms launch presentation
#quclms launch presentation#quclms launch presentation
#quclms launch presentation
 
Q3 2012 Home Improvement Search Trends
Q3 2012 Home Improvement Search TrendsQ3 2012 Home Improvement Search Trends
Q3 2012 Home Improvement Search Trends
 
My family
My familyMy family
My family
 
Werkwinkel Repair Café - Etapastocht
Werkwinkel Repair Café - EtapastochtWerkwinkel Repair Café - Etapastocht
Werkwinkel Repair Café - Etapastocht
 
Il condizionale
Il condizionaleIl condizionale
Il condizionale
 
Acoustic Correction of High School gym in Florence
Acoustic Correction of High School gym in FlorenceAcoustic Correction of High School gym in Florence
Acoustic Correction of High School gym in Florence
 
Clothes
ClothesClothes
Clothes
 

Similaire à Corroborating Facts from Affirmative Statements

Mir 2012 13 session #4
Mir 2012 13 session #4Mir 2012 13 session #4
Mir 2012 13 session #4
RichardGroom
 

Similaire à Corroborating Facts from Affirmative Statements (20)

Burns And Bush Chapter 16
Burns And Bush Chapter 16Burns And Bush Chapter 16
Burns And Bush Chapter 16
 
Haystax bayesian networks
Haystax bayesian networksHaystax bayesian networks
Haystax bayesian networks
 
CS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptxCS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptx
 
Statistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerceStatistical hypothesis testing in e commerce
Statistical hypothesis testing in e commerce
 
Qm 0809
Qm 0809 Qm 0809
Qm 0809
 
Mir 2012 13 session #4
Mir 2012 13 session #4Mir 2012 13 session #4
Mir 2012 13 session #4
 
Basic stat analysis using excel
Basic stat analysis using excelBasic stat analysis using excel
Basic stat analysis using excel
 
data mining
data miningdata mining
data mining
 
Part1
Part1Part1
Part1
 
Hypothesis testing interview
Hypothesis testing interviewHypothesis testing interview
Hypothesis testing interview
 
Environmental statistics
Environmental statisticsEnvironmental statistics
Environmental statistics
 
Introduction to Hypothesis Testing
Introduction to Hypothesis TestingIntroduction to Hypothesis Testing
Introduction to Hypothesis Testing
 
inferencial statistics
inferencial statisticsinferencial statistics
inferencial statistics
 
Data analysis
Data analysisData analysis
Data analysis
 
Missing data handling
Missing data handlingMissing data handling
Missing data handling
 
R - what do the numbers mean? #RStats
R - what do the numbers mean? #RStatsR - what do the numbers mean? #RStats
R - what do the numbers mean? #RStats
 
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and ProbabilismStatistical Inference as Severe Testing: Beyond Performance and Probabilism
Statistical Inference as Severe Testing: Beyond Performance and Probabilism
 
Kenett On Information NYU-Poly 2013
Kenett On Information NYU-Poly 2013Kenett On Information NYU-Poly 2013
Kenett On Information NYU-Poly 2013
 
5 numerical descriptive statitics
5 numerical descriptive statitics5 numerical descriptive statitics
5 numerical descriptive statitics
 
Statistics for Data Analysis - ODE - BVP .pptx
Statistics for Data Analysis - ODE - BVP .pptxStatistics for Data Analysis - ODE - BVP .pptx
Statistics for Data Analysis - ODE - BVP .pptx
 

Plus de Amélie Marian (8)

Integration and Exploration of Connected Personal Digital Traces
Integration and Exploration of Connected Personal Digital TracesIntegration and Exploration of Connected Personal Digital Traces
Integration and Exploration of Connected Personal Digital Traces
 
Miettes de données - Keynote BDA 2015
Miettes de données - Keynote BDA 2015Miettes de données - Keynote BDA 2015
Miettes de données - Keynote BDA 2015
 
Personal Information Management Systems - EDBT/ICDT'15 Tutorial
Personal Information Management Systems - EDBT/ICDT'15 TutorialPersonal Information Management Systems - EDBT/ICDT'15 Tutorial
Personal Information Management Systems - EDBT/ICDT'15 Tutorial
 
Personal Information Search and Discovery
Personal Information Search and DiscoveryPersonal Information Search and Discovery
Personal Information Search and Discovery
 
Personalizing Forum Search using Multidimensional Random Walks
Personalizing Forum Search using Multidimensional Random WalksPersonalizing Forum Search using Multidimensional Random Walks
Personalizing Forum Search using Multidimensional Random Walks
 
Searching Web Forums
Searching Web ForumsSearching Web Forums
Searching Web Forums
 
Remembrance of data past
Remembrance of data pastRemembrance of data past
Remembrance of data past
 
Searching data with substance and style
Searching data with substance and styleSearching data with substance and style
Searching data with substance and style
 

Dernier

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Victor Rentea
 

Dernier (20)

Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 

Corroborating Facts from Affirmative Statements

  • 1. Corroborating Information from Affirmative Statements Minji Wu, Rutgers University Amélie Marian, Rutgers University
  • 2. Background • Information is often untrustworthy • Erroneous (e.g, news site at breaking events) • Misleading (e.g., malicious sources) • Biased (e.g., political domains) • Outdated (e.g., knowledge base that doesn’t update frequently) • This phenomenon is amplified by the widespread information dependency (copy-paste) • It is difficult for the user to discern the correctness of information and the trustworthiness of the sources 2
  • 4. Data Corroboration • Early Corroboration • Frequency-based approach • Recent work on Corroboration techniques • Trustworthiness of sources A measure s(s) that quantify the precision of a source s • Probability of information (facts) A measure that s(f) quantify the probability that a fact f is true • Starting with a default s(s), iteratively compute the probabilities for the facts and the trustworthiness of the sources • Machine-Learning approaches • Some corroboration problems can be seen as a ML classification problem 4
  • 5. What if there is no conflicts? • Does the presence of information without contradictions means it is correct? 5
  • 6. Our Problem: Corroborating Information with only Affirmative Statements • We focus on scenarios in which sources have little or no dissention • Frequent real-world problem (rumors, hard-to-rebut claims) • Difficult to identify incorrect information since all reported information is consistent • Existing corroboration approaches do not work well • Rely on conflicting information to differentiate the trustworthiness of the sources 6
  • 7. Contributions • Novel corroboration approach: • Assigns multiple trust scores to each sources • Considers the trustworthiness of the source for a group of facts • Corroboration algorithm incrementally evaluates facts • Groups unknown facts based on the sources reporting them • Makes decisions based on information entropy • Extensive real world and synthetic experiments that demonstrate the benefits of our method 7
  • 8. Evaluation Setting • Corroboration task: • Sources for restaurant address: Citysearch, Foursquare, Menupages, Opentable, Yellowpages, Yelp • Golden set • Selected restaurants in 3 zip codes: 601 listings • Verified their legitimacy in person (Apr 2012) • 340 true and 261 false Identify legitimate restaurant listings in NYC given the listing information from a set of sources 8
  • 9. Motivating Example Opentable Yelp Menupages Citysearch Yellowpag es Correct value M Bar T T true Sam’s T T T T true 27 Sunshine T T T true Crepe Creations T T false El Portal T T false Holy Basil F T false Papatzul T T true Wine Spot T T true Vbar T T true Wai Cafe T T false Tomoe Sushi T T T true Khushie 139 F F T false 9
  • 10. State-of-the-art Corroboration Strategies Approaches • TwoEstimate [Galland WSDM’10] • Iteratively estimates the trust score of the sources and the probability of the facts • BayesEstimate [Zhao VLDB’12] • Uses a Bayesian graphical model • Considers a two-sided errors (false positives and false negatives)Precisio n Recall Accurac y Computed trust scores TwoEstimate .64 1 .67 (1, 1, 0.8, 0.9, 1) BayesEstima te .58 1 .58 (1, 0.8, 0.6, 1, 1) used to evaluate each fact!10
  • 11. Key Observation • Using the same trust score to judge the correctness of all information is too coarse • Each source may exhibit different accuracy towards different group of facts • The corroboration result could be greatly improved if we could derive finer-grained trust scores for each source 11 Multi-value trust scores for sources
  • 12. Trust Scores • Single-value trust scores (s(s)) • A single measure for each source • Each fact is evaluated using the same value from each source • Multi-value trust scores • A group of values assigned to each source s(s) = < s1(s), s2(s), …> • Each (group of) fact is evaluated using one of the trust values from each source 12
  • 13. Multi-Value Trust Scores • Two major challenges • How to calculate the trust values for each source • How to decide which sources’ trust values to consider for each fact • Solution: an incremental evaluation mechanism • Select a subset of facts to process • Update the trust values based on the already processed facts • Facts are assigned a truth value when they are processed 13
  • 14. How to Select Facts? • Model each fact f as a random variable • Objective: compute the probability s(f) that f is true • Information Entropy approach: • Consider the entropy H(f) of each fact f • The entropy of a random variable measures its uncertainty • Our solution: select facts such that the entropy of unknown facts are maximized • Existing corroboration techniques normalize their results to attain a probability of 1 (or 0) for each fact, i.e., entropy of 0 • Reducing uncertainty leads to (too) early consensus 14
  • 15. Heuristics for Selecting Facts • Group facts based on the votes from sources • At each step i: • Calculate the entropy of each fact group using si(s) • Calculate ΔH(FG) for each fact group FG (Represents the change of entropy if FG is selected) • Select both positive and negative fact groups with highest ΔH(FG) • Assign positive and negative values to the same number of facts 15
  • 16. Revisiting the running example Positive: {r7}, {r2}, {r3}, {r5, r8}, {r11}, {r9}, {r4, r10}, {r6}, {r1} Negative: {r12} Positive: {r3}, {r11}, {r5, r8}, {r2}, {r9}, {r1} s(S)={0.9, 0.9, 0.9, 0.9, 0.9} s(S)={1, 1, 1, 0, 0.9} Negative: {r4, r10}, {r6} F1={r7, r12}F2={r3, {r4, r10}} Positive: {r9}, {r5, r8}, {r1}, {r11}, {r2} s(S)={1, 1, 1, 0, 0.5} Negative: {r10}, {r6} F2={r3, r4}F3={r9, r10} Positive: {r5, r8}, {r1}, {r11}, {r2} s(S)={1, 1, 1, 0, 0.5} Negative: {r6} F4={r5, r6} Positive: {r8}, {r3}, {r11}, {r2} s(S)={1, 1, 1, 0, 0.5} Negative: True facts: r7 False facts:r12 r3 r4 r9 r10 r5 r6 r3 r8 r2 r11 Precision Recall Accurac y 0.86 1 0.92 16 Precisio n Recall Accurac y Computed trust scores TwoEstimate .64 1 .67 (1, 1, 0.8, 0.9, 1) BayesEstima te .58 1 .58 (1, 0.8, 0.6, 1, 1) IncEstHeu .86 1 .92 (0.9,0.9,0.9,0.9,0.9) (1,1,1,0,0.9) (1,1,1,0,0.5)
  • 17. Experimental Setting • Algorithms • We implemented two strategies (IncEstPS, IncEstHeu) using Java • Frequency-based: Voting and Counting • Existing Corroboration Techniques: TwoEstimate, BayesEstimate • Machine Learning based: ML-SVM, ML-Logistic • 36916 listings from 6 sources • Metrics • Precision, Recall, Accuracy • Mean Square error (MSE) of trust score 17
  • 18. Corroboration Results Precision Recall Accuracy F-1 Voting 0.65 1.00 0.66 0.79 Counting 0.94 0.65 0.76 0.77 BayesEstimate 0.63 1.00 0.67 0.77 TwoEstimate 0.65 1.00 0.66 0.79 ML-SVM 0.98 0.74 0.77 0.84 ML-Logistic 0.86 0.85 0.82 0.82 IncEstPS 0.66 1.00 0.68 0.79 IncEstHeu 0.86 0.86 0.83 0.86 18
  • 19. MSE on the sources Yellowpag es Foursquar e Menupage s Opentabl e Citysearc h Yel p MSE Accuracy 0.59 0.78 0.93 0.96 0.62 0.84 - TwoEstimate 1.00 1.00 0.98 1.00 1.00 0.98 0.063 BayesEstimat e 1.00 1.00 1.00 1.00 1.00 1.00 0.066 ML-Logistic 0.62 0.85 0.98 0.92 0.65 0.95 0.004 IncEstHeu 0.51 0.70 0.90 0.93 0.51 0.89 0.005 19
  • 20. Multi-value Trust Score • Simple Fact Selection • Entropy-based Fact Selection 0 0.2 0.4 0.6 0.8 1 0 20 40 60 80 100 Trustscore Time point Yellowpages Foursquare Menupages Opentable Citysearch Yelp 0.8 0.85 0.9 0.95 1 1.05 0 20 40 60 80 100 Trustscore Time point Yellowpages Foursquare Menupages Opentable Citysearch Yelp 20
  • 21. Conclusion • Proposed techniques for corroborating facts with mostly affirmative statements • Designed a novel algorithm that adopts a multi-value trust score for the sources • Incrementally selects facts by leveraging the information entropy of unknown facts • Uses different sets of sources’ trust scores to evaluate ach sets of facts • Performed experiments using both real world and synthetic (see paper) data 21