SlideShare une entreprise Scribd logo
1  sur  22
SirenLess:
reveal the intention behind news
EuroVis 2020
Xumeng Chen, Leo Yu-Ho Lo, Huamin Qu
Presenter: Hwiyeon Kim
Overview of the paper
• Motivation of the study
 Popularization of journalism (writers with ulterior intentions)
• authority & credibility of news articles?
 Can be difficult for untrained readers to identify those latent intentions
 Some machine learning approaches provide limited help for humans in
decision making
• Goal of the study
 Present a visual analytical system for misleading news detection by
linguistic features
• Analyze 18 news articles from different sources
• Summarize helpful patterns for misleading news detection
• User study  Confirm usefulness & effectiveness of the system
Main Topics of this work
• Design & implementation – how & why we made the system
 Design Goals
 System and Data
 Visual Design
• Findings & Evaluation – what we can find new with the system
 Patterns of unreliable news
 With a professor & 9 students
2
Background Knowledge
• A rise of fake news
 Introduced the nature & impact of fake news (Lazer et al. ‘18)
 Problem: fake news intended to provoke emotions (Bakir et al. ‘18) extremist behavior (Aisch et al. ‘16)
 Promising directions: rhetorical structure and discourse analysis (Rubin et al. ‘15)
• Suggested solutions by previous works
 Transmission of fake news
• trying to capture the pattern of spread of misinformation & factors contributing to it (Vosoughi et al. ‘18)
 Content analysis
• Flag the malicious intent of the writer
• Automated deception detection with high (80~90%) accuracy (Feng et al. ‘12)
• Deceptive stories are separable from truthful stories in the rhetorical structure feature space
 Equip readers with the ability to evaluate credibility of fake news
• Fact-checking websites (FactCheck.org, PolitiFact, Ad Fontes Media)
• Education (Bergstrom et al. ‘17)
3
4
Design goals
• Inspired by Conroy et al. 's survey on automatic deception detection
• Goals
 G1: Provide a quick overview of the languate usage of news articles
 G2: Present news meta data to help users grasp its semantic strucure
 G3: Let users gain direct access and reference to the article text
• Tasks
 T1: Reveal the sentiment and discourse mode distribution of the article [G1]
 T2: Identify the estimated subjectivity and readability level of the article [G1]
 T3: Identify and compare character and keywords occurences in the article [G2]
 T4: Provide the original text [G3]
5
System and Data
• Automatic data processing pipeline
 Plain text –(extract)-> High-level semantics
• Discourse mode analysis
 Combine General text analysis (narration, description, exposition and argument)
+ Tom Wolfe’s Theory
 Five Categories
• Narration: the most important part of storytelling, author’s interpretation
• Argument: analysis and ideas of the author
• Quote: directly repeat the passage of a person
• Description: detailed depiction, aimed at rebuilding the original scene
• Background: fact-checked background information aimed at helping readers understand the
current story
• Training Data
 312 news articles from Fox News, ABC News, New York Times, the Economist..
 Make a rough pre-filtering of articles with the Pew Research Center
6
Visual Design
Article Explorer Module
Article Stat Module
Reader View Module
7
Visual Design
Article Explorer Module
• Visualizes the distribution of discourse
modes
• Sentiment Distribution
• Problem: sentiment score of news
articles goes zero after being aver
aged
• Solution: distribution of sentiment
of the whole article from negative
(-1) to positive (1)
• Extreme sentiment sentences(>0.
5) could easily draw readers’ atten
tion
8
Visual Design
• Discourse Mode Distribution
• Choose color as the visual channel
• Common discourse mode narration; a low
-saturation
• Argument; reddish
• Background information; purple
• Description; green
• Quotes; sky-blue
• Reveal Metadata of News Articles
• Inspired by the ‘5W&1H’ theory
• ‘why’ & ‘how’: high-level, hard to be extra
cted by the computer
• ‘where’ & ‘when’ : give limited help to the
understanding of news, tried to overuse
markers
• Decided to visualize ‘who’ (characters) an
d ‘what’ (keywords grouped by topics)
Article Explorer Module
9
Visual Design
Article Stats Module
• Sentiment and discourse mode stats
• Overall status and sentence-level information
• Article stats
• Flesh-Kincaid readability grade: difficulty level of an article by
its vocabulary use
• Misleading news likes to be easy-to-read  spread faster thr
oughout the general public (‘15, Coroy et al.)
10
Patterns of unreliable news
• Analyzed 8 misleading articles using the system
 Some linguistic features (sentiment dist., article subjectivity, article readability) have strong indicative patterns o
n their own / together
Unreliable news tends to be emotional
• Dominated by one polarity (a, b, c) Fluctuating between two polarities (d)
11
Patterns of unreliable news
Subjectivity of a realiable article is less than 0.2
• Subjectivity is another strong indicator to show the reliability of news articles
• reliable articles could have a subjectivity score < 0.2
Unreliable news shows an easy-to- read pattern
• With a Flesch-Kincaid readability grade greater than 0.3 in the stud
y (cognitive level of secondary & college levels)
• Reliable news: cognitive range of college & graduate levels
12
Patterns of unreliable news
Unreliable articles include considerable portion of arguments
• Subjectivity is another strong indicator to show the reliability of news articles
• Reliable news are narrative without subjective arguments
Sly writers arrange background
/descriptions with their person
al opinions
13
Patterns of unreliable news
Sophisticatd writers selectively use other's mouth to convey their words
• Easy to be ignored when reading plain text but can be seen clearly through visualization
14
Evaluation
• Journalism scholar review
 One journalism professor to evaluate the system thorugh a questionnaire
 "Meaningful and potentially useful tool for news/information analytics"
 "The features of writing style, sentiment, and keywords, etc. can be relevant indica
tors of journalistic performance, depending on the usage context."
 Recommended usage field
• For academic research in media content analysis
• Teaching and learning in jouranlism and public relations couses
• News room practice especially for editors
15
Evaluation
• User study
 Among college students to verify whether it can help them to identify the misleading intention
of the news articles
 9 participants (4 undergraduate and 6 postgraduagte students)
 Two steps
1. Read first and review with the visualization system
• For one selected article
• 2 out of 9  8 out of 9 spotted the bias
2. View visualization system first then read under the help of the visualization system
• For two selected article (same event different standpoints from different news organization)
• 7 (highly alerted), 5 (dubious), and 6 (looks objective) out of 18  unanimously agreed the articles are biased
 Questionnaire
• Help me to spot the intention of misleading in the articles
• 4 participants (strongly agree), 3 participants (agree), 2 participants(neutral)
• Help me to spot the bias towards different entities or events in the articles
• 5 participants (strongly agree), 4 participants (agree)
16
Discussion
• Limitation
 Accuracy of feature extraction
• Some extracted features are denied by human
 Interaction and readability
• Function filters to radar chart – sentiment/discoure and characters/events
• Future Work
 Extending the system and supplement missing dimensions
• Enable comparison by aggregating articles of the same topic (data collection pipline)
• Integrate related external information such as comments and Wikipedia to enable fact check (cr
oss-validation through crowdworkers)
 Generalization to other domains
• Help students analyze patterns of TOEFL writing samples..
17
Takeaways
1. An interactive visualization design that could act as entry point or hint
for further visualization research
2. A field study on current computer-based news analytical techniques
3. A case study reporting patterns found by their methodology
Criticism
1. Well-structured and logical writing (with few questions why)
2. Design rationale is dense
3. Interesting and noticeable findings on unreliable articles
4. Visualization dashboards use reliable indicators
 The authors introduce useful indicators
18
• Very useful when building article selection crite
ria for article reading experiments
Criticism
19
1. Need more information about training data
 Which kind of topic? Politics? Economy? Or else?
2. In the section that describes why they chose that color
 Need reference of each color meaning (controversial)
 "purple represents wisdom... Green and sky blue are safe and reliable.."
3. Not fully convinced with visualizing only who and what
 Missing description of the scope of the article selected in the paper
• I can guess the mainly discussed topic in this paper is politics but not sure..
 Where and when are more important information for infectious diseaes such as Corona virus
4. Study procedure description is quite unclear
 Missing information of used articles
 For the second task; why the authors count all samples for two articles (9 from one and 9 from the other)?
 Participants and articles are quite small number; needed to be studies on more diverse articles
• or at least mention the scope as ‘politics’ only
20
My IDEA for Future Work
• Education
 Learning structure is essential for writing
 Examples are indespensable
• show reliable and unreliable news articles to students or junior journalists
 From examples, try to figure out features and create reliabe/unreliable news articles
 Self-verification of the reliability of their writing
• Filtering function
 Filtering system (or an app) for readers
 Notice subjectivity level and other stats
 Do stats influence readers' acceptance of news content?
• Article’s subjectivity may change depending on the reader’s political propensity
Thank you 
Any questions?

Contenu connexe

Tendances

Product Sentiment Analysis
Product Sentiment AnalysisProduct Sentiment Analysis
Product Sentiment Analysis
nancy amala
 
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Knowledge Media Institute - The Open University
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
Ali Habeeb
 

Tendances (20)

Sentiment Analysis
Sentiment AnalysisSentiment Analysis
Sentiment Analysis
 
Practical sentiment analysis
Practical sentiment analysisPractical sentiment analysis
Practical sentiment analysis
 
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
Sentiment mining- The Design and Implementation of an Internet PublicOpinion...Sentiment mining- The Design and Implementation of an Internet PublicOpinion...
Sentiment mining- The Design and Implementation of an Internet Public Opinion...
 
Final deck
Final deckFinal deck
Final deck
 
Arcomem training diversification
Arcomem training diversificationArcomem training diversification
Arcomem training diversification
 
Twitter data analysis using R
Twitter data analysis using RTwitter data analysis using R
Twitter data analysis using R
 
Multimedia data minig and analytics sentiment analysis using social multimedia
Multimedia data minig and analytics sentiment analysis using social multimediaMultimedia data minig and analytics sentiment analysis using social multimedia
Multimedia data minig and analytics sentiment analysis using social multimedia
 
Introduction to Sentiment Analysis
Introduction to Sentiment AnalysisIntroduction to Sentiment Analysis
Introduction to Sentiment Analysis
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Product Sentiment Analysis
Product Sentiment AnalysisProduct Sentiment Analysis
Product Sentiment Analysis
 
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new datase...
 
Ontology based sentiment analysis
Ontology based sentiment analysisOntology based sentiment analysis
Ontology based sentiment analysis
 
Sentiment Analysis of Twitter Data
Sentiment Analysis of Twitter DataSentiment Analysis of Twitter Data
Sentiment Analysis of Twitter Data
 
Sentiment analysis
Sentiment analysisSentiment analysis
Sentiment analysis
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
 
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEWSENTIMENT ANALYSIS-AN OBJECTIVE VIEW
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
 
Sentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big DataSentiment analysis in Twitter on Big Data
Sentiment analysis in Twitter on Big Data
 
These slides cover the final defense presentation for my Doctorate degree. Th...
These slides cover the final defense presentation for my Doctorate degree. Th...These slides cover the final defense presentation for my Doctorate degree. Th...
These slides cover the final defense presentation for my Doctorate degree. Th...
 
Sentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use casesSentiment analysis - Our approach and use cases
Sentiment analysis - Our approach and use cases
 
Making qualitative analysis more transparent by using NVivo
Making qualitative analysis more transparent by using NVivoMaking qualitative analysis more transparent by using NVivo
Making qualitative analysis more transparent by using NVivo
 

Similaire à [0211]hwiyeon

COM 510 Final Project One Research Paper Guidelines and Rub.docx
COM 510 Final Project One Research Paper Guidelines and Rub.docxCOM 510 Final Project One Research Paper Guidelines and Rub.docx
COM 510 Final Project One Research Paper Guidelines and Rub.docx
monicafrancis71118
 
Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...
Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...
Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...
jdubrow2000
 
CM208 ResearchUnit 9 AssignmentThroughout the class you h.docx
CM208 ResearchUnit 9 AssignmentThroughout the class you h.docxCM208 ResearchUnit 9 AssignmentThroughout the class you h.docx
CM208 ResearchUnit 9 AssignmentThroughout the class you h.docx
monicafrancis71118
 
1Week 5Critiquing Research Articles to Prepare an Annotated B.docx
1Week 5Critiquing Research Articles to Prepare an Annotated B.docx1Week 5Critiquing Research Articles to Prepare an Annotated B.docx
1Week 5Critiquing Research Articles to Prepare an Annotated B.docx
felicidaddinwoodie
 
Data analysis – qualitative data presentation 2
Data analysis – qualitative data   presentation 2Data analysis – qualitative data   presentation 2
Data analysis – qualitative data presentation 2
Azura Zaki
 
The title of the research project is Representation of the.docx
The title of the research project is Representation of the.docxThe title of the research project is Representation of the.docx
The title of the research project is Representation of the.docx
write4
 

Similaire à [0211]hwiyeon (20)

Explainable AI is not yet Understandable AI
Explainable AI is not yet Understandable AIExplainable AI is not yet Understandable AI
Explainable AI is not yet Understandable AI
 
Components in qualitative research proposal Three aspects of qualitative wri...
Components in qualitative research proposal  Three aspects of qualitative wri...Components in qualitative research proposal  Three aspects of qualitative wri...
Components in qualitative research proposal Three aspects of qualitative wri...
 
Lern, jan 2015, digital media slides
Lern, jan 2015, digital media slidesLern, jan 2015, digital media slides
Lern, jan 2015, digital media slides
 
research methodogy
research methodogyresearch methodogy
research methodogy
 
COM 510 Final Project One Research Paper Guidelines and Rub.docx
COM 510 Final Project One Research Paper Guidelines and Rub.docxCOM 510 Final Project One Research Paper Guidelines and Rub.docx
COM 510 Final Project One Research Paper Guidelines and Rub.docx
 
Coding.ppt
Coding.pptCoding.ppt
Coding.ppt
 
Tesol proposoal writingworkshop
Tesol proposoal writingworkshopTesol proposoal writingworkshop
Tesol proposoal writingworkshop
 
The role of new information and communication technologies in information and...
The role of new information and communication technologies in information and...The role of new information and communication technologies in information and...
The role of new information and communication technologies in information and...
 
Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...
Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...
Proseminar in Cross-National Studies: From Idea to Publishing in the Social S...
 
Enjoy writing your dissertation
Enjoy writing your dissertationEnjoy writing your dissertation
Enjoy writing your dissertation
 
CM208 ResearchUnit 9 AssignmentThroughout the class you h.docx
CM208 ResearchUnit 9 AssignmentThroughout the class you h.docxCM208 ResearchUnit 9 AssignmentThroughout the class you h.docx
CM208 ResearchUnit 9 AssignmentThroughout the class you h.docx
 
Anu digital research literacies
Anu digital research literaciesAnu digital research literacies
Anu digital research literacies
 
Critical Internet Literacy: “synthetic media“, “lateral reading“ and beliefs ...
Critical Internet Literacy: “synthetic media“, “lateral reading“ and beliefs ...Critical Internet Literacy: “synthetic media“, “lateral reading“ and beliefs ...
Critical Internet Literacy: “synthetic media“, “lateral reading“ and beliefs ...
 
Why altmetrics?
Why altmetrics?Why altmetrics?
Why altmetrics?
 
Statement of problem_Z.pdf
Statement of problem_Z.pdfStatement of problem_Z.pdf
Statement of problem_Z.pdf
 
1Week 5Critiquing Research Articles to Prepare an Annotated B.docx
1Week 5Critiquing Research Articles to Prepare an Annotated B.docx1Week 5Critiquing Research Articles to Prepare an Annotated B.docx
1Week 5Critiquing Research Articles to Prepare an Annotated B.docx
 
Data analysis – qualitative data presentation 2
Data analysis – qualitative data   presentation 2Data analysis – qualitative data   presentation 2
Data analysis – qualitative data presentation 2
 
Helping Users Discover Perspectives: Enhancing Opinion Mining with Joint Topi...
Helping Users Discover Perspectives: Enhancing Opinion Mining with Joint Topi...Helping Users Discover Perspectives: Enhancing Opinion Mining with Joint Topi...
Helping Users Discover Perspectives: Enhancing Opinion Mining with Joint Topi...
 
The title of the research project is Representation of the.docx
The title of the research project is Representation of the.docxThe title of the research project is Representation of the.docx
The title of the research project is Representation of the.docx
 
Dean R Berry Loss of Privacy: Necessary Evil or Unwanted Invasion Student Pro...
Dean R Berry Loss of Privacy: Necessary Evil or Unwanted Invasion Student Pro...Dean R Berry Loss of Privacy: Necessary Evil or Unwanted Invasion Student Pro...
Dean R Berry Loss of Privacy: Necessary Evil or Unwanted Invasion Student Pro...
 

Plus de ivaderivader

Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
ivaderivader
 
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
ivaderivader
 

Plus de ivaderivader (20)

Argument Mining
Argument MiningArgument Mining
Argument Mining
 
Papers at CHI23
Papers at CHI23Papers at CHI23
Papers at CHI23
 
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph KernelsDDGK: Learning Graph Representations for Deep Divergence Graph Kernels
DDGK: Learning Graph Representations for Deep Divergence Graph Kernels
 
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
So Predictable! Continuous 3D Hand Trajectory Prediction in Virtual Reality
 
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
Reinforcement Learning-based Placement of Charging Stations in Urban Road Net...
 
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
Prediction for Retrospection: Integrating Algorithmic Stress Prediction into ...
 
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Orien...
 
A Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial NetworksA Style-Based Generator Architecture for Generative Adversarial Networks
A Style-Based Generator Architecture for Generative Adversarial Networks
 
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
CatchLIve: Real-time Summarization of Live Streams with Stream Content and In...
 
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for VisualizationPerception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
Perception! Immersion! Empowerment! Superpowers as Inspiration for Visualization
 
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
Learning to Remember Patterns: Pattern Matching Memory Networks for Traffic F...
 
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-PoolingNeural Approximate Dynamic Programming for On-Demand Ride-Pooling
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
 
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
StoryMap: Using Social Modeling and Self-Modeling to Support Physical Activit...
 
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTubeBad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
Bad Breakdowns, Useful Seams, and Face Slapping: Analysis of VR Fails on YouTube
 
Invertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise RemovalInvertible Denoising Network: A Light Solution for Real Noise Removal
Invertible Denoising Network: A Light Solution for Real Noise Removal
 
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural NetworkTraffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
 
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training  MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training
 
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI ComponentsScreen2Vec: Semantic Embedding of GUI Screens and GUI Components
Screen2Vec: Semantic Embedding of GUI Screens and GUI Components
 
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
Augmenting Decisions of Taxi Drivers through Reinforcement Learning for Impro...
 
Natural Language to Visualization by Neural Machine Translation
Natural Language to Visualization by Neural Machine TranslationNatural Language to Visualization by Neural Machine Translation
Natural Language to Visualization by Neural Machine Translation
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 

[0211]hwiyeon

  • 1. SirenLess: reveal the intention behind news EuroVis 2020 Xumeng Chen, Leo Yu-Ho Lo, Huamin Qu Presenter: Hwiyeon Kim
  • 2. Overview of the paper • Motivation of the study  Popularization of journalism (writers with ulterior intentions) • authority & credibility of news articles?  Can be difficult for untrained readers to identify those latent intentions  Some machine learning approaches provide limited help for humans in decision making • Goal of the study  Present a visual analytical system for misleading news detection by linguistic features • Analyze 18 news articles from different sources • Summarize helpful patterns for misleading news detection • User study  Confirm usefulness & effectiveness of the system
  • 3. Main Topics of this work • Design & implementation – how & why we made the system  Design Goals  System and Data  Visual Design • Findings & Evaluation – what we can find new with the system  Patterns of unreliable news  With a professor & 9 students 2
  • 4. Background Knowledge • A rise of fake news  Introduced the nature & impact of fake news (Lazer et al. ‘18)  Problem: fake news intended to provoke emotions (Bakir et al. ‘18) extremist behavior (Aisch et al. ‘16)  Promising directions: rhetorical structure and discourse analysis (Rubin et al. ‘15) • Suggested solutions by previous works  Transmission of fake news • trying to capture the pattern of spread of misinformation & factors contributing to it (Vosoughi et al. ‘18)  Content analysis • Flag the malicious intent of the writer • Automated deception detection with high (80~90%) accuracy (Feng et al. ‘12) • Deceptive stories are separable from truthful stories in the rhetorical structure feature space  Equip readers with the ability to evaluate credibility of fake news • Fact-checking websites (FactCheck.org, PolitiFact, Ad Fontes Media) • Education (Bergstrom et al. ‘17) 3
  • 5. 4 Design goals • Inspired by Conroy et al. 's survey on automatic deception detection • Goals  G1: Provide a quick overview of the languate usage of news articles  G2: Present news meta data to help users grasp its semantic strucure  G3: Let users gain direct access and reference to the article text • Tasks  T1: Reveal the sentiment and discourse mode distribution of the article [G1]  T2: Identify the estimated subjectivity and readability level of the article [G1]  T3: Identify and compare character and keywords occurences in the article [G2]  T4: Provide the original text [G3]
  • 6. 5 System and Data • Automatic data processing pipeline  Plain text –(extract)-> High-level semantics • Discourse mode analysis  Combine General text analysis (narration, description, exposition and argument) + Tom Wolfe’s Theory  Five Categories • Narration: the most important part of storytelling, author’s interpretation • Argument: analysis and ideas of the author • Quote: directly repeat the passage of a person • Description: detailed depiction, aimed at rebuilding the original scene • Background: fact-checked background information aimed at helping readers understand the current story • Training Data  312 news articles from Fox News, ABC News, New York Times, the Economist..  Make a rough pre-filtering of articles with the Pew Research Center
  • 7. 6 Visual Design Article Explorer Module Article Stat Module Reader View Module
  • 8. 7 Visual Design Article Explorer Module • Visualizes the distribution of discourse modes • Sentiment Distribution • Problem: sentiment score of news articles goes zero after being aver aged • Solution: distribution of sentiment of the whole article from negative (-1) to positive (1) • Extreme sentiment sentences(>0. 5) could easily draw readers’ atten tion
  • 9. 8 Visual Design • Discourse Mode Distribution • Choose color as the visual channel • Common discourse mode narration; a low -saturation • Argument; reddish • Background information; purple • Description; green • Quotes; sky-blue • Reveal Metadata of News Articles • Inspired by the ‘5W&1H’ theory • ‘why’ & ‘how’: high-level, hard to be extra cted by the computer • ‘where’ & ‘when’ : give limited help to the understanding of news, tried to overuse markers • Decided to visualize ‘who’ (characters) an d ‘what’ (keywords grouped by topics) Article Explorer Module
  • 10. 9 Visual Design Article Stats Module • Sentiment and discourse mode stats • Overall status and sentence-level information • Article stats • Flesh-Kincaid readability grade: difficulty level of an article by its vocabulary use • Misleading news likes to be easy-to-read  spread faster thr oughout the general public (‘15, Coroy et al.)
  • 11. 10 Patterns of unreliable news • Analyzed 8 misleading articles using the system  Some linguistic features (sentiment dist., article subjectivity, article readability) have strong indicative patterns o n their own / together Unreliable news tends to be emotional • Dominated by one polarity (a, b, c) Fluctuating between two polarities (d)
  • 12. 11 Patterns of unreliable news Subjectivity of a realiable article is less than 0.2 • Subjectivity is another strong indicator to show the reliability of news articles • reliable articles could have a subjectivity score < 0.2 Unreliable news shows an easy-to- read pattern • With a Flesch-Kincaid readability grade greater than 0.3 in the stud y (cognitive level of secondary & college levels) • Reliable news: cognitive range of college & graduate levels
  • 13. 12 Patterns of unreliable news Unreliable articles include considerable portion of arguments • Subjectivity is another strong indicator to show the reliability of news articles • Reliable news are narrative without subjective arguments Sly writers arrange background /descriptions with their person al opinions
  • 14. 13 Patterns of unreliable news Sophisticatd writers selectively use other's mouth to convey their words • Easy to be ignored when reading plain text but can be seen clearly through visualization
  • 15. 14 Evaluation • Journalism scholar review  One journalism professor to evaluate the system thorugh a questionnaire  "Meaningful and potentially useful tool for news/information analytics"  "The features of writing style, sentiment, and keywords, etc. can be relevant indica tors of journalistic performance, depending on the usage context."  Recommended usage field • For academic research in media content analysis • Teaching and learning in jouranlism and public relations couses • News room practice especially for editors
  • 16. 15 Evaluation • User study  Among college students to verify whether it can help them to identify the misleading intention of the news articles  9 participants (4 undergraduate and 6 postgraduagte students)  Two steps 1. Read first and review with the visualization system • For one selected article • 2 out of 9  8 out of 9 spotted the bias 2. View visualization system first then read under the help of the visualization system • For two selected article (same event different standpoints from different news organization) • 7 (highly alerted), 5 (dubious), and 6 (looks objective) out of 18  unanimously agreed the articles are biased  Questionnaire • Help me to spot the intention of misleading in the articles • 4 participants (strongly agree), 3 participants (agree), 2 participants(neutral) • Help me to spot the bias towards different entities or events in the articles • 5 participants (strongly agree), 4 participants (agree)
  • 17. 16 Discussion • Limitation  Accuracy of feature extraction • Some extracted features are denied by human  Interaction and readability • Function filters to radar chart – sentiment/discoure and characters/events • Future Work  Extending the system and supplement missing dimensions • Enable comparison by aggregating articles of the same topic (data collection pipline) • Integrate related external information such as comments and Wikipedia to enable fact check (cr oss-validation through crowdworkers)  Generalization to other domains • Help students analyze patterns of TOEFL writing samples..
  • 18. 17 Takeaways 1. An interactive visualization design that could act as entry point or hint for further visualization research 2. A field study on current computer-based news analytical techniques 3. A case study reporting patterns found by their methodology
  • 19. Criticism 1. Well-structured and logical writing (with few questions why) 2. Design rationale is dense 3. Interesting and noticeable findings on unreliable articles 4. Visualization dashboards use reliable indicators  The authors introduce useful indicators 18 • Very useful when building article selection crite ria for article reading experiments
  • 20. Criticism 19 1. Need more information about training data  Which kind of topic? Politics? Economy? Or else? 2. In the section that describes why they chose that color  Need reference of each color meaning (controversial)  "purple represents wisdom... Green and sky blue are safe and reliable.." 3. Not fully convinced with visualizing only who and what  Missing description of the scope of the article selected in the paper • I can guess the mainly discussed topic in this paper is politics but not sure..  Where and when are more important information for infectious diseaes such as Corona virus 4. Study procedure description is quite unclear  Missing information of used articles  For the second task; why the authors count all samples for two articles (9 from one and 9 from the other)?  Participants and articles are quite small number; needed to be studies on more diverse articles • or at least mention the scope as ‘politics’ only
  • 21. 20 My IDEA for Future Work • Education  Learning structure is essential for writing  Examples are indespensable • show reliable and unreliable news articles to students or junior journalists  From examples, try to figure out features and create reliabe/unreliable news articles  Self-verification of the reliability of their writing • Filtering function  Filtering system (or an app) for readers  Notice subjectivity level and other stats  Do stats influence readers' acceptance of news content? • Article’s subjectivity may change depending on the reader’s political propensity
  • 22. Thank you  Any questions?