SlideShare une entreprise Scribd logo
1  sur  14
{
SentiWordNet
Andrea Esuli* and Fabrizio Sebastiani, SentiWordNet: A
Publicly Available Lexical Resource for Opinion Mining
1. text의 주관성, 객관성 판단
-> SO polarity (Pang and Lee,
2004; Yu and Hatzivassiloglou,
2003)
2. 주관성을 지닌 text의 긍정, 부
정 판단
-> PN polarity (Pang and Lee,
2004; Turney, 2002)
3. 얼마나 긍정/부정인지 판단
예) 조금 긍정, 약간 긍정, 아주
긍정
-> strength of text PN polarity
(Pang and Lee, 2005; Wilson et
al., 2004)
Polarity classification
Bo Pang and Lillian Lee, A Sentimental Education: Sentiment Analysis
Using Subjectivity Summarization Based on Minimum Cuts
WordNet
synset
- 영어의 의미 어휘
목록
- synset (유의어 집
단)으로 분류하여
단어집과 유의어,
반의어 사전의 배
합을 만듬.
- 심리학 교수인 조
지 A. 밀러가 지도
하는 프린스턴 대
학의 인지 과학 연
구소에 의해 만들
어졌고, 유지되고
있음.
synset
Score : 0.0 ~ 1.0,
각 synset의 총 합은 1
SentiWordNet
WordNet
synset
Positive
score
Neutral
score
Negative
score
Ternary
classifier
Semi-supervised
method
Training set
Lp , Ln Lp , Ln
K iterations
WordNet
Relation 적용
(반의어, 유의어,
파생어)
Training set
Subjectivity
(Lp, Ln)
Objectivity (Lo)
 Lp, Ln, Lo -> vectorial representation -> label Ci
-> 2개의 classifier 생성
1. Positive / not Positive 로 분류하는 classifier
2. Negative / not Negative 로 분류하는 classifier
Classifier
“Supervised learner”
Positive
∩
 Lp, Ln, Lo -> vectorial representation -> label Ci
-> 2개의 classifier 생성
1. Positive / not Positive 로 분류하는 classifier
2. Negative / not Negative 로 분류하는 classifier
Classifier
“Supervised learner”
Negative
∩
 Lp, Ln, Lo -> vectorial representation -> label Ci
-> 2개의 classifier 생성
1. Positive / not Positive 로 분류하는 classifier
2. Negative / not Negative 로 분류하는 classifier
Classifier
“Supervised learner”
Objective
∩ ∩
∪
 Precision = Tp / (Tp + Fp)
: True라고 예측한 것 중에서 실제로 true인 것의 비율
 Recall = Tp / (Tp + Fn)
: 실제로 true인 것중에 내가 얼마나 맞췄는지
(Tp : true라고 예측했는데 실제로 true,
Fp : true라고 예측했는데 실제로 false,
Fn : false라고 예측했는데 실제로 true,
Tn : false라고 예측했는데 실제로 false)
K를 정하려면?
K
Precision
Recall
Small
Training set
noise
(Andrea Esuli1 and Fabrizio Sebastiani2, Determining Term Subjectivity and
Term Orientation for Opinion Mining)
Roccino (Andrew McCallum’s Bow package http://www-
2.cs.cmu.edu/~mccallum/bow/)
SVM (6.01 of Thorsten Joachims’
SVMlight - http://svmlight.joachims.org/).
 K = 0, 2, 4, 6 -> 8가지의 ternary classifier
-> 1로 정규화
K를 정하려면?
 http://ko.asianwordnet.org/
한국어 WordNet?

Contenu connexe

Plus de SOYEON KIM

Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...SOYEON KIM
 
Systems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSystems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSOYEON KIM
 
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...SOYEON KIM
 
Network embedding
Network embeddingNetwork embedding
Network embeddingSOYEON KIM
 
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...SOYEON KIM
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveySOYEON KIM
 
DeepWalk: Online Learning of Social Representations
DeepWalk: Online Learning of Social RepresentationsDeepWalk: Online Learning of Social Representations
DeepWalk: Online Learning of Social RepresentationsSOYEON KIM
 
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Convolutional Neural Networks on Graphs with Fast Localized Spectral FilteringConvolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Convolutional Neural Networks on Graphs with Fast Localized Spectral FilteringSOYEON KIM
 
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image SearchVisual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image SearchSOYEON KIM
 
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...SOYEON KIM
 
A survey of heterogeneous information network analysis
A survey of heterogeneous information network analysisA survey of heterogeneous information network analysis
A survey of heterogeneous information network analysisSOYEON KIM
 
Translated learning
Translated learningTranslated learning
Translated learningSOYEON KIM
 
Self taught clustering
Self taught clusteringSelf taught clustering
Self taught clusteringSOYEON KIM
 
Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...SOYEON KIM
 
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...SOYEON KIM
 
Text extraction from natural scene image, a survey
Text extraction from natural scene image, a surveyText extraction from natural scene image, a survey
Text extraction from natural scene image, a surveySOYEON KIM
 
Opinion Fraud Detection in Online Reviews by Network Effects
Opinion Fraud Detection in Online Reviews by Network EffectsOpinion Fraud Detection in Online Reviews by Network Effects
Opinion Fraud Detection in Online Reviews by Network EffectsSOYEON KIM
 
Evaluating color descriptors for object and scene recognition
Evaluating color descriptors for object and scene recognitionEvaluating color descriptors for object and scene recognition
Evaluating color descriptors for object and scene recognitionSOYEON KIM
 
Outcome-guided mutual information networks for investigating gene-gene intera...
Outcome-guided mutual information networks for investigating gene-gene intera...Outcome-guided mutual information networks for investigating gene-gene intera...
Outcome-guided mutual information networks for investigating gene-gene intera...SOYEON KIM
 
Spectral clustering
Spectral clusteringSpectral clustering
Spectral clusteringSOYEON KIM
 

Plus de SOYEON KIM (20)

Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...Revealing disease-associated pathways by network integration of untargeted me...
Revealing disease-associated pathways by network integration of untargeted me...
 
Systems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traitsSystems genetics approaches to understand complex traits
Systems genetics approaches to understand complex traits
 
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
Robust Pathway-based Multi-Omics Data Integration using Directed Random Walk ...
 
Network embedding
Network embeddingNetwork embedding
Network embedding
 
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...
Integrative Pathway-based Survival Prediction utilizing the Interaction betwe...
 
Deep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a surveyDeep learning based multi-omics integration, a survey
Deep learning based multi-omics integration, a survey
 
DeepWalk: Online Learning of Social Representations
DeepWalk: Online Learning of Social RepresentationsDeepWalk: Online Learning of Social Representations
DeepWalk: Online Learning of Social Representations
 
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Convolutional Neural Networks on Graphs with Fast Localized Spectral FilteringConvolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
 
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image SearchVisual-Textual Joint Relevance Learning for Tag-Based Social Image Search
Visual-Textual Joint Relevance Learning for Tag-Based Social Image Search
 
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...
Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated wi...
 
A survey of heterogeneous information network analysis
A survey of heterogeneous information network analysisA survey of heterogeneous information network analysis
A survey of heterogeneous information network analysis
 
Translated learning
Translated learningTranslated learning
Translated learning
 
Self taught clustering
Self taught clusteringSelf taught clustering
Self taught clustering
 
Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...Semi-automatic ground truth generation using unsupervised clustering and limi...
Semi-automatic ground truth generation using unsupervised clustering and limi...
 
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...
Mobile Phone Spam Image Detection based on Graph Partitioning with Pyramid H...
 
Text extraction from natural scene image, a survey
Text extraction from natural scene image, a surveyText extraction from natural scene image, a survey
Text extraction from natural scene image, a survey
 
Opinion Fraud Detection in Online Reviews by Network Effects
Opinion Fraud Detection in Online Reviews by Network EffectsOpinion Fraud Detection in Online Reviews by Network Effects
Opinion Fraud Detection in Online Reviews by Network Effects
 
Evaluating color descriptors for object and scene recognition
Evaluating color descriptors for object and scene recognitionEvaluating color descriptors for object and scene recognition
Evaluating color descriptors for object and scene recognition
 
Outcome-guided mutual information networks for investigating gene-gene intera...
Outcome-guided mutual information networks for investigating gene-gene intera...Outcome-guided mutual information networks for investigating gene-gene intera...
Outcome-guided mutual information networks for investigating gene-gene intera...
 
Spectral clustering
Spectral clusteringSpectral clustering
Spectral clustering
 

Sentiwordnet: A publicly available lexical resource for opinion mining

  • 1. { SentiWordNet Andrea Esuli* and Fabrizio Sebastiani, SentiWordNet: A Publicly Available Lexical Resource for Opinion Mining
  • 2. 1. text의 주관성, 객관성 판단 -> SO polarity (Pang and Lee, 2004; Yu and Hatzivassiloglou, 2003) 2. 주관성을 지닌 text의 긍정, 부 정 판단 -> PN polarity (Pang and Lee, 2004; Turney, 2002) 3. 얼마나 긍정/부정인지 판단 예) 조금 긍정, 약간 긍정, 아주 긍정 -> strength of text PN polarity (Pang and Lee, 2005; Wilson et al., 2004)
  • 3. Polarity classification Bo Pang and Lillian Lee, A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts
  • 4. WordNet synset - 영어의 의미 어휘 목록 - synset (유의어 집 단)으로 분류하여 단어집과 유의어, 반의어 사전의 배 합을 만듬. - 심리학 교수인 조 지 A. 밀러가 지도 하는 프린스턴 대 학의 인지 과학 연 구소에 의해 만들 어졌고, 유지되고 있음.
  • 5. synset Score : 0.0 ~ 1.0, 각 synset의 총 합은 1 SentiWordNet
  • 7. Training set Lp , Ln Lp , Ln K iterations WordNet Relation 적용 (반의어, 유의어, 파생어)
  • 9.  Lp, Ln, Lo -> vectorial representation -> label Ci -> 2개의 classifier 생성 1. Positive / not Positive 로 분류하는 classifier 2. Negative / not Negative 로 분류하는 classifier Classifier “Supervised learner” Positive ∩
  • 10.  Lp, Ln, Lo -> vectorial representation -> label Ci -> 2개의 classifier 생성 1. Positive / not Positive 로 분류하는 classifier 2. Negative / not Negative 로 분류하는 classifier Classifier “Supervised learner” Negative ∩
  • 11.  Lp, Ln, Lo -> vectorial representation -> label Ci -> 2개의 classifier 생성 1. Positive / not Positive 로 분류하는 classifier 2. Negative / not Negative 로 분류하는 classifier Classifier “Supervised learner” Objective ∩ ∩ ∪
  • 12.  Precision = Tp / (Tp + Fp) : True라고 예측한 것 중에서 실제로 true인 것의 비율  Recall = Tp / (Tp + Fn) : 실제로 true인 것중에 내가 얼마나 맞췄는지 (Tp : true라고 예측했는데 실제로 true, Fp : true라고 예측했는데 실제로 false, Fn : false라고 예측했는데 실제로 true, Tn : false라고 예측했는데 실제로 false) K를 정하려면? K Precision Recall Small Training set noise (Andrea Esuli1 and Fabrizio Sebastiani2, Determining Term Subjectivity and Term Orientation for Opinion Mining)
  • 13. Roccino (Andrew McCallum’s Bow package http://www- 2.cs.cmu.edu/~mccallum/bow/) SVM (6.01 of Thorsten Joachims’ SVMlight - http://svmlight.joachims.org/).  K = 0, 2, 4, 6 -> 8가지의 ternary classifier -> 1로 정규화 K를 정하려면?