An Alignment-based Approach to Semi-supervised Relation Extraction
                   Including Multiple Arguments
                                                          Seokhwan Kim, Minwoo Jeong, Gary Geunbae Lee, Kwangil Ko, and Zino Lee
                                                               {megaup, stardust, gblee}@postech.ac.kr, {kik, zino}@alticast.com

  Abstract - We present an alignment-based approach to semi-supervised relation extraction for relations with more than two arguments. We aim to
  improve not only the precision of the extracted results but also the coverage of the method. Our relation extraction method is based on
  alignment-based pattern matching, which makes the method more flexible. In addition, we extract all relationships with two or more arguments
  at once in order to obtain a high-quality integrated result. We present experimental results that indicate the effectiveness of our method.

Alignment-based Information Extraction

• Information Extraction
  - Extracting the defined number of relevant arguments from natural language documents
  - Subtasks

      # of arguments   subtask
      1                named-entity recognition
      2                binary relation extraction
      more than 2      relation/event extraction

  - Approaches
      - Supervised
      - Un/Semi-Supervised

• Sentence Alignment for Information Extraction
  - Example

      the character <ROLE> portrayed by <ACTOR> in the television series <PROGRAM> is
      character Michael Scofield portrayed by Wentworth Miller in the TV series Prison Break is

  - Alignment Matrix (rows: pattern tokens; columns: raw sentence tokens)

                  the  character  Michael  Scofield  portrayed  by  Wentworth  Miller  in  the  TV  series  Prison  Break  is
      character     0          1        1         1          1   1          1       1   1    1   1       1       1      1   1
      <ROLE>        1          1        2         2          2   2          2       2   2    2   2       2       2      2   2
      portrayed     1          1        2         2          3   3          3       3   3    3   3       3       3      3   3
      by            1          1        2         2          3   4          4       4   4    4   4       4       4      4   4
      <ACTOR>       1          2        2         3          3   4          5       5   5    5   5       5       5      5   5
      in            1          2        2         3          3   4          5       5   6    6   6       6       6      6   6
      the           1          2        2         3          3   4          5       5   6    7   7       7       7      7   7
      television    1          2        2         3          3   4          5       5   6    7   7       7       7      7   7
      series        1          2        2         3          3   4          5       5   6    7   7       8       8      8   8
      <PROGRAM>     1          2        3         3          4   4          5       6   6    7   8       8       9      9   9
      is            1          2        3         3          4   4          5       6   6    7   8       8       9      9  10

  - Matrix Computation

      \[
      M_{i,j} = \max
      \begin{cases}
        M_{i-1,j-1} + \mathrm{sim}_{i,j} \\
        M_{i-1,j} + gp \\
        M_{i,j-1} + gp \\
        0
      \end{cases}
      \qquad
      \mathrm{sim}_{i,j} =
      \begin{cases}
        1, & \text{if } PTN_i = RAW_j \text{ or } PTN_i = \langle\text{label}\rangle \\
        0, & \text{otherwise}
      \end{cases}
      \]

  - Trace Back

      M_{i,j}                        next position
      M_{i-1,j-1} + sim_{i,j}        [i-1, j-1]
      M_{i-1,j} + gp                 [i-1, j]
      M_{i,j-1} + gp                 [i, j-1]
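
The matrix computation and trace back can be sketched in Python as follows. This is a minimal illustration, not the authors' implementation. Several points are assumptions: gp is read as a gap penalty and set to 0 (the matrix values on the poster are consistent with gp = 0), row 0 and column 0 are initialised to 0, and the tie-breaking rule in `trace_back` (a label row prefers to move left so that a label can absorb several tokens) is hypothetical, since the poster does not state one. The function names `is_label`, `sim`, `alignment_matrix` and `trace_back` are illustrative.

```python
def is_label(token):
    """True for argument labels such as <ROLE> (assumed label syntax)."""
    return token.startswith("<") and token.endswith(">")


def sim(ptn_tok, raw_tok):
    """sim_{i,j}: 1 if the tokens match or the pattern token is a label."""
    return 1 if ptn_tok == raw_tok or is_label(ptn_tok) else 0


def alignment_matrix(ptn, raw, gp=0):
    """Fill M with the recurrence from the poster; row/column 0 are 0 (assumption)."""
    m = [[0] * (len(raw) + 1) for _ in range(len(ptn) + 1)]
    for i in range(1, len(ptn) + 1):
        for j in range(1, len(raw) + 1):
            m[i][j] = max(
                m[i - 1][j - 1] + sim(ptn[i - 1], raw[j - 1]),
                m[i - 1][j] + gp,
                m[i][j - 1] + gp,
                0,
            )
    return m


def trace_back(m, ptn, raw, gp=0):
    """Walk back from the best cell and collect the raw tokens under each label."""
    cells = ((r, c) for r in range(len(m)) for c in range(len(m[0])))
    i, j = max(cells, key=lambda rc: m[rc[0]][rc[1]])
    slots = {}
    while i > 0 and j > 0 and m[i][j] > 0:
        tok = ptn[i - 1]
        from_diag = m[i - 1][j - 1] + sim(tok, raw[j - 1]) == m[i][j]
        from_left = m[i][j - 1] + gp == m[i][j]
        from_up = m[i - 1][j] + gp == m[i][j]
        if is_label(tok) and from_left:
            # Hypothetical tie-break: a label keeps absorbing tokens to its left.
            slots.setdefault(tok, []).insert(0, raw[j - 1])
            j -= 1
        elif from_diag:
            if is_label(tok):
                slots.setdefault(tok, []).insert(0, raw[j - 1])
            i, j = i - 1, j - 1
        elif from_up:
            i -= 1
        else:
            j -= 1
    return {label: " ".join(tokens) for label, tokens in slots.items()}


# Pattern rows and raw-sentence columns as they appear in the alignment matrix.
ptn = "character <ROLE> portrayed by <ACTOR> in the television series <PROGRAM> is".split()
raw = ("the character Michael Scofield portrayed by Wentworth Miller "
       "in the TV series Prison Break is").split()

if __name__ == "__main__":
    matrix = alignment_matrix(ptn, raw)
    print(matrix[-1][-1])
    print(trace_back(matrix, ptn, raw))
```

Under these assumptions the sketch reproduces the matrix values of the poster example (the bottom-right cell is 10), and the trace back assigns "Michael Scofield" to <ROLE>, "Wentworth Miller" to <ACTOR> and "Prison Break" to <PROGRAM>.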

Semi-supervised Relation Extraction Including Multiple Arguments

• Overall Architecture

  [Figure: Seed Data (n arguments) -> Seed Data for 2 arguments, ..., k arguments, ..., n arguments -> Extracting Context Patterns -> Relation Extraction -> Validation & Integration -> Results (n arguments)]

• Context Patterns Extraction
  1) Searching the sentences containing all arguments of each tuple in source documents
  2) Segmenting out subpart of the sentence with the window size w
  3) Replacing the parts of arguments in the sub-sentence with argument labels

• Relation Extraction based on Pairwise Alignment
  - Alignment score

      \[ \mathrm{score}(PTN, RAW) = \frac{\max\{M(PTN, RAW)\}}{\mathrm{length}(PTN)} \]

• Alignment-based Verification
  - Aligning between two candidate arguments

      \[ \mathrm{similarity}(A, B) = \frac{\max\{M(A, B)\} \times 2}{\mathrm{length}(A) + \mathrm{length}(B)} \]

  - Tuple clustering based on

      \[ \mathrm{sim}(tuple_1, tuple_2) = \frac{\sum_{i=1}^{|args|} \mathrm{similarity}(tuple_{1i}, tuple_{2i})}{|args|} \]

  - Selecting the most probable tuple for each cluster
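
The three context pattern extraction steps can be sketched as below. This is a rough illustration under stated assumptions, not the authors' code: sentences are whitespace-tokenised, arguments are matched as exact token sequences, and the window size w is read as the number of tokens kept before the first argument and after the last one (the poster does not define it further). The names `find_span` and `extract_pattern` are hypothetical.

```python
def find_span(tokens, arg_tokens):
    """Return (start, end) of the first exact occurrence of arg_tokens, else None."""
    n = len(arg_tokens)
    for start in range(len(tokens) - n + 1):
        if tokens[start:start + n] == arg_tokens:
            return start, start + n
    return None


def extract_pattern(tokens, seed, w=1):
    """Build a context pattern from one sentence and one seed tuple.

    seed maps an argument label (e.g. "<ROLE>") to its surface string.
    Returns None when the sentence does not contain every argument.
    """
    # Step 1: keep only sentences that contain all arguments of the tuple.
    spans = {}
    for label, arg in seed.items():
        span = find_span(tokens, arg.split())
        if span is None:
            return None
        spans[label] = span

    # Step 2: cut out the sub-sentence, padded by w tokens on each side (assumption).
    start = max(0, min(s for s, _ in spans.values()) - w)
    end = min(len(tokens), max(e for _, e in spans.values()) + w)

    # Step 3: replace each argument span with its label (spans assumed not to overlap).
    by_start = {s: (label, e) for label, (s, e) in spans.items()}
    pattern, k = [], start
    while k < end:
        if k in by_start:
            label, k = by_start[k]
            pattern.append(label)
        else:
            pattern.append(tokens[k])
            k += 1
    return pattern


sentence = ("the character Michael Scofield portrayed by Wentworth Miller "
            "in the TV series Prison Break is").split()
seed = {
    "<ROLE>": "Michael Scofield",
    "<ACTOR>": "Wentworth Miller",
    "<PROGRAM>": "Prison Break",
}

if __name__ == "__main__":
    print(" ".join(extract_pattern(sentence, seed, w=1)))
```

With w = 1 the example sentence yields a pattern that starts at "character" and ends at "is", with the three argument spans replaced by their labels.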

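The alignment score and the verification similarities can be computed from the largest cell of the alignment matrix, as sketched below. The sketch assumes gp = 0, whitespace tokens as alignment units, and zero-initialised borders. The greedy threshold clustering in `cluster_tuples` and its `threshold` value are purely hypothetical, since the poster names tuple clustering but not the algorithm, and the selection of the most probable tuple is left out because no criterion is given. The third example tuple is made-up placeholder data.

```python
def max_alignment(a, b, gp=0):
    """Largest cell of M(a, b), using the same recurrence as the alignment matrix."""
    prev = [0] * (len(b) + 1)
    best = 0
    for x in a:
        cur = [0]
        for j, y in enumerate(b, start=1):
            # sim is 1 for equal tokens or when x is a label such as <ROLE>.
            s = 1 if x == y or (x.startswith("<") and x.endswith(">")) else 0
            cur.append(max(prev[j - 1] + s, prev[j] + gp, cur[j - 1] + gp, 0))
        best = max(best, max(cur))
        prev = cur
    return best


def score(ptn, raw, gp=0):
    """score(PTN, RAW) = max{M(PTN, RAW)} / length(PTN)."""
    return max_alignment(ptn, raw, gp) / len(ptn)


def similarity(a, b, gp=0):
    """similarity(A, B) = 2 * max{M(A, B)} / (length(A) + length(B))."""
    return 2 * max_alignment(a, b, gp) / (len(a) + len(b))


def tuple_sim(t1, t2):
    """Mean argument-wise similarity of two tuples with the same arity."""
    total = sum(similarity(x.split(), y.split()) for x, y in zip(t1, t2))
    return total / len(t1)


def cluster_tuples(tuples, threshold=0.5):
    """Hypothetical greedy clustering: join the first cluster whose first member is similar enough."""
    clusters = []
    for t in tuples:
        for cluster in clusters:
            if tuple_sim(cluster[0], t) >= threshold:
                cluster.append(t)
                break
        else:
            clusters.append([t])
    return clusters


ptn = "character <ROLE> portrayed by <ACTOR> in the television series <PROGRAM> is".split()
raw = ("the character Michael Scofield portrayed by Wentworth Miller "
       "in the TV series Prison Break is").split()
t1 = ("Prison Break", "Wentworth Miller", "Michael Scofield")
t2 = ("Prison Break", "Miller", "Scofield")
t3 = ("Program X", "Actor Y", "Role Z")  # placeholder tuple, not from the poster

if __name__ == "__main__":
    print(score(ptn, raw))
    print(tuple_sim(t1, t2))
    print(cluster_tuples([t1, t2, t3]))
```

Dividing by length(PTN) keeps the score between 0 and 1 regardless of pattern length, which is what makes a single threshold usable across patterns.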
Experimental Results
v Experimental Setup
w 930 Korean news documents (13,175 sentences) about TV series
w Only a tuple with 4 arguments (CHANNEL, PROGRAM, ACTOR, ROLE) is used as a seed
w Each result is collected after the first iteration and evaluated manually

v Result of the verification

   type of        before verification      after verification
   relations      |tuples|    P (%)        |tuples|    P (%)
   (A,R)             249      36.55           79       73.42
   (P,R)              19      52.63           17       58.82
   (P,A)              10      60              10       60
   (C,P)              12      33.33            6       66.67
   (P,A,R)             7      42.86            5       60
   (C,P,R)            18      55.56           16       81.25
   (C,P,A)             8      62.5             8       75
   (C,P,A,R)          15      60              14       85.71

v Result of the integration

   type of        with only binary relations    with all intermediates
   relations      |tuples|    P (%)             |tuples|    P (%)
   (P,A,R)             9      77.78                 9       88.89
   (C,P,R)            11      81.82                16       87.5
   (C,P,A)            12      58.33                 9       77.78
   (C,P,A,R)           8      87.5                 16       87.5

w th = 0.85
w C(Channel), P(Program), A(Actor), R(Role)

v Comparison on the Coverage for Various Threshold Values
[Figure: # of correct results (y-axis, 0 to 90) versus threshold (x-axis, 1.00 down to 0.70), with one series each for results including 2, 3, and 4 arguments]
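
The verification figures are easier to compare as approximate counts of correct tuples than as percentages. The sketch below does that conversion, assuming that P is percentage precision, i.e. 100 × correct / |tuples| over the manually evaluated results. The poster does not state this definition or report the counts, so the recovered numbers are only rounded estimates. The names `VERIFICATION` and `correct_count` are illustrative and do not come from the poster.

```python
# Rows copied from the "Result of the verification" table.
# relation type -> (|tuples| before, P before, |tuples| after, P after)
VERIFICATION = {
    "(A,R)":     (249, 36.55, 79, 73.42),
    "(P,R)":     (19, 52.63, 17, 58.82),
    "(P,A)":     (10, 60.0, 10, 60.0),
    "(C,P)":     (12, 33.33, 6, 66.67),
    "(P,A,R)":   (7, 42.86, 5, 60.0),
    "(C,P,R)":   (18, 55.56, 16, 81.25),
    "(C,P,A)":   (8, 62.5, 8, 75.0),
    "(C,P,A,R)": (15, 60.0, 14, 85.71),
}


def correct_count(n_tuples, precision_pct):
    """Estimate the number of correct tuples.

    Assumption (not stated on the poster): P = 100 * correct / |tuples|.
    The result is rounded because P is reported to two decimals.
    """
    return round(n_tuples * precision_pct / 100)


for relation, (n_before, p_before, n_after, p_after) in VERIFICATION.items():
    before = correct_count(n_before, p_before)
    after = correct_count(n_after, p_after)
    print(f"{relation:10s} before: {before}/{n_before}  after: {after}/{n_after}")
```

Under that assumption, the (A,R) row works out to about 91 correct tuples out of 249 before verification and about 58 out of 79 after it.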
