A GRAPH-BASED CROSS-LINGUAL
      PROJECTION APPROACH FOR
WEAKLY SUPERVISED RELATION EXTRACTION
    The 50th Annual Meeting of the Association for Computational Linguistics
                                  (ACL 2012)
                             July 11th, 2012, Jeju

       Seokhwan Kim (Institute for Infocomm Research)
                 Gary Geunbae Lee (POSTECH)
Contents
• Introduction
• Methods
    Cross-lingual Annotation Projection for Relation Extraction
    Graph-based Projection Approach
• Evaluation
• Conclusions




Problem Definition
• Relation Extraction
    To identify semantic relations between a pair of entities

    Example (relation: Birthplace)
       Barack Obama [PER] was born in Honolulu [LOC], Hawaii [LOC].



    Considered as a classification problem




Related Work (1)
• Supervised Learning
    Many supervised machine learning approaches have been
     successfully applied
      • (Kambhatla, 2004; Zhou et al., 2005; Zelenko et al., 2003; Culotta and
        Sorensen, 2004; Bunescu and Mooney, 2005; Zhang et al., 2006)

• Semi-supervised Learning
    To obtain the annotations of unlabeled instances from the seed
     information
      • (Brin, 1999; Riloff and Jones, 1999; Agichtein and Gravano, 2000;
        Sudo et al., 2003; Yangarber, 2003; Stevenson and Greenwood, 2006;
        Zhang, 2004; Chen et al., 2006; Zhou et al., 2009)



Motivation
• Resources for Relation Extraction
    Supervised/Semi-supervised Approaches
      • Labeled corpora for supervised learning
      • Seed instances for semi-supervised learning
      • Available for only a few languages
           ACE 2003 Multilingual Training Dataset
              • English (252 articles)
              • Chinese (221 articles)
              • Arabic (206 articles)
      • No resources for other languages
           Korean




Related Work (2)
• Self-supervised Learning
    To obtain the annotated dataset without any human effort
    Using the information obtained from external resources
       • Heuristic-based Method (Banko et al., 2007; Banko et al., 2008)
       • Wikipedia-based Methods (Wu and Weld, 2010)

• Cross-lingual Annotation Projection
    To leverage parallel corpora to project the relation annotations on
     the resource-rich source language to the resource-poor target
     language (Kim et al., 2010; Kim et al., 2011)




Overall Architecture
• Annotation (sentences in the source language Ls)
    Sentences in Ls → Preprocessing (POS Tagging, Parsing) → NER
     → Relation Extraction → Annotated Sentences in Ls
• Projection (sentences in the target language Lt, linked through the Parallel Corpus)
    Sentences in Lt → Preprocessing (POS Tagging, Parsing) → Word Alignment
     → Projection → Annotated Sentences in Lt
Direct Projection
                                                                                (Kim et al., 2010)
• Annotation


• Projection




                        fE (<Barack Obama, Honolulu>) = 1
       Barack Obama              was born in        Honolulu          ,    Hawaii         .


    버락 오바마               는       하와이         의      호놀룰루              에서          태어났다
    (beo-rak-o-ba-ma)   (neun)   (ha-wa-i)   (ui)   (ho-nol-rul-ru)   (e-seo)     (tae-eo-nat-da)


                          fK (<버락 오바마, 호놀룰루>) = 1
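
The example above can be sketched in a few lines of Python. This is only an illustration of the idea of copying a label across word alignments, not the authors' implementation: the function names, the span convention (start inclusive, end exclusive), and the alignment pairs are all assumptions made for this sketch.

```python
def project_span(span, alignment):
    """Target token indices aligned to any source token in span = (start, end)."""
    start, end = span
    return sorted({t for s, t in alignment if start <= s < end})


def direct_projection(src_pair, label, alignment, tgt_tokens):
    """Copy a source-side relation label onto the aligned target entity pair."""
    arg1 = project_span(src_pair[0], alignment)
    arg2 = project_span(src_pair[1], alignment)
    if not arg1 or not arg2:
        return None  # an entity has no aligned target tokens: nothing to project

    def text(indices):
        return " ".join(tgt_tokens[i] for i in indices)

    return (text(arg1), text(arg2)), label


# Source tokens: Barack Obama was born in Honolulu , Hawaii .
#        index:  0      1     2   3    4  5        6 7      8
ko = ["버락", "오바마", "는", "하와이", "의", "호놀룰루", "에서", "태어났다"]

# Hypothetical word alignment as (source index, target index) pairs.
alignment = [(0, 0), (1, 1), (3, 7), (4, 6), (5, 5), (7, 3)]

# Source entity pair <Barack Obama, Honolulu> with fE = 1.
src_pair = ((0, 2), (5, 6))
projected = direct_projection(src_pair, 1, alignment, ko)
```

Here `projected` holds the Korean entity pair with the copied label. Because the label is copied in a single pass, any error in NER or word alignment carries over directly to the target side.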
Limitations of Direct Projection
• Direct projection approach is still vulnerable to the
  erroneous inputs generated by preprocessors
• Main causes of this limitation
    Considering alignment between entity candidates only, not any
     contextual information
    Performed by just a single pass process




Graph-based Learning
• Semi-supervised learning algorithm
• Defining a graph
    The nodes represent labeled and unlabeled examples in a dataset
    The edges reflect the similarity of examples
• Learning a labeling function in an iterative manner
    It should be close to the given labels on the similar labeled nodes
    It should be smooth on the whole graph
• Related Work
    Graph-based Learning for Relation Extraction (Chen et al., 2006)
    Bilingual projection of POS tagging (Das and Petrov, 2011)


Graph Construction
• Graph Nodes
   Instance Nodes
      • Defined for all pairs of entity candidates in both languages
      • Each instance node has a soft label vector Y = [y+ y-]
   Context Nodes
      • For identifying the relation descriptors of the positive instances
      • Defined for each trigram which is located between a given entity pair
        which is semantically related
      • Each context node has a soft label vector Y = [y+ y-]


   Example
      • Context of the entity pair: <ARG1> was born in <ARG2>
      • Trigram context nodes: "<ARG1> was born", "was born in", "born in <ARG2>"
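
The trigram context nodes in the example can be produced with a sliding window over the token sequence. A minimal sketch, assuming whitespace-separated tokens and a hypothetical helper name `context_trigrams`:

```python
def context_trigrams(tokens, n=3):
    """All contiguous n-grams (trigrams by default) of a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


tokens = "<ARG1> was born in <ARG2>".split()
trigrams = context_trigrams(tokens)
# trigrams == [("<ARG1>", "was", "born"),
#              ("was", "born", "in"),
#              ("born", "in", "<ARG2>")]
```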
Graph Construction
• Edge Weights
   Between instance node and context node in the same language
       $$ w(v_{i,j}, u^k) = \begin{cases} 1 & \text{if } v_{i,j} \text{ has } u^k \text{ as a contextual subsequence,} \\ 0 & \text{otherwise.} \end{cases} $$
   Between context nodes in a language
       $$ w(u^k, u^l) = J(u^k, u^l) = \frac{|u^k \cap u^l|}{|u^k \cup u^l|} $$
   Between context nodes in source and target languages
       $$ w(u_s^k, u_t^l) = \frac{\mathrm{count}(u_s^k, u_t^l)}{\sum_{u_t^m} \mathrm{count}(u_s^k, u_t^m)} $$
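
The three edge weights can be sketched as follows. Two points are assumptions of this sketch rather than details given on the slide: each trigram is treated as a set of words for the Jaccard coefficient, and the cross-lingual counts are taken from a given list of aligned (source context, target context) pairs. All function names and the sample pairs are hypothetical.

```python
from collections import Counter


def instance_context_weight(instance_trigrams, context):
    """1 if the instance has the context trigram as a subsequence, else 0."""
    return 1.0 if context in instance_trigrams else 0.0


def jaccard(u_k, u_l):
    """Jaccard coefficient between two context trigrams, viewed as word sets."""
    a, b = set(u_k), set(u_l)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cross_lingual_weights(aligned_pairs):
    """count(u_s, u_t) normalized over all target contexts seen with u_s."""
    counts = Counter(aligned_pairs)
    totals = Counter()
    for (u_s, _), c in counts.items():
        totals[u_s] += c
    return {(u_s, u_t): c / totals[u_s] for (u_s, u_t), c in counts.items()}


w_same = jaccard(("was", "born", "in"), ("born", "in", "<ARG2>"))  # 2 shared / 4 total

# Hypothetical aligned context pairs collected from a parallel corpus.
pairs = [("was born in", "에서 태어났다")] * 3 + [("was born in", "의 호놀룰루 에서")]
w_cross = cross_lingual_weights(pairs)
```

With these inputs, `w_same` is 0.5, and the two target contexts receive weights 0.75 and 0.25, which sum to 1 for the shared source context.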




Graph Construction
• Example [figure: constructed graph]
Label Propagation
• Algorithm
    Input
      • A transition matrix T
      • An initial label matrix Y0
    Output
      • The updated label matrix Yt
    Steps
      • Initialize T
      • Normalize T
      • Initialize Y
      • Update Y
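
The steps above (initialize and normalize T, initialize Y, update Y) follow the general shape of iterative label propagation. The sketch below is a generic textbook variant in plain Python, with row normalization and clamping of the labeled nodes after each update. These choices, the function names, and the toy graph are assumptions; this is not the Junto toolkit and may differ from the update rule used in the talk.

```python
def row_normalize(W):
    """Turn a weight matrix into a row-stochastic transition matrix T."""
    T = []
    for row in W:
        total = sum(row)
        T.append([w / total if total else 0.0 for w in row])
    return T


def label_propagation(W, Y0, labeled, iterations=20):
    """Repeat Y <- T Y, resetting labeled rows to their initial labels."""
    T = row_normalize(W)
    Y = [row[:] for row in Y0]
    n, c = len(Y0), len(Y0[0])
    for _ in range(iterations):
        Y_next = [[sum(T[i][j] * Y[j][k] for j in range(n)) for k in range(c)]
                  for i in range(n)]
        for i in labeled:          # clamp the nodes whose labels are given
            Y_next[i] = Y0[i][:]
        Y = Y_next
    return Y


# Toy graph: node 0 is a positive seed, node 2 a negative seed,
# node 1 is unlabeled and connected more strongly to node 0.
W = [[0.0, 3.0, 0.0],
     [3.0, 0.0, 1.0],
     [0.0, 1.0, 0.0]]
Y0 = [[1.0, 0.0],   # [y+, y-]
      [0.0, 0.0],
      [0.0, 1.0]]
Y = label_propagation(W, Y0, labeled={0, 2})
```

In this toy graph the unlabeled node ends with the soft label [0.75, 0.25], reflecting its stronger edge to the positive seed.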




Label Propagation
• Executed in three phases [figure: 1st phase, 2nd phase, 3rd phase]




Implementation
• Dataset
    English-Korean parallel corpus
       • 266,982 bi-sentence pairs in English and Korean
       • Aligned by GIZA++
• Annotation
    ReVerb (Fader et al., 2011)
       • English Open IE system
• Label Propagation
    Junto Label Propagation Toolkit
• Learning
    Tree kernel-based SVM classifier
       • Shortest path dependency kernel (Bunescu and Mooney, 2005)
       • SVM-Light (Joachims, 1998)

Evaluation
• Dataset
    Manually annotated Korean dataset
      • Obtained from the Web, following Bunescu and Mooney (2007)
      • 500 sentences with manual annotations for four relation types
             Acquisition
             Birthplace
             Inventor Of
             Won Prize

• Evaluation Metrics
    Precision/Recall/F-measure
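
For reference, the three metrics can be computed as below. This is the standard definition with the balanced F-measure (F1); the slides do not state which F-measure variant was used, so F1 is an assumption here, and the counts in the example are made up.

```python
def precision_recall_f1(tp, fp, fn):
    """Precision, recall and balanced F-measure from raw counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall, f1_from_pr(precision, recall)


def f1_from_pr(precision, recall):
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


# Made-up counts: 8 correct extractions, 2 spurious, 2 missed.
p, r, f = precision_recall_f1(tp=8, fp=2, fn=2)

# Applying the harmonic mean to a reported pair, P = 67.69 and R = 87.41:
f_check = round(f1_from_pr(67.69, 87.41), 2)  # 76.3
```

The last line reproduces the F value reported for the projection-based approach (76.30) from its precision and recall.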




Experimental Results
• Direct Projection vs. Graph-based Projection


                 |   Direct Projection    | Graph-based Projection
   Type          |   P       R       F    |   P       R       F
   Acquisition   |  51.6    87.7    64.9  |  55.3    91.2    68.9
   Birthplace    |  69.8    84.5    76.4  |  73.8    87.3    80.0
   Inventor of   |  62.4    85.3    72.1  |  66.3    89.7    76.3
   Won Prize     |  73.3    80.5    76.7  |  76.4    82.9    79.5
   Total         |  63.9    84.2    72.7  |  67.7    87.4    76.3




Experimental Results
• Comparisons to other self-supervised approaches
    Heuristic-based Approach (Banko et al., 2007; Banko et al., 2008)
       • Korean Treebank and Syntactic Heuristics
    Wikipedia-based Approach (Wu and Weld, 2010)
       • Korean Wikipedia articles and Infoboxes


   Approach           |    P       R       F
   Heuristic-based    |  92.31   17.27   29.09
   Wikipedia-based    |  66.67   66.91   66.79
   Projection-based   |  67.69   87.41   76.30

Conclusion
• Summary
    A graph-based projection approach for relation extraction
       • Label propagation algorithm
       • On a graph that represents the instance and context features of both
         the source and target languages
    Experimental results show that our approach improves relation
     extraction performance compared to other approaches
• Future work
    To reduce the high complexity of the approach
    To explore a more expanded graph structure to improve
     extraction performance


Q&A

 


A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction

  • 1. A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction
      • The 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012)
      • July 11th, 2012, Jeju
      • Seokhwan Kim (Institute for Infocomm Research)
      • Gary Geunbae Lee (POSTECH)
  • 2. Contents
      • Introduction
      • Methods
          - Cross-lingual Annotation Projection for Relation Extraction
          - Graph-based Projection Approach
      • Evaluation
      • Conclusions
  • 4. Problem Definition
      • Relation Extraction
          - To identify semantic relations between a pair of entities
          - Example sentence: "Barack Obama was born in Honolulu, Hawaii."
              • Entities: Barack Obama (PER), Honolulu (LOC), Hawaii (LOC)
              • Relation label shown: Birthplace
          - Considered as a classification problem
  • 5. Related Work (1)
      • Supervised Learning
          - Many supervised machine learning approaches have been successfully applied
              • (Kambhatla, 2004; Zhou et al., 2005; Zelenko et al., 2003; Culotta and Sorensen, 2004; Bunescu and Mooney, 2005; Zhang et al., 2006)
      • Semi-supervised Learning
          - To obtain the annotations of unlabeled instances from the seed information
              • (Brin, 1999; Riloff and Jones, 1999; Agichtein and Gravano, 2000; Sudo et al., 2003; Yangarber, 2003; Stevenson and Greenwood, 2006; Zhang, 2004; Chen et al., 2006; Zhou et al., 2009)
  • 6. Motivation
      • Resources for Relation Extraction
          - Supervised/Semi-supervised Approaches
              • Labeled corpora for supervised learning
              • Seed instances for semi-supervised learning
              • Available for only a few languages
          - ACE 2003 Multilingual Training Dataset
              • English (252 articles)
              • Chinese (221 articles)
              • Arabic (206 articles)
      • No resources for other languages
          - Korean
  • 7. Related Work (2)
      • Self-supervised Learning
          - To obtain the annotated dataset without any human effort
          - Using the information obtained from external resources
              • Heuristic-based Method (Banko et al., 2007; Banko et al., 2008)
              • Wikipedia-based Methods (Wu and Weld, 2010)
      • Cross-lingual Annotation Projection
          - To leverage parallel corpora to project the relation annotations from the resource-rich source language onto the resource-poor target language (Kim et al., 2010; Kim et al., 2011)
  • 8. Contents
      • Introduction
      • Methods
          - Cross-lingual Annotation Projection for Relation Extraction
          - Graph-based Projection Approach
      • Implementation
      • Evaluation
      • Conclusions
  • 9. Overall Architecture
      • [Figure: architecture diagram with two stages, Annotation and Projection, fed by a Parallel Corpus]
          - Inputs: Sentences in Ls, Sentences in Lt
          - Both sides: Preprocessing (POS Tagging, Parsing)
          - Other components shown: NER, Relation Extraction, Word Alignment, Projection
          - Outputs: Annotated Sentences in Ls, Annotated Sentences in Lt
  • 10. Direct Projection (Kim et al., 2010)
      • Annotation
          - f_E(<Barack Obama, Honolulu>) = 1
          - Barack Obama was born in Honolulu , Hawaii .
      • Projection
          - 버락 오바마 는 하와이 의 호놀룰루 에서 태어났다
          - (beo-rak-o-ba-ma) (neun) (ha-wa-i) (ui) (ho-nol-rul-ru) (e-seo) (tae-eo-nat-da)
          - f_K(<버락 오바마, 호놀룰루>) = 1
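The projection step on this slide can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' implementation: the function name `project_pair`, the span representation, the min/max rule for building the target span, and the hand-written alignment links are all assumptions made for the example (the deck obtains alignments from GIZA++).

```python
def project_pair(src_pair, alignment):
    """Map a source-side entity pair to target-side spans.

    src_pair: two (start, end) token spans on the source side, end exclusive.
    alignment: set of (source_index, target_index) word-alignment links.
    Returns two target spans, or None if an entity has no aligned token.
    Assumption: the target span covers the min..max aligned target index.
    """
    projected = []
    for start, end in src_pair:
        targets = sorted(t for s, t in alignment if start <= s < end)
        if not targets:
            return None
        projected.append((targets[0], targets[-1] + 1))
    return tuple(projected)


src = "Barack Obama was born in Honolulu , Hawaii .".split()
tgt = "버락 오바마 는 하와이 의 호놀룰루 에서 태어났다".split()

# Hypothetical alignment links, written by hand for this example.
alignment = {(0, 0), (1, 1), (3, 7), (4, 6), (5, 5), (7, 3)}

# <Barack Obama, Honolulu> occupies source spans (0, 2) and (5, 6).
pair = project_pair(((0, 2), (5, 6)), alignment)
projected_entities = [" ".join(tgt[s:e]) for s, e in pair]
```

Because the pair is mapped through alignment links alone, a single wrong link moves the target entity, which is the weakness the next slide discusses.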
  • 11. Limitations of Direct Projection
      • The direct projection approach is still vulnerable to erroneous inputs generated by the preprocessors
      • Main causes of this limitation
          - It considers alignments between entity candidates only, without any contextual information
          - It is performed in just a single pass
  • 12. Graph-based Learning
      • Semi-supervised learning algorithm
      • Defining a graph
          - The nodes represent labeled and unlabeled examples in a dataset
          - The edges reflect the similarity of examples
      • Learning a labeling function in an iterative manner
          - It should stay close to the given labels on the labeled nodes
          - It should be smooth over the whole graph
      • Related Work
          - Graph-based Learning for Relation Extraction (Chen et al., 2006)
          - Bilingual projection of POS tagging (Das and Petrov, 2011)
  • 13. Graph Construction
      • Graph Nodes
          - Instance Nodes
              • Defined for all pairs of entity candidates in both languages
              • Each instance node has a soft label vector Y = [y+ y-]
          - Context Nodes
              • For identifying the relation descriptors of the positive instances
              • Defined for each trigram located between a given, semantically related entity pair
              • Each context node has a soft label vector Y = [y+ y-]
              • Example: "<ARG1> was born in <ARG2>" yields "<ARG1> was born", "was born in", "born in <ARG2>"
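The trigram example on this slide can be reproduced with a short Python sketch. The function name `context_trigrams` and the token-span representation are assumptions for illustration, and the sketch assumes the first argument precedes the second in the sentence.

```python
def context_trigrams(tokens, arg1_span, arg2_span):
    """Return the trigrams over <ARG1>, the tokens in between, and <ARG2>.

    Spans are (start, end) token offsets, end exclusive.
    Assumption: arg1_span ends before arg2_span starts.
    """
    _, end1 = arg1_span
    start2, _ = arg2_span
    seq = ["<ARG1>"] + tokens[end1:start2] + ["<ARG2>"]
    return [" ".join(seq[i:i + 3]) for i in range(len(seq) - 2)]


tokens = "Barack Obama was born in Honolulu , Hawaii .".split()
trigrams = context_trigrams(tokens, (0, 2), (5, 6))
# trigrams == ["<ARG1> was born", "was born in", "born in <ARG2>"]
```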
  • 14. Graph Construction
      • Edge Weights
          - Between an instance node and a context node in the same language:
            $w(v_{i,j}, u^k) = \begin{cases} 1 & \text{if } v_{i,j} \text{ has } u^k \text{ as a contextual subsequence,} \\ 0 & \text{otherwise.} \end{cases}$
          - Between context nodes in a language:
            $w(u^k, u^l) = J(u^k, u^l) = \dfrac{|u^k \cap u^l|}{|u^k \cup u^l|}$
          - Between context nodes in the source and target languages:
            $w(u_s^k, u_t^l) = \dfrac{\mathrm{count}(u_s^k, u_t^l)}{\sum_{u_t^m} \mathrm{count}(u_s^k, u_t^m)}$
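A minimal Python sketch of the two non-trivial edge weights follows. It rests on stated assumptions: trigrams are compared as sets of whitespace-separated tokens for the Jaccard coefficient, and the cross-lingual weight is the relative frequency of aligned context pairs, normalized over the target contexts seen with each source context. The function names and the toy input pairs are hypothetical.

```python
from collections import Counter


def jaccard(u_k, u_l):
    """Jaccard coefficient between two trigrams, treated as token sets."""
    a, b = set(u_k.split()), set(u_l.split())
    return len(a & b) / len(a | b)


def cross_lingual_weights(aligned_context_pairs):
    """Relative alignment frequency for (source context, target context) pairs.

    aligned_context_pairs: iterable of (u_s, u_t) observations.
    Each weight is count(u_s, u_t) divided by the total count of u_s.
    """
    counts = Counter(aligned_context_pairs)
    totals = Counter()
    for (u_s, _), c in counts.items():
        totals[u_s] += c
    return {(u_s, u_t): c / totals[u_s] for (u_s, u_t), c in counts.items()}


same_language = jaccard("<ARG1> was born", "was born in")

# Toy observations: one source trigram aligned to two target contexts.
pairs = [("was born in", "에서 태어났다")] * 3 + [("was born in", "의 호놀룰루")]
weights = cross_lingual_weights(pairs)
```

With these toy inputs the two trigrams share two of four distinct tokens, and the weights for the source trigram sum to one across its target contexts.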
  • 16. Label Propagation
      • Algorithm
          - Input
              • A transition matrix T
              • An initial label matrix Y0
          - Output
              • The updated label matrix Yt
      • [Flowchart: Initialize T → Normalize T → Initialize Y → Update Y]
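The four steps named on this slide (initialize T, normalize T, initialize Y, update Y) can be illustrated with a basic label propagation loop in plain Python. This is a generic sketch under assumptions, not the Junto toolkit used in the deck: T is row-normalized, unlabeled nodes start at zero, seed nodes are clamped to their initial labels after every update, and the loop runs for a fixed number of iterations.

```python
def label_propagation(weights, seeds, n_iter=50):
    """Propagate soft labels [y_plus, y_minus] over a weighted graph.

    weights: n x n list of non-negative edge weights (initial T).
    seeds: dict mapping node index -> [y_plus, y_minus] for labeled nodes.
    """
    n = len(weights)
    # Normalize T so each row sums to 1 (rows with no edges stay zero).
    T = []
    for row in weights:
        total = sum(row)
        T.append([w / total if total else 0.0 for w in row])
    # Initialize Y: seed labels where given, zeros elsewhere.
    Y = [list(seeds.get(i, [0.0, 0.0])) for i in range(n)]
    for _ in range(n_iter):
        # Update Y: each node takes the weighted average of its neighbours.
        Y = [[sum(T[i][j] * Y[j][c] for j in range(n)) for c in range(2)]
             for i in range(n)]
        # Clamp the seed nodes back to their given labels.
        for i, label in seeds.items():
            Y[i] = list(label)
    return Y


# Toy chain 0 - 1 - 2: node 0 is a positive seed, node 2 a negative seed,
# and node 1 is tied three times more strongly to node 0 than to node 2.
W = [[0.0, 3.0, 0.0],
     [3.0, 0.0, 1.0],
     [0.0, 1.0, 0.0]]
Y = label_propagation(W, {0: [1.0, 0.0], 2: [0.0, 1.0]})
```

In this toy graph the unlabeled middle node ends up leaning toward the positive seed, since its edge to that seed carries most of its weight.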
  • 17. Label Propagation
      • Executed in three phases
      • [Figure: three panels labeled 1st phase, 2nd phase, 3rd phase]
  • 19. Implementation
      • Dataset
          - English-Korean parallel corpus
              • 266,982 sentence pairs in English and Korean
              • Aligned by GIZA++
      • Annotation
          - ReVerb (Fader et al., 2011)
              • English Open IE system
      • Label Propagation
          - Junto Label Propagation Toolkit
      • Learning
          - Tree kernel-based SVM classifier
              • Shortest path dependency kernel (Bunescu and Mooney, 2005)
              • SVM-Light (Joachims, 1998)
  • 20. Evaluation
      • Dataset
          - Manually annotated Korean dataset
              • Obtained from the Web, following Bunescu and Mooney (2007)
              • 500 sentences with manual annotations for four relation types: Acquisition, Birthplace, Inventor Of, Won Prize
      • Evaluation Metrics
          - Precision/Recall/F-measure
  • 21. Experimental Results
      • Direct Projection vs. Graph-based Projection

                        Direct Projection       Graph-based Projection
        Type            P      R      F         P      R      F
        Acquisition     51.6   87.7   64.9      55.3   91.2   68.9
        Birthplace      69.8   84.5   76.4      73.8   87.3   80.0
        Inventor Of     62.4   85.3   72.1      66.3   89.7   76.3
        Won Prize       73.3   80.5   76.7      76.4   82.9   79.5
        Total           63.9   84.2   72.7      67.7   87.4   76.3
  • 22. Experimental Results
      • Comparisons to other self-supervised approaches
          - Heuristic-based Approach (Banko et al., 2007; Banko et al., 2008)
              • Korean Treebank and Syntactic Heuristics
          - Wikipedia-based Approach (Wu and Weld, 2010)
              • Korean Wikipedia articles and Infoboxes

        Approach            P       R       F
        Heuristic-based     92.31   17.27   29.09
        Wikipedia-based     66.67   66.91   66.79
        Projection-based    67.69   87.41   76.30
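The F column in these results is consistent with the usual balanced F-measure, the harmonic mean of precision and recall. The short Python check below assumes that definition (the deck does not state the formula) and uses the projection-based and heuristic-based rows as inputs; the function name `f_measure` is chosen for illustration.

```python
def f_measure(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported for two of the compared approaches.
projection_f = f_measure(67.69, 87.41)
heuristic_f = f_measure(92.31, 17.27)
```

The heuristic-based row shows why F is reported alongside precision: very high precision with low recall still gives a low harmonic mean.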
  • 24. Conclusion
      • Summary
          - A graph-based projection approach for relation extraction
              • Label propagation algorithm
              • On a graph that represents the instance and context features of both the source and target languages
          - Experimental results show that the approach improves relation extraction performance compared with the other approaches
      • Future work
          - To reduce the high complexity of the approach
          - To handle a more expanded graph structure to improve extraction performance
  • 25. Q&A