Neural Domain Adaptation for Biomedical Question Answering (2017)
Georg Wiese¹,², Dirk Weissenborn², Mariana Neves¹
¹ Hasso Plattner Institute, August Bebel Strasse 88, Potsdam, Germany
² Language Technology Lab, DFKI, Alt-Moabit 91c, Berlin, Germany
Motivation
● Neural question answering (QA) systems
outperform traditional methods in open-domain
factoid QA.
● In biomedicine, datasets are too small to apply
deep learning directly.
● Can we bridge this gap via domain adaptation?
● Our system is pre-trained on SQuAD, a large-scale (10⁵ questions) open-domain factoid QA dataset.
● Then, we adapt the system to the biomedical domain using BioASQ, a small (10³ questions) biomedical QA dataset.
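The pre-train-then-adapt recipe can be illustrated with a toy model (hypothetical data and a plain linear model standing in for the actual QA network): fit on a large "source" dataset first, then continue gradient descent from those weights on a small "target" dataset.

```python
import numpy as np

rng = np.random.default_rng(0)

def sgd_fit(w, X, y, lr=0.1, epochs=200):
    """Plain gradient descent on mean squared error."""
    for _ in range(epochs):
        grad = 2 * X.T @ (X @ w - y) / len(y)
        w = w - lr * grad
    return w

def mse(w, X, y):
    return float(np.mean((X @ w - y) ** 2))

# Large source domain (stands in for SQuAD, ~10^5 examples in the real setup).
X_src = rng.normal(size=(1000, 5))
y_src = X_src @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + 0.1 * rng.normal(size=1000)

# Small target domain (stands in for BioASQ): same task, slightly shifted weights.
X_tgt = rng.normal(size=(30, 5))
y_tgt = X_tgt @ np.array([1.2, -1.8, 0.5, 0.1, 2.9]) + 0.1 * rng.normal(size=30)

w_pre = sgd_fit(np.zeros(5), X_src, y_src)            # pre-train on source
w_finetuned = sgd_fit(w_pre, X_tgt, y_tgt, epochs=20) # short fine-tuning pass

print("target MSE at pre-trained weights:", mse(w_pre, X_tgt, y_tgt))
print("target MSE after fine-tuning     :", mse(w_finetuned, X_tgt, y_tgt))
```

Because the pre-trained weights already encode the shared task structure, only a short fine-tuning pass on the small target set is needed.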
[Figure: Extractive QA system architecture. Input layer: character embeddings, GloVe embeddings, biomedical embeddings, and question type features form the context and question embeddings. Output layer: start and end scores are converted to start and end probabilities via sigmoid or softmax activations.]
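The output-layer behavior in the figure can be sketched as follows (toy per-token scores, not the real network's): softmax turns the scores into a distribution over context positions for factoid questions, where exactly one span is picked, while sigmoid scores each position independently for list questions, where several answers may qualify.

```python
import numpy as np

def softmax(scores):
    z = np.exp(scores - scores.max())
    return z / z.sum()

def sigmoid(scores):
    return 1.0 / (1.0 + np.exp(-scores))

y_start = np.array([0.1, 2.3, -0.5, 0.9])  # hypothetical start scores per token
y_end = np.array([-1.0, 0.2, 3.1, 0.4])    # hypothetical end scores per token

# Factoid-style decoding: a single answer span.
p_start, p_end = softmax(y_start), softmax(y_end)
start = int(np.argmax(p_start))
end = start + int(np.argmax(p_end[start:]))  # end must not precede start
print("factoid span:", (start, end))

# List-style decoding: every position above a probability threshold.
starts = np.flatnonzero(sigmoid(y_start) > 0.5)
print("candidate list-answer starts:", starts.tolist())
```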
● Our architecture wraps an existing neural QA
system (FastQA [1]), with the following changes:
○ Input Layer: In addition to GloVe embeddings
and character embeddings, we feed biomedical
token embeddings and question type features.
○ Output Layer: We generalize our activation and
decoding process to support list questions in
addition to factoid questions.
● During training, we explore several domain
adaptation techniques, including mere fine-tuning,
joint training, and forgetting cost regularization [2].
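The forgetting cost of Riemer et al. [2] can be sketched with a toy linear model (the squared-error form and the `lam` weight here are illustrative choices, not the paper's exact formulation): while fine-tuning on the target data, the loss also penalizes divergence between the current model's outputs and those of the frozen pre-trained model, discouraging catastrophic forgetting of the source domain.

```python
import numpy as np

def forgetting_cost_loss(w, w_pretrained, X, y, lam=0.5):
    pred = X @ w
    pred_frozen = X @ w_pretrained       # outputs of the frozen source model
    task_loss = np.mean((pred - y) ** 2)             # fit the target data
    forgetting = np.mean((pred - pred_frozen) ** 2)  # stay close to source behavior
    return task_loss + lam * forgetting

rng = np.random.default_rng(1)
X = rng.normal(size=(20, 3))
y = rng.normal(size=20)
w_pre = np.array([1.0, -1.0, 0.5])

# At w = w_pre the forgetting term vanishes; moving away from the
# pre-trained weights trades it off against the task loss.
print("loss at pre-trained weights:", forgetting_cost_loss(w_pre, w_pre, X, y))
```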
Results
● Pre-training on SQuAD and fine-tuning
on BioASQ already improves performance
significantly over training on BioASQ only.
● The forgetting cost improves results
slightly for factoid questions.
● In order to compare our system to the state of the art in biomedical
QA, we tested it on the 2016 BioASQ challenge.
● We compared a single model against a model ensemble.
● Our system achieves state-of-the-art results on factoid questions
and competitive results on list questions.
Comparison to state of the art
[1] Weissenborn et al., "Making Neural QA as Simple as Possible but not Simpler"
[2] Riemer et al., "Representation Stability as a Regularizer for Improved Text Analytics Transfer Learning"