A Multi-layer LSTM-based Approach for Robot Command Interaction Modeling
IROS 2018 - Workshop on Language and Robotics
Martino Mensio (1), Emanuele Bastianelli (1), Ilaria Tiddi (2), Giuseppe Rizzo (3)
1: The Open University, 2: VU Amsterdam, 3: Istituto Superiore Mario Boella
Background
Human-robot interaction
- indoor environment
- natural language commands stating:
- action → semantic frame
- arguments → frame elements
Challenges:
- understand the sentence of the user
- instantiate the corresponding command
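The mapping from a command to a semantic frame and its frame elements can be pictured with a small data structure. The sketch below is illustrative only: the class names, the example sentence, and the "Bringing", "Theme" and "Goal" labels are assumptions chosen for the example, not taken from the slides.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class FrameElement:
    """One argument of the command: a role label plus the words it covers."""
    role: str  # illustrative role label, e.g. "Theme"
    text: str  # the words of the sentence filling that role


@dataclass
class Frame:
    """The action expressed by the command, together with its arguments."""
    name: str
    elements: List[FrameElement] = field(default_factory=list)

    def to_command(self) -> str:
        """Render the frame as a simple predicate-style command string."""
        args = ", ".join(f"{e.role}={e.text!r}" for e in self.elements)
        return f"{self.name}({args})"


# Hypothetical parse of the sentence "take the book to the kitchen".
frame = Frame(
    name="Bringing",
    elements=[
        FrameElement(role="Theme", text="the book"),
        FrameElement(role="Goal", text="to the kitchen"),
    ],
)
command = frame.to_command()
print(command)
```

Understanding the sentence corresponds to filling in such a structure; instantiating the command then means turning it into something the robot can execute.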
The pipeline
The LSTM neural network
Multi-layered LSTM
- word embeddings (GloVe) for each word
- bidirectional LSTM encoder
- other layers for specialized tasks
The dataset: HuRIC
E. Bastianelli, G. Castellucci, D. Croce, L. Iocchi, R. Basili, and D. Nardi, “HuRIC: a human robot
interaction corpus,” in Proceedings of the Ninth International Conference on Language Resources and
Evaluation (LREC’14). Reykjavik, Iceland: European Language Resources Association (ELRA), 2014.
Evaluation (1) - Semantic Parsing
Each task is evaluated with 5-fold cross-validation:
- AD: Action Detection (identification of the Frame)
- AI: Argument Identification (detecting the spans of the arguments)
- AC: Argument Classification (assigning a label to each argument)
F1          AD       AI       AC
BAS16       94.67%   90.74%   94.93%
3L-ATT      94.44%   94.73%   94.69%
3L-NO-ATT   95.37%   94.90%   91.90%
2L-ATT      96.29%   94.40%   92.30%
2L-NO-ATT   94.44%   94.50%   92.45%
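To make the difference between the AI and AC scores concrete, here is a minimal sketch of span-level F1 computed from IOB tags. The tag scheme, the exact-match criterion, the function names, and the example labels are all assumptions made for illustration; the slides do not state how matches were counted.

```python
from typing import List, Set, Tuple

Span = Tuple[int, int, str]  # (start token, end token exclusive, label)


def spans_from_iob(tags: List[str]) -> Set[Span]:
    """Collect labelled spans from IOB tags such as B-Theme, I-Theme, O."""
    spans: Set[Span] = set()
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel closes a trailing span
        inside = tag.startswith("I-") and label == tag[2:]
        if start is not None and not inside:
            spans.add((start, i, label))
            start, label = None, None
        if tag.startswith("B-"):
            start, label = i, tag[2:]
    return spans


def unlabelled(spans: Set[Span]) -> Set[Span]:
    """Drop the labels so that only span boundaries are compared."""
    return {(s, e, "") for s, e, _ in spans}


def f1_score(gold: Set[Span], predicted: Set[Span]) -> float:
    """Harmonic mean of precision and recall over exactly matching spans."""
    true_pos = len(gold & predicted)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(predicted)
    recall = true_pos / len(gold)
    return 2 * precision * recall / (precision + recall)


# Hypothetical tags for "take the book to the kitchen" (six tokens).
gold_tags = ["O", "B-Theme", "I-Theme", "B-Goal", "I-Goal", "I-Goal"]
pred_tags = ["O", "B-Theme", "I-Theme", "B-Source", "I-Source", "I-Source"]

gold = spans_from_iob(gold_tags)
pred = spans_from_iob(pred_tags)

ai_f1 = f1_score(unlabelled(gold), unlabelled(pred))  # boundaries only
ac_f1 = f1_score(gold, pred)                          # boundaries and labels
print(ai_f1, ac_f1)
```

In this example both spans are found, so the identification score is perfect, while one wrong label halves the classification score. Under 5-fold cross-validation such scores would be computed on each held-out fold and then aggregated.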
Evaluation (2): whole chain
The output of the parser is fed into the grounding module, and the performance of the whole chain is evaluated.
Whole chain   F1
BAS16         41.70%
3L-ATT        43.67%
3L-NO-ATT     41.92%
2L-ATT        44.54%
2L-NO-ATT     42.79%
Future work
- Study the attention values as a step towards explainability
- Design an interactive scenario for correction
http://kmi.open.ac.uk/
https://www.cs.vu.nl/
http://www.ismb.it/