SlideShare une entreprise Scribd logo
1  sur  29
Spanish resources of TrendMiner Project
Course on Advanced Topics
Combining Language and Web Technologies,
UNED, January 22, 2015
Paloma Martínez
Advanced Databases Group
labda.inf.uc3m.es
Universidad Carlos III de Madrid
CONTENTS
1. Challenges in automatic semantic analysis of
health information
2. Objective
3. Resources
4. Linguistic Processor
5. Real-time prototype
6. Annotation pipeline evaluation
7. Other methods to extract drug-effect relations
8. Possible extensions
No Estructurados Estructurados
• Extracción y Recuperación de información en el
dominio biomédico en distintos medios
(publicaciones científicas, redes sociales, notas
clínicas)
• ¿Cuántos datos estructurados se procesan de la
Historia Clínica Electrónica? ¿Y con los no
estructurados, qué se hace?
• Aplicaciones:
1. Soporte a la codificación ICD9/10, SNOMED
CT, ….(p.e. diagnósticos en partes de alta en
urgencias)
2. Filtrado de grandes volúmenes de
información
3. Extracción de información para alimentar BD
(p.e. extracción de interacciones entre
fármacos a partir de literatura médica)
4. Monitorización de eventos médicos en
distintos medios
RETOS EN EL ANÁLISIS SEMÁNTICO AUTOMÁTICO DE
INFORMACIÓN DE SALUD
1. Español
2. ≠ tipos de lenguaje (orientado a
pacientes, científico, clínico)
3. Fenómenos propios
(abreviaturas, gran ambigüedad,
…)
To detect drugs and medical events mentions
(drugs, diseases, symptoms, Adverse effects, ….)
from social media.
Social medial sources can be valuable sources
monitoring medical events
Different applications:
- Pharmacovigilance tasks performed in medicines
agencies and pharma companies.
- Filtering, classification and monitoring of health-
related social networks (blogs, forums, ….)
- Information Extraction tasks
OBJECTIVE
RESOURCES
Patients on Twitter
Spanish patient Forums
(1) Analyzed and monitored sources
Example of post in Forumclinic
(1) Analyzed and monitored sources
RESOURCES
Anatomical Therapeutic Chemical
(ATC) Classification System
Ibuprofeno
algiasdin|apirofeno|aragel|articalm|ast
efor|brufen|dalsy|dersindol|diltix|dole
ncar|doltra|espididol|espidifen|….
nauseas estomago revuelto|sentirse
mareado|nauseas|nauseas
solas|nauseoso|nauseoso|ansia
nauseosa|……
(2) Integrated existing semantic domain resources
35.259 terms
16.418 drugs and 2.228
active substances
2.566 ATC codes
42.548 main diseases
Cáncer|neoplasia maligna|….
RESOURCES
Level 1: Anatomical main group (1 letter)
Level 2: Therapeutic main group (2 digits)
Level 3: Pharmacological main group (1 letter)
Level 4: Chemical main group (1 letter)
Level 5: Active substance main group (2 digits)
ATC is a system of alphanumeric codes developed by the WHO for the classification of
drugs and other medical products
ATC Structure
(2) Integrated existing semantic domain resources
RESOURCES
(2) Integrated existing semantic domain resources
Example of ATC Structure
RESOURCES
(3) Integrated new semantic resources
Spanish DrugEffect DB containing relations
among drugs and effects
63.000 relations
RESOURCES
How has the Spanish
DrugEffect DB been built?
(3) Integrated new semantic resources
RESOURCES
LINGUISTIC PROCESSOR
REAL-TIME PROTOTYPE
 GATE Annotation Pipeline: Plugging developed for Textalytics
 Datawarehouse:
• Implemented on elasticsearch with an ATC-based index structure:
January 19, 2015 , 2,401,613 Tweets and 41,985 Blog Posts annotated an
indexed
• Amazon infrastructure for processing and storing
 Searching:
• Different search modes depending of ATC level or exact matching.
• Obtaining and distinguishing indications, adverse effects and possible
relations among drugs and effects.
• Searching for co-occurrences (drug-disease, drug-drug, ….). Also machine
learning as distant supervision approach (not in prototype)
 Visualization:
• Timeline to view evolution of mentions with different granularity.
• Viewing the annotated source text (tweets and posts)
REAL-TIME PROTOTYPE
Aggregated data about effects related to drug lorazepam
REAL-TIME PROTOTYPE
Lorazepam ATC tree cross-lingual
REAL-TIME PROTOTYPE
Drug-drug and drug-diseases co-ocurrences for lorazepam
active substance
REAL-TIME PROTOTYPE
Timeline with evolution of lorazepam related drugs
REAL-TIME PROTOTYPE
Drugs
R P F-Measure
strict 0,68 0,85 0,76
lenient 0,68 0,85 0,76
Drugs
• Using SpanishADR corpus (400 annotated comments from Forumclínic)
• To enhance recognizing of misspelled drugs
• Include abbreviations for drug families
• Solve several ambiguities (alcohol, oxygen)
ANNOTATION PIPELINE EVALUATION
Effects
R P F-Measure
strict 0,43 0,75 0,54
lenient 0,47 0,83 0,6
Effects
• Low performance:
• because colloquial expressions to describe an effect: me deja ko (it makes
me KO) or me cuesta más levantarme (it’s harder for me to wake up).
• different lexical variations and abbreviations of the same effect.
ANNOTATION PIPELINE EVALUATION
Drug-effect relations
Window size R P F-Measure R P F-Measure
strict 0,08 0,57 0,14 0,63 0,44 0,52
lenient 0,13 0,96 0,24 0,88 0,61 0,72
strict 0,1 0,34 0,16 0,74 0,26 0,38
lenient 0,23 0,74 0,35 0,99 0,34 0,51
strict 0,12 0,32 0,17 0,17 0,75 0,33
lenient 0,24 0,67 0,36 1 0,29 0,45
SpanishDrugEffectDB Coocurrences
30
100
250
• Performed from annotated drugs and effects
• Low recall using SpanishDrugEffect DB because low coverage of effects, the
lack of co-reference resolution and size of corpus (only 164 relations)
• Machine Learning tryed: distant supervision approaches
ANNOTATION PIPELINE EVALUATION
1. Distant-supervision method using the database on a collection
of 84,000 messages in order to extract the relations between
drugs and their effects (instances of DB are positive examples)
OTHER METHODS TO EXTRACT DRUG-EFFECT RELATIONS
2. To classify the relation instances, we used a kernel method
based only on shallow linguistic information of the
sentences.
3. Regarding Relation Extraction of drugs and their effects, the
distant supervision approach achieved a recall of 0.59 and a
precision of 0.48
OTHER METHODS TO EXTRACT DRUG-EFFECT RELATIONS
 Development of cross-lingual approach for ATC codes integrating
ATC ontology in collaboration with DFKI
 Populate Health TM ontologies from semantic resources in
collaboration with DFKI
 Many possibilities of customization:
• Tracking specific drugs appearing with different diseases or
effects
• Information Extraction from unstructured data (i.e., detection
of allergies in EHRs)
• Helping on-line health blogs and forums managers
POSSIBLE EXTENSIONS
References
Aplicación Distant supervision a la extracción de relaciones
Isabel Segura-Bedmar, Paloma Martínez, Ricardo Revert , Julián Moreno-
Schneider, (2015). Exploring Spanish Health Social Media for detecting drug
effects, BMC Medical Informatics and Decision Systems, January, 2015, Volumen:
In press.
Demos
Santiago Peña-González, Isabel Segura-Bedmar, Paloma Martínez, José Luis
Martínez Fernández,(2014). ADRSpanishTool: a tool for extracting adverse drug
reactions and indications, September, 2014, Procesamiento del Lenguaje Natural,
Volumen: 53, Páginas: 177-180, url.
Corpus para entrenamiento y test
María Herrero Zazo, Isabel Segura-Bedmar, Paloma Martínez, Thierry
Declerck, (2013). The DDI corpus: An annotated corpus with pharmacological
substances and drug-drug interactions, Journal of Biomedical Informatics (IF 2012:
2.131), October, 2013, Volumen: 46, Número: 5 DOI: 10.1016/j.jbi.2013.07.011,
Páginas: 914-920, url.
References
Corpus ADRs y enfoques basados en diccionarios
Isabel Segura-Bedmar, Ricardo Revert , Paloma Martínez, (2014). Detecting drugs
and adverse events from Spanish social media streams, Gothenburg, Sweden,
April, 2014, Proceedings of the 5th International Workshop on Health Text Mining
and Information Analysis (Louhi), Association for Computational Linguistics,
Páginas: 106-115, pdf.
Isabel Segura-Bedmar, Santiago Peña-González, Paloma
Martínez, (2014). Extracting drug indications and adverse drug reactions from
Spanish health social media, Proceedings of the BioNLP 2014 workshop, USA,
June, 2014, Association for Computational Linguistics, Páginas: 98-106, pdf.
Tarea Semeval DDIExtraction 2013 (http://www.cs.york.ac.uk/semeval-
2013/task9/)
Isabel Segura-Bedmar, Paloma Martínez, María Herrero Zazo, (2014). Lessons
learnt from the DDIExtraction-2013 shared task, January, 2014, Journal of
Biomedical Informatics (IF 2012: 2.131), Elsevier, ISSN: 1532-0464, Volumen: 51,
Páginas: 152-164, url.
PRESENTACIÓN DEL GRUPO
Contacto: Paloma Martínez
E-mail: pmf@inf.uc3m.es
@Grupo_LaBDA
labda.inf.uc3m.es

Contenu connexe

En vedette

ICT Media Platform versie sept 2016
ICT  Media Platform  versie sept 2016ICT  Media Platform  versie sept 2016
ICT Media Platform versie sept 2016Aschwin Geerts
 
Guide to Reason
Guide to ReasonGuide to Reason
Guide to Reasondmclewil
 
Promotion reebok revised worksheet 3 no yellow
Promotion reebok  revised worksheet 3 no yellowPromotion reebok  revised worksheet 3 no yellow
Promotion reebok revised worksheet 3 no yellowtarunkuchipudi
 
Yoni reyes avila
Yoni reyes avilaYoni reyes avila
Yoni reyes avilaFili MH
 

En vedette (7)

ICT Media Platform versie sept 2016
ICT  Media Platform  versie sept 2016ICT  Media Platform  versie sept 2016
ICT Media Platform versie sept 2016
 
shutters Australia
shutters Australiashutters Australia
shutters Australia
 
Guide to Reason
Guide to ReasonGuide to Reason
Guide to Reason
 
Perceptual Study SRG
Perceptual Study SRGPerceptual Study SRG
Perceptual Study SRG
 
CPC Newsletter 121514v2
CPC Newsletter 121514v2CPC Newsletter 121514v2
CPC Newsletter 121514v2
 
Promotion reebok revised worksheet 3 no yellow
Promotion reebok  revised worksheet 3 no yellowPromotion reebok  revised worksheet 3 no yellow
Promotion reebok revised worksheet 3 no yellow
 
Yoni reyes avila
Yoni reyes avilaYoni reyes avila
Yoni reyes avila
 

Similaire à Presentation "Spanish Resources in Trendminer Project"

Semantic Technologies for Healthcare
Semantic Technologies for HealthcareSemantic Technologies for Healthcare
Semantic Technologies for HealthcareSngular Meaning
 
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...Kees van Bochove
 
Nlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseasesNlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseaseseSAT Publishing House
 
Nlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseasesNlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseaseseSAT Journals
 
Drug Pill Recognition System Using Deep Learning
Drug Pill Recognition System Using Deep LearningDrug Pill Recognition System Using Deep Learning
Drug Pill Recognition System Using Deep LearningIRJET Journal
 
Artificial intelligence in drug discovery
Artificial intelligence in drug discoveryArtificial intelligence in drug discovery
Artificial intelligence in drug discoveryRAVINDRABABUKOPPERA
 
New Frontiers in Applied NLP​ - PAW Healthcare 2022
New Frontiers in Applied NLP​ - PAW Healthcare 2022New Frontiers in Applied NLP​ - PAW Healthcare 2022
New Frontiers in Applied NLP​ - PAW Healthcare 2022David Talby
 
Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...Athula Herath
 
Knowledge managementneedsinprescriptionmedicationprocess
Knowledge managementneedsinprescriptionmedicationprocessKnowledge managementneedsinprescriptionmedicationprocess
Knowledge managementneedsinprescriptionmedicationprocessNooshin Allahyari
 
Embi cri review-2012-final
Embi cri review-2012-finalEmbi cri review-2012-final
Embi cri review-2012-finalPeter Embi
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)ijceronline
 
Research trends in different pharmaceutical areas.docx
Research trends in different pharmaceutical areas.docxResearch trends in different pharmaceutical areas.docx
Research trends in different pharmaceutical areas.docxImtiajChowdhuryEham
 
Unlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media ListeningUnlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media ListeningRNayak3
 
Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Fondazione Giannino Bassetti
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Ankur Khanna
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judesSean Ekins
 
Quantitative methods of signal detection on spontaneous reporting system data...
Quantitative methods of signal detection on spontaneous reporting system data...Quantitative methods of signal detection on spontaneous reporting system data...
Quantitative methods of signal detection on spontaneous reporting system data...Francois MAIGNEN
 
Explore new Frontiers in Medicine with AI (2).pdf
Explore new Frontiers in Medicine with AI (2).pdfExplore new Frontiers in Medicine with AI (2).pdf
Explore new Frontiers in Medicine with AI (2).pdfNeomatician
 

Similaire à Presentation "Spanish Resources in Trendminer Project" (20)

Semantic Technologies for Healthcare
Semantic Technologies for HealthcareSemantic Technologies for Healthcare
Semantic Technologies for Healthcare
 
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...
 
Nlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseasesNlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseases
 
Nlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseasesNlp based retrieval of medical information for diagnosis of human diseases
Nlp based retrieval of medical information for diagnosis of human diseases
 
Drug Pill Recognition System Using Deep Learning
Drug Pill Recognition System Using Deep LearningDrug Pill Recognition System Using Deep Learning
Drug Pill Recognition System Using Deep Learning
 
Artificial intelligence in drug discovery
Artificial intelligence in drug discoveryArtificial intelligence in drug discovery
Artificial intelligence in drug discovery
 
New Frontiers in Applied NLP​ - PAW Healthcare 2022
New Frontiers in Applied NLP​ - PAW Healthcare 2022New Frontiers in Applied NLP​ - PAW Healthcare 2022
New Frontiers in Applied NLP​ - PAW Healthcare 2022
 
Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...Leverage machine learning and new technologies to enhance rwe generation and ...
Leverage machine learning and new technologies to enhance rwe generation and ...
 
Knowledge managementneedsinprescriptionmedicationprocess
Knowledge managementneedsinprescriptionmedicationprocessKnowledge managementneedsinprescriptionmedicationprocess
Knowledge managementneedsinprescriptionmedicationprocess
 
Embi cri review-2012-final
Embi cri review-2012-finalEmbi cri review-2012-final
Embi cri review-2012-final
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)International Journal of Computational Engineering Research(IJCER)
International Journal of Computational Engineering Research(IJCER)
 
Research trends in different pharmaceutical areas.docx
Research trends in different pharmaceutical areas.docxResearch trends in different pharmaceutical areas.docx
Research trends in different pharmaceutical areas.docx
 
Unlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media ListeningUnlocking Hidden Insights for Pharma with Social Media Listening
Unlocking Hidden Insights for Pharma with Social Media Listening
 
Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...
 
Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma Data Mining and Big Data Analytics in Pharma
Data Mining and Big Data Analytics in Pharma
 
Slides for st judes
Slides for st judesSlides for st judes
Slides for st judes
 
Definitva ehr
Definitva ehrDefinitva ehr
Definitva ehr
 
Quantitative methods of signal detection on spontaneous reporting system data...
Quantitative methods of signal detection on spontaneous reporting system data...Quantitative methods of signal detection on spontaneous reporting system data...
Quantitative methods of signal detection on spontaneous reporting system data...
 
Explore new Frontiers in Medicine with AI (2).pdf
Explore new Frontiers in Medicine with AI (2).pdfExplore new Frontiers in Medicine with AI (2).pdf
Explore new Frontiers in Medicine with AI (2).pdf
 

Plus de Grupo HULAT

Interaccion 2019 lourdes moreno
Interaccion 2019 lourdes morenoInteraccion 2019 lourdes moreno
Interaccion 2019 lourdes morenoGrupo HULAT
 
Low vision interaccion2018v4
Low vision interaccion2018v4Low vision interaccion2018v4
Low vision interaccion2018v4Grupo HULAT
 
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...Grupo HULAT
 
Babelfy: Entity Linking meets Word Sense Disambiguation.
Babelfy: Entity Linking meets Word Sense Disambiguation.Babelfy: Entity Linking meets Word Sense Disambiguation.
Babelfy: Entity Linking meets Word Sense Disambiguation.Grupo HULAT
 
Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...Grupo HULAT
 
New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...Grupo HULAT
 
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...Grupo HULAT
 
Extraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical TextsExtraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical TextsGrupo HULAT
 
Lessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction TaskLessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction TaskGrupo HULAT
 
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...Grupo HULAT
 
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Grupo HULAT
 
Accessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older peopleAccessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older peopleGrupo HULAT
 
Toward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processesToward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processesGrupo HULAT
 
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...Grupo HULAT
 
Formación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la UniversidadFormación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la UniversidadGrupo HULAT
 
Requisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimediaRequisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimediaGrupo HULAT
 
Integrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approachIntegrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approachGrupo HULAT
 
A MDD approach for modelling web accessibility
A MDD approach for modelling web accessibilityA MDD approach for modelling web accessibility
A MDD approach for modelling web accessibilityGrupo HULAT
 
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...Grupo HULAT
 
An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent Grupo HULAT
 

Plus de Grupo HULAT (20)

Interaccion 2019 lourdes moreno
Interaccion 2019 lourdes morenoInteraccion 2019 lourdes moreno
Interaccion 2019 lourdes moreno
 
Low vision interaccion2018v4
Low vision interaccion2018v4Low vision interaccion2018v4
Low vision interaccion2018v4
 
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
Exploring language technologies to provide support to WCAG 2.0 and E2R guidel...
 
Babelfy: Entity Linking meets Word Sense Disambiguation.
Babelfy: Entity Linking meets Word Sense Disambiguation.Babelfy: Entity Linking meets Word Sense Disambiguation.
Babelfy: Entity Linking meets Word Sense Disambiguation.
 
Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...Integration of Accessibility Requirements in the Design of Multimedia User Ag...
Integration of Accessibility Requirements in the Design of Multimedia User Ag...
 
New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...New Approaches to Interactive Multimedia Content Retrieval from different Sou...
New Approaches to Interactive Multimedia Content Retrieval from different Sou...
 
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
Mujeres, ciencia y tecnología. Encuesta sobre la percepción de las dificultad...
 
Extraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical TextsExtraction of Drug-Drug Interactions from Biomedical Texts
Extraction of Drug-Drug Interactions from Biomedical Texts
 
Lessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction TaskLessons from the Drug-Drug Interaction Extraction Task
Lessons from the Drug-Drug Interaction Extraction Task
 
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
BioSEPLN 2010 Workshop on Language Technology applied to biomedical and heal...
 
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
Building a Graph of Names and Contextual Patterns for Named Entity Classifica...
 
Accessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older peopleAccessibility to mobile interfaces for older people
Accessibility to mobile interfaces for older people
 
Toward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processesToward an integration of Web accessibility into testing processes
Toward an integration of Web accessibility into testing processes
 
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
Revisión de los requisitos de accesibilidad en la interacción del usuario anc...
 
Formación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la UniversidadFormación y tecnologías en accesibilidad para la Universidad
Formación y tecnologías en accesibilidad para la Universidad
 
Requisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimediaRequisitos de accesibilidad web en los reproductores multimedia
Requisitos de accesibilidad web en los reproductores multimedia
 
Integrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approachIntegrating HCI in a Web accessibility engineering approach
Integrating HCI in a Web accessibility engineering approach
 
A MDD approach for modelling web accessibility
A MDD approach for modelling web accessibilityA MDD approach for modelling web accessibility
A MDD approach for modelling web accessibility
 
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
Inclusive Usability Techniques in Requirements Analysis of Accessible Web App...
 
An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent An approach to User Interface Design of an accessible user agent
An approach to User Interface Design of an accessible user agent
 

Dernier

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Principled Technologies
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 

Dernier (20)

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 

Presentation "Spanish Resources in Trendminer Project"

  • 1. Spanish resources of TrendMiner Project Course on Advanced Topics Combining Language and Web Technologies, UNED, January 22, 2015 Paloma Martínez Advanced Databases Group labda.inf.uc3m.es Universidad Carlos III de Madrid
  • 2. CONTENTS 1. Challenges in automatic semantic analysis of health information 2. Objective 3. Resources 4. Linguistic Processor 5. Real-time prototype 6. Annotation pipeline evaluation 7. Other methods to extract drug-effect relations 8. Possible extensions
  • 3. No Estructurados Estructurados • Extracción y Recuperación de información en el dominio biomédico en distintos medios (publicaciones científicas, redes sociales, notas clínicas) • ¿Cuántos datos estructurados se procesan de la Historia Clínica Electrónica? ¿Y con los no estructurados, qué se hace? • Aplicaciones: 1. Soporte a la codificación ICD9/10, SNOMED CT, ….(p.e. diagnósticos en partes de alta en urgencias) 2. Filtrado de grandes volúmenes de información 3. Extracción de información para alimentar BD (p.e. extracción de interacciones entre fármacos a partir de literatura médica) 4. Monitorización de eventos médicos en distintos medios RETOS EN EL ANÁLISIS SEMÁNTICO AUTOMÁTICO DE INFORMACIÓN DE SALUD
  • 4. 1. Español 2. ≠ tipos de lenguaje (orientado a pacientes, científico, clínico) 3. Fenómenos propios (abreviaturas, gran ambigüedad, …)
  • 5.
  • 6. To detect drugs and medical events mentions (drugs, diseases, symptoms, Adverse effects, ….) from social media. Social medial sources can be valuable sources monitoring medical events Different applications: - Pharmacovigilance tasks performed in medicines agencies and pharma companies. - Filtering, classification and monitoring of health- related social networks (blogs, forums, ….) - Information Extraction tasks OBJECTIVE
  • 7. RESOURCES Patients on Twitter Spanish patient Forums (1) Analyzed and monitored sources
  • 8. Example of post in Forumclinic (1) Analyzed and monitored sources RESOURCES
  • 9. Anatomical Therapeutic Chemical (ATC) Classification System Ibuprofeno algiasdin|apirofeno|aragel|articalm|ast efor|brufen|dalsy|dersindol|diltix|dole ncar|doltra|espididol|espidifen|…. nauseas estomago revuelto|sentirse mareado|nauseas|nauseas solas|nauseoso|nauseoso|ansia nauseosa|…… (2) Integrated existing semantic domain resources 35.259 terms 16.418 drugs and 2.228 active substances 2.566 ATC codes 42.548 main diseases Cáncer|neoplasia maligna|…. RESOURCES
  • 10. Level 1: Anatomical main group (1 letter) Level 2: Therapeutic main group (2 digits) Level 3: Pharmacological main group (1 letter) Level 4: Chemical main group (1 letter) Level 5: Active substance main group (2 digits) ATC is a system of alphanumeric codes developed by the WHO for the classification of drugs and other medical products ATC Structure (2) Integrated existing semantic domain resources RESOURCES
  • 11. (2) Integrated existing semantic domain resources Example of ATC Structure RESOURCES
  • 12. (3) Integrated new semantic resources Spanish DrugEffect DB containing relations among drugs and effects 63.000 relations RESOURCES
  • 13. How has the Spanish DrugEffect DB been built? (3) Integrated new semantic resources RESOURCES
  • 16.  GATE Annotation Pipeline: Plugging developed for Textalytics  Datawarehouse: • Implemented on elasticsearch with an ATC-based index structure: January 19, 2015 , 2,401,613 Tweets and 41,985 Blog Posts annotated an indexed • Amazon infrastructure for processing and storing  Searching: • Different search modes depending of ATC level or exact matching. • Obtaining and distinguishing indications, adverse effects and possible relations among drugs and effects. • Searching for co-occurrences (drug-disease, drug-drug, ….). Also machine learning as distant supervision approach (not in prototype)  Visualization: • Timeline to view evolution of mentions with different granularity. • Viewing the annotated source text (tweets and posts) REAL-TIME PROTOTYPE
  • 17. Aggregated data about effects related to drug lorazepam REAL-TIME PROTOTYPE
  • 18. Lorazepam ATC tree cross-lingual REAL-TIME PROTOTYPE
  • 19. Drug-drug and drug-diseases co-ocurrences for lorazepam active substance REAL-TIME PROTOTYPE
  • 20. Timeline with evolution of lorazepam related drugs REAL-TIME PROTOTYPE
  • 21. Drugs R P F-Measure strict 0,68 0,85 0,76 lenient 0,68 0,85 0,76 Drugs • Using SpanishADR corpus (400 annotated comments from Forumclínic) • To enhance recognizing of misspelled drugs • Include abbreviations for drug families • Solve several ambiguities (alcohol, oxygen) ANNOTATION PIPELINE EVALUATION
  • 22. Effects R P F-Measure strict 0,43 0,75 0,54 lenient 0,47 0,83 0,6 Effects • Low performance: • because colloquial expressions to describe an effect: me deja ko (it makes me KO) or me cuesta más levantarme (it’s harder for me to wake up). • different lexical variations and abbreviations of the same effect. ANNOTATION PIPELINE EVALUATION
  • 23. Drug-effect relations Window size R P F-Measure R P F-Measure strict 0,08 0,57 0,14 0,63 0,44 0,52 lenient 0,13 0,96 0,24 0,88 0,61 0,72 strict 0,1 0,34 0,16 0,74 0,26 0,38 lenient 0,23 0,74 0,35 0,99 0,34 0,51 strict 0,12 0,32 0,17 0,17 0,75 0,33 lenient 0,24 0,67 0,36 1 0,29 0,45 SpanishDrugEffectDB Coocurrences 30 100 250 • Performed from annotated drugs and effects • Low recall using SpanishDrugEffect DB because low coverage of effects, the lack of co-reference resolution and size of corpus (only 164 relations) • Machine Learning tryed: distant supervision approaches ANNOTATION PIPELINE EVALUATION
  • 24. 1. Distant-supervision method using the database on a collection of 84,000 messages in order to extract the relations between drugs and their effects (instances of DB are positive examples) OTHER METHODS TO EXTRACT DRUG-EFFECT RELATIONS
  • 25. 2. To classify the relation instances, we used a kernel method based only on shallow linguistic information of the sentences. 3. Regarding Relation Extraction of drugs and their effects, the distant supervision approach achieved a recall of 0.59 and a precision of 0.48 OTHER METHODS TO EXTRACT DRUG-EFFECT RELATIONS
  • 26.  Development of cross-lingual approach for ATC codes integrating ATC ontology in collaboration with DFKI  Populate Health TM ontologies from semantic resources in collaboration with DFKI  Many possibilities of customization: • Tracking specific drugs appearing with different diseases or effects • Information Extraction from unstructured data (i.e., detection of allergies in EHRs) • Helping on-line health blogs and forums managers POSSIBLE EXTENSIONS
  • 27. References Aplicación Distant supervision a la extracción de relaciones Isabel Segura-Bedmar, Paloma Martínez, Ricardo Revert , Julián Moreno- Schneider, (2015). Exploring Spanish Health Social Media for detecting drug effects, BMC Medical Informatics and Decision Systems, January, 2015, Volumen: In press. Demos Santiago Peña-González, Isabel Segura-Bedmar, Paloma Martínez, José Luis Martínez Fernández,(2014). ADRSpanishTool: a tool for extracting adverse drug reactions and indications, September, 2014, Procesamiento del Lenguaje Natural, Volumen: 53, Páginas: 177-180, url. Corpus para entrenamiento y test María Herrero Zazo, Isabel Segura-Bedmar, Paloma Martínez, Thierry Declerck, (2013). The DDI corpus: An annotated corpus with pharmacological substances and drug-drug interactions, Journal of Biomedical Informatics (IF 2012: 2.131), October, 2013, Volumen: 46, Número: 5 DOI: 10.1016/j.jbi.2013.07.011, Páginas: 914-920, url.
  • 28. References Corpus ADRs y enfoques basados en diccionarios Isabel Segura-Bedmar, Ricardo Revert , Paloma Martínez, (2014). Detecting drugs and adverse events from Spanish social media streams, Gothenburg, Sweden, April, 2014, Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi), Association for Computational Linguistics, Páginas: 106-115, pdf. Isabel Segura-Bedmar, Santiago Peña-González, Paloma Martínez, (2014). Extracting drug indications and adverse drug reactions from Spanish health social media, Proceedings of the BioNLP 2014 workshop, USA, June, 2014, Association for Computational Linguistics, Páginas: 98-106, pdf. Tarea Semeval DDIExtraction 2013 (http://www.cs.york.ac.uk/semeval- 2013/task9/) Isabel Segura-Bedmar, Paloma Martínez, María Herrero Zazo, (2014). Lessons learnt from the DDIExtraction-2013 shared task, January, 2014, Journal of Biomedical Informatics (IF 2012: 2.131), Elsevier, ISSN: 1532-0464, Volumen: 51, Páginas: 152-164, url.
  • 29. PRESENTACIÓN DEL GRUPO Contacto: Paloma Martínez E-mail: pmf@inf.uc3m.es @Grupo_LaBDA labda.inf.uc3m.es

Notes de l'éditeur

  1. We have created a drug dictionary and a side effect dictionary which will be integrated in the Textalytics tool (developed by DAEDALUS). where each concept is represented by a generic drug. It also provides a list of all possible brand drugs for this generic drug. These brand drugs will be considered as synonyms. For now, we have focused on CIMA and MedDRA resources. CIMA is a comprehensive resource to create a dictionary to recognize generic and drugs. and MedDRA also provides a huge number of side effects in Spanish. Also, we have considered the ATC classification which allow to classify drugs in groups. N → Sistema Nervioso. - Grupo Anatómico principal. 05 → Psicolépticos. - Grupo Terapéutico principal. B → Ansiolíticos. - Subgrupo Terapéutico Farmacológico. A → Derivados benzodiacepínicos. - Subgrupo Químico-Terapéutico Farmacológico 01 → Diazepam - Sustancia final. En la bbdd ahora mismo sólo tenemos lo procedente de MedlinePlus y de Prospectos.org para anotar las relaciones como indicaciones o como efectos adversos en el método de relaciones (no se usa para anotar fármacos). La anotación de fármacos va en diccionarios o directamente en gazetteers. Esos 2228 de CIMA corresponden a las entradas del diccionario de trendminerplus que son fármacos. El número total de fármacos sumando diccionario de CIMA + gazetteers de MedlinePlus son las que pone la presentación: 914 medlineplus (habíamos filtrado las que ya estaban en los otros) + 1236 vademecum + 465 ATC + 2228 CIMA que dan un total de  4843.La bbdd en la que anotamos las relaciones tiene 3244 drugs y 63520 relaciones Drug-Effects que tenemos en la bbdd (Effects son 16940)
  2. We have created a drug dictionary and a side effect dictionary which will be integrated in the Textalytics tool (developed by DAEDALUS). where each concept is represented by a generic drug. It also provides a list of all possible brand drugs for this generic drug. These brand drugs will be considered as synonyms. For now, we have focused on CIMA and MedDRA resources. CIMA is a comprehensive resource to create a dictionary to recognize generic and drugs. and MedDRA also provides a huge number of side effects in Spanish. Also, we have considered the ATC classification which allow to classify drugs in groups. N → Sistema Nervioso. - Grupo Anatómico principal. 05 → Psicolépticos. - Grupo Terapéutico principal. B → Ansiolíticos. - Subgrupo Terapéutico Farmacológico. A → Derivados benzodiacepínicos. - Subgrupo Químico-Terapéutico Farmacológico 01 → Diazepam - Sustancia final. En la bbdd ahora mismo sólo tenemos lo procedente de MedlinePlus y de Prospectos.org para anotar las relaciones como indicaciones o como efectos adversos en el método de relaciones (no se usa para anotar fármacos). La anotación de fármacos va en diccionarios o directamente en gazetteers. Esos 2228 de CIMA corresponden a las entradas del diccionario de trendminerplus que son fármacos. El número total de fármacos sumando diccionario de CIMA + gazetteers de MedlinePlus son las que pone la presentación: 914 medlineplus (habíamos filtrado las que ya estaban en los otros) + 1236 vademecum + 465 ATC + 2228 CIMA que dan un total de  4843.La bbdd en la que anotamos las relaciones tiene 3244 drugs y 63520 relaciones Drug-Effects que tenemos en la bbdd (Effects son 16940)