SlideShare une entreprise Scribd logo
1  sur  28
Automated Extraction of Domain-specific Clinical Ontologies Segmenting, merging, and surveying modules Chimezie Ogbujicut@case.edu
Need for Ontology Bootstrapping There is a critical need for formal, reproducible methods for recognizing and filling gaps in medical terminologies (Cimino 1998) Clinical terminology systems need to extend smoothly and quickly in response to the needs of users (Rector 1999) A fixed, enumerated list of concepts can never be complete and results in a combinatorial explosion of terms (exhaustive pre-coordination)
A general best practice is to re-use ontologies, especially those that have been standardized However, there is a proliferation of (domain-specific) clinical ontologies Flies in the face of this best practice As more projects leverage the full value of reference, medical ontologies, there will be an increased need for automated management: Not there yet, mostly have coding systems
The Goal Want to (automatically) Customize a large source ontology such as SNOMED-CT in a tractable way Generate normalized, anatomy and clinical terminology modules that are manageable in size, and preserve the meaning of common terms Provide a framework for bootstrapping the creation of clinical terminology for a specific domain
Prior Work Noy and Musen (2000) Discuss how to either automate the merging and alignment or guide the user, suggesting conflicts and actions to take Rely on lexical matching of term names Bontas and Tolksdorf (2005) Similar goal as Noy & Musen User provides a list of term matches between source & target Follow semantic connections from these terms
Modularization:Ontology Engineering Seidenberg and Rector (2006) describe an ontology segmentation heuristic that starts with a set of terms and creates an extract from an ontology around those terms Traverses ontology structure and is limited by user-specified recursion depth
Seidenberg and Rector (2006)
Grau et al. (2008): Developing ontology P and want to re-use a set of symbols from (another) ontology Q without changing their meaning P + Q is a conservative extension of Q When answering a query involving terms in O (its signature or vocabulary), importing O'1  should give the same answers as if O' had been imported instead (both are subsets but O'1 is more manageable): Then we say O'1 is a module for O in O'
Segments v.s. Modules  The segmentation heuristic used is in contrast to (and predates) those of Grau et al. (2008) that produce modules with 100% semantic fidelity Sacrifice semantic fidelity for an expedient extraction process The (tractable) calculation of deductive, conservative extensions for EL is an open research problem
Materials SNOMED-CT Foundational Model of Anatomy (FMA) Common anatomy signature
Reference Clinical Ontologies There is a reasonable consensus around two reference ontologies that cover a substantial portion of clinical medicine SNOMED-CT and the FMA Both leverage an underlying formal knowledge representation
SNOMED-CT A comprehensive terminological framework for clinical documentation and reporting. Comprised of about half a million concepts: Clinical findings, procedures, body structures, organisms, substances, pharmaceutical products, specimen, quantitative measures, and clinical situations Has an underlying description logic (EL family) EL family has shown to be suitable for medical terminology And subsequently, ELHR+, the performance target of many modern classifiers
Technical challenges: Its size discourages the use of logical inference systems to manage and process it (due to performance issues) Most description logic systems run into challenges with memory exhaustion when classifying it in its entirety (there have been recent advances here) In some cases, its definitions are inconsistent or incomplete (more on this later) Policy pressures (opportunity): Participants in meaningful use program must capture EHR problem lists based on ICD-9 or SNOMED-CT
Using Modulzarization for Quality Assurance Plenty of (recent) work on quality assurance of SNOMED-CT Using Semantic Web technologies (and lattice theory) for quality assurance of large biomedical ontologies (Zhang et al. 2010) Identifying incorrect or clinically misleading SNOMED-CT inferences that arose from use of SNOMED-CT(Rector et al. 2011) More, recent QA of SNOMED-CT (Rector 2011) leverages extraction of  manageable modules and discusses the value to domain experts of browsing SNOMED-CT via a module built from a set of terms relevant to a domain or application
Foundational Model of Anatomy Goal is to conceptualize the physical objects and spaces that constitute the human body Leverages a frame-based knowledge representation to formulate over 75,000 concepts including: Macroscopic, microscopic, and sub-cellular canonical anatomy Anatomy is fundamental to biomedical domains
Concepts are connected by several mereological relations Primarily concerned with part_of and has_part Adheres to a strict, aristotelian modeling paradigm Ensures definitions are consistent and state the essence of anatomy in terms of their characteristics Using July 24th 2008 ALPHA version of the FMA 2.0 in OBO foundry
Common Anatomy Signature There is a significant overlap between anatomy terms in SNOMED-CT and FMA Bodenreider and Zhang (2006) analyzed this overlap Leveraged lexical and structural analysis Identified ~ 7500 common concepts Refer to as Sanatomy
Small Detail: SEP Triplets SNOMED-CT uses SEP triplets to model anatomy concepts and their relationships to each other For every proper SNOMED-CT anatomy concept (an Entire class), there are two auxiliary classes: A Structure class A Part class
Example
Main motivation is to rely on subsumption to reason about part-whole relationships SNOMED-CT is moving away from this, but for the purpose of using it in concert with the FMA, this is still an issue Previous work (Suntisrivaraporn 2007) demonstrated how an expressive description logic can be used to  more directly represent mereological relations.
Build on this but re-use terms (a transliteration) from a reference ontology of anatomy rather than re-using SNOMED-CT terms To preserve the meaning of anatomy terms but increase the (latent) knowledge about them and provide a terminology path to additional terms of interest
Reifying SEP triplets Need to replace SNOMED-CT anatomy terms in a way that preserves the intent of the SEP anatomy scheme Transcribe them into a more expressive description logic Define a set of rules to determine how axioms involving mapped SNOMED-CT terms are replaced (Shultz et al. 1998) describe how to logically identify components of an SEP triplet
Method Start with a list of user-specified SNOMED-CT concepts  Determines the domain 3 step process resulting in A SNOMED-CT module: O'snct-fma Transliteration of SEP triplets FMA segment: O'fma-snct Directly merge results into a single ontology
Segmenting and Merging Domain-specific Ontology Modules for Clinical Informatics (Ogbuji 2010)
Collecting the domain of discourse (Sahoo et al. 2011) Automatically extract a minimal common set of terms (upper-domain ontology) from an existing domain ontology Can be used to survey the generation of anatomy and clinical terminology modules: “For a given domain, what are the most general categories of (clinical) terminology that can be automatically extracted from specific distributions of SNOMED-CT and the FMA?”
Demonstration Implementation (Python) http://code.google.com/p/python-dlp/wiki/ClinicalOntologyModules Example: Atrial Fibrillation (disorder)

Contenu connexe

En vedette

Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...Chimezie Ogbuji
 
GRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereGRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereChimezie Ogbuji
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLChimezie Ogbuji
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchChimezie Ogbuji
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsChimezie Ogbuji
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic WebChimezie Ogbuji
 
Semantic Web Technologies: A Paradigm for Medical Informatics
Semantic Web Technologies: A Paradigm for Medical InformaticsSemantic Web Technologies: A Paradigm for Medical Informatics
Semantic Web Technologies: A Paradigm for Medical InformaticsChimezie Ogbuji
 
Using OWL for the RESO Data Dictionary
Using OWL for the RESO Data DictionaryUsing OWL for the RESO Data Dictionary
Using OWL for the RESO Data DictionaryChimezie Ogbuji
 

En vedette (8)

Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
Integrating Large, Disparate, Biomedical Ontologies to Boost Organ Developmen...
 
GRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and WhereGRDDL: The Why, What, How, and Where
GRDDL: The Why, What, How, and Where
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
 
Semantic Web use cases in outcomes research
Semantic Web use cases in outcomes researchSemantic Web use cases in outcomes research
Semantic Web use cases in outcomes research
 
Semantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical InformaticsSemantic Web Technologies as a Framework for Clinical Informatics
Semantic Web Technologies as a Framework for Clinical Informatics
 
UniProt and the Semantic Web
UniProt and the Semantic WebUniProt and the Semantic Web
UniProt and the Semantic Web
 
Semantic Web Technologies: A Paradigm for Medical Informatics
Semantic Web Technologies: A Paradigm for Medical InformaticsSemantic Web Technologies: A Paradigm for Medical Informatics
Semantic Web Technologies: A Paradigm for Medical Informatics
 
Using OWL for the RESO Data Dictionary
Using OWL for the RESO Data DictionaryUsing OWL for the RESO Data Dictionary
Using OWL for the RESO Data Dictionary
 

Similaire à Automated clinicalontologyextraction

Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6Zhe (Henry) He
 
BMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
BMI 201 - Investigating Term Reuse and Overlap in Biomedical OntologiesBMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
BMI 201 - Investigating Term Reuse and Overlap in Biomedical OntologiesMaulik Kamdar
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...IAEME Publication
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...IAEME Publication
 
From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...
From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...
From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...Health Informatics New Zealand
 
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
 Evaluating Semantic Similarity between Biomedical Concepts/Classes through S... Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...Editor IJCATR
 
Medinfo 2010 openEHR Clinical Modelling Worshop
Medinfo 2010 openEHR Clinical Modelling WorshopMedinfo 2010 openEHR Clinical Modelling Worshop
Medinfo 2010 openEHR Clinical Modelling WorshopKoray Atalag
 
A Rule-Based Approach For Semantic Annotation Evolution
A Rule-Based Approach For Semantic Annotation EvolutionA Rule-Based Approach For Semantic Annotation Evolution
A Rule-Based Approach For Semantic Annotation EvolutionMaria Perkins
 
An Essay Concerning Human Understanding Of Genetic Programming
An Essay Concerning Human Understanding Of Genetic ProgrammingAn Essay Concerning Human Understanding Of Genetic Programming
An Essay Concerning Human Understanding Of Genetic ProgrammingJennifer Roman
 
Designing of an efficient algorithm for identifying Abbreviation definitions ...
Designing of an efficient algorithm for identifying Abbreviation definitions ...Designing of an efficient algorithm for identifying Abbreviation definitions ...
Designing of an efficient algorithm for identifying Abbreviation definitions ...ijcsit
 
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS cscpconf
 
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLSSBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLScsandit
 
Chapter 18 advanced terminology systems
Chapter 18  advanced terminology systems Chapter 18  advanced terminology systems
Chapter 18 advanced terminology systems Minette Din
 
Accelerated life testing
Accelerated life testingAccelerated life testing
Accelerated life testingSteven Li
 
An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...
An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...
An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...drboon
 
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...ertekg
 
School of General Education Assignment Rubrics SC.docx
School of General Education  Assignment Rubrics SC.docxSchool of General Education  Assignment Rubrics SC.docx
School of General Education Assignment Rubrics SC.docxanhlodge
 

Similaire à Automated clinicalontologyextraction (20)

Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6Zhe_2014JointSummits_v6
Zhe_2014JointSummits_v6
 
BMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
BMI 201 - Investigating Term Reuse and Overlap in Biomedical OntologiesBMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
BMI 201 - Investigating Term Reuse and Overlap in Biomedical Ontologies
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...
 
From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...
From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...
From openEHR Domain Models to Advanced User Interfaces: a Case Study in Endos...
 
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
 Evaluating Semantic Similarity between Biomedical Concepts/Classes through S... Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
Evaluating Semantic Similarity between Biomedical Concepts/Classes through S...
 
Medinfo 2010 openEHR Clinical Modelling Worshop
Medinfo 2010 openEHR Clinical Modelling WorshopMedinfo 2010 openEHR Clinical Modelling Worshop
Medinfo 2010 openEHR Clinical Modelling Worshop
 
A Rule-Based Approach For Semantic Annotation Evolution
A Rule-Based Approach For Semantic Annotation EvolutionA Rule-Based Approach For Semantic Annotation Evolution
A Rule-Based Approach For Semantic Annotation Evolution
 
An Essay Concerning Human Understanding Of Genetic Programming
An Essay Concerning Human Understanding Of Genetic ProgrammingAn Essay Concerning Human Understanding Of Genetic Programming
An Essay Concerning Human Understanding Of Genetic Programming
 
Designing of an efficient algorithm for identifying Abbreviation definitions ...
Designing of an efficient algorithm for identifying Abbreviation definitions ...Designing of an efficient algorithm for identifying Abbreviation definitions ...
Designing of an efficient algorithm for identifying Abbreviation definitions ...
 
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
 
1 (2)
1 (2)1 (2)
1 (2)
 
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLSSBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
SBML FOR OPTIMIZING DECISION SUPPORT'S TOOLS
 
Chapter 18 advanced terminology systems
Chapter 18  advanced terminology systems Chapter 18  advanced terminology systems
Chapter 18 advanced terminology systems
 
الواجججج
الواججججالواجججج
الواجججج
 
Accelerated life testing
Accelerated life testingAccelerated life testing
Accelerated life testing
 
Prosdocimi ucb cdao
Prosdocimi ucb cdaoProsdocimi ucb cdao
Prosdocimi ucb cdao
 
An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...
An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...
An Application of Genetic Algorithm for Non-restricted Space and Pre-determin...
 
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
Encapsulating And Representing The Knowledge On The Evolution Of An Engineeri...
 
School of General Education Assignment Rubrics SC.docx
School of General Education  Assignment Rubrics SC.docxSchool of General Education  Assignment Rubrics SC.docx
School of General Education Assignment Rubrics SC.docx
 

Dernier

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxRustici Software
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 

Dernier (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 

Automated clinicalontologyextraction

  • 1. Automated Extraction of Domain-specific Clinical Ontologies Segmenting, merging, and surveying modules Chimezie Ogbujicut@case.edu
  • 2. Need for Ontology Bootstrapping There is a critical need for formal, reproducible methods for recognizing and filling gaps in medical terminologies (Cimino 1998) Clinical terminology systems need to extend smoothly and quickly in response to the needs of users (Rector 1999) A fixed, enumerated list of concepts can never be complete and results in a combinatorial explosion of terms (exhaustive pre-coordination)
  • 3. A general best practice is to re-use ontologies, especially those that have been standardized However, there is a proliferation of (domain-specific) clinical ontologies Flies in the face of this best practice As more projects leverage the full value of reference, medical ontologies, there will be an increased need for automated management: Not there yet, mostly have coding systems
  • 4. The Goal Want to (automatically) Customize a large source ontology such as SNOMED-CT in a tractable way Generate normalized, anatomy and clinical terminology modules that are manageable in size, and preserve the meaning of common terms Provide a framework for bootstrapping the creation of clinical terminology for a specific domain
  • 5. Prior Work Noy and Musen (2000) Discuss how to either automate the merging and alignment or guide the user, suggesting conflicts and actions to take Rely on lexical matching of term names Bontas and Tolksdorf (2005) Similar goal as Noy & Musen User provides a list of term matches between source & target Follow semantic connections from these terms
  • 6. Modularization:Ontology Engineering Seidenberg and Rector (2006) describe an ontology segmentation heuristic that starts with a set of terms and creates an extract from an ontology around those terms Traverses ontology structure and is limited by user-specified recursion depth
  • 8. Grau et al. (2008): Developing ontology P and want to re-use a set of symbols from (another) ontology Q without changing their meaning P + Q is a conservative extension of Q When answering a query involving terms in O (its signature or vocabulary), importing O'1 should give the same answers as if O' had been imported instead (both are subsets but O'1 is more manageable): Then we say O'1 is a module for O in O'
  • 9. Segments v.s. Modules The segmentation heuristic used is in contrast to (and predates) those of Grau et al. (2008) that produce modules with 100% semantic fidelity Sacrifice semantic fidelity for an expedient extraction process The (tractable) calculation of deductive, conservative extensions for EL is an open research problem
  • 10. Materials SNOMED-CT Foundational Model of Anatomy (FMA) Common anatomy signature
  • 11. Reference Clinical Ontologies There is a reasonable consensus around two reference ontologies that cover a substantial portion of clinical medicine SNOMED-CT and the FMA Both leverage an underlying formal knowledge representation
  • 12. SNOMED-CT A comprehensive terminological framework for clinical documentation and reporting. Comprised of about half a million concepts: Clinical findings, procedures, body structures, organisms, substances, pharmaceutical products, specimen, quantitative measures, and clinical situations Has an underlying description logic (EL family) EL family has shown to be suitable for medical terminology And subsequently, ELHR+, the performance target of many modern classifiers
  • 13. Technical challenges: Its size discourages the use of logical inference systems to manage and process it (due to performance issues) Most description logic systems run into challenges with memory exhaustion when classifying it in its entirety (there have been recent advances here) In some cases, its definitions are inconsistent or incomplete (more on this later) Policy pressures (opportunity): Participants in meaningful use program must capture EHR problem lists based on ICD-9 or SNOMED-CT
  • 14. Using Modulzarization for Quality Assurance Plenty of (recent) work on quality assurance of SNOMED-CT Using Semantic Web technologies (and lattice theory) for quality assurance of large biomedical ontologies (Zhang et al. 2010) Identifying incorrect or clinically misleading SNOMED-CT inferences that arose from use of SNOMED-CT(Rector et al. 2011) More, recent QA of SNOMED-CT (Rector 2011) leverages extraction of manageable modules and discusses the value to domain experts of browsing SNOMED-CT via a module built from a set of terms relevant to a domain or application
  • 15. Foundational Model of Anatomy Goal is to conceptualize the physical objects and spaces that constitute the human body Leverages a frame-based knowledge representation to formulate over 75,000 concepts including: Macroscopic, microscopic, and sub-cellular canonical anatomy Anatomy is fundamental to biomedical domains
  • 16. Concepts are connected by several mereological relations Primarily concerned with part_of and has_part Adheres to a strict, aristotelian modeling paradigm Ensures definitions are consistent and state the essence of anatomy in terms of their characteristics Using July 24th 2008 ALPHA version of the FMA 2.0 in OBO foundry
  • 17. Common Anatomy Signature There is a significant overlap between anatomy terms in SNOMED-CT and FMA Bodenreider and Zhang (2006) analyzed this overlap Leveraged lexical and structural analysis Identified ~ 7500 common concepts Refer to as Sanatomy
  • 18. Small Detail: SEP Triplets SNOMED-CT uses SEP triplets to model anatomy concepts and their relationships to each other For every proper SNOMED-CT anatomy concept (an Entire class), there are two auxiliary classes: A Structure class A Part class
  • 20. Main motivation is to rely on subsumption to reason about part-whole relationships SNOMED-CT is moving away from this, but for the purpose of using it in concert with the FMA, this is still an issue Previous work (Suntisrivaraporn 2007) demonstrated how an expressive description logic can be used to more directly represent mereological relations.
  • 21. Build on this but re-use terms (a transliteration) from a reference ontology of anatomy rather than re-using SNOMED-CT terms To preserve the meaning of anatomy terms but increase the (latent) knowledge about them and provide a terminology path to additional terms of interest
  • 22. Reifying SEP triplets Need to replace SNOMED-CT anatomy terms in a way that preserves the intent of the SEP anatomy scheme Transcribe them into a more expressive description logic Define a set of rules to determine how axioms involving mapped SNOMED-CT terms are replaced (Shultz et al. 1998) describe how to logically identify components of an SEP triplet
  • 23.
  • 24. Method Start with a list of user-specified SNOMED-CT concepts Determines the domain 3 step process resulting in A SNOMED-CT module: O'snct-fma Transliteration of SEP triplets FMA segment: O'fma-snct Directly merge results into a single ontology
  • 25. Segmenting and Merging Domain-specific Ontology Modules for Clinical Informatics (Ogbuji 2010)
  • 26.
  • 27. Collecting the domain of discourse (Sahoo et al. 2011) Automatically extract a minimal common set of terms (upper-domain ontology) from an existing domain ontology Can be used to survey the generation of anatomy and clinical terminology modules: “For a given domain, what are the most general categories of (clinical) terminology that can be automatically extracted from specific distributions of SNOMED-CT and the FMA?”
  • 28. Demonstration Implementation (Python) http://code.google.com/p/python-dlp/wiki/ClinicalOntologyModules Example: Atrial Fibrillation (disorder)