SlideShare une entreprise Scribd logo
1  sur  6
Télécharger pour lire hors ligne
Challenges in Understanding Clinical Notes:
Why NLP Engines Fall Short and Where Background
Knowledge Can Help
Sujan Perera
Ohio Center of Excellence in
Knowledge-enabled
Computing (Kno.e.sis)
Wright State University,
Dayton OH, USA
sujan@knoesis.org
Amit Sheth
Ohio Center of Excellence in
Knowledge-enabled
Computing (Kno.e.sis)
Wright State University,
Dayton OH, USA
amit@knoesis.org
Krishnaprasad Thirunarayan
Ohio Center of Excellence in
Knowledge-enabled
Computing (Kno.e.sis)
Wright State University,
Dayton OH, USA
tkprasad@knoesis.org
Suhas Nair
ezDI, LLC
suhas.nair@ezdi.us
Neil Shah
ezDI, LLC
neil@ezdi.us
ABSTRACT
Understanding of Electronic Medical Records(EMRs) plays
a crucial role in improving healthcare outcomes. However,
the unstructured nature of EMRs poses several technical
challenges for structured information extraction from clini-
cal notes leading to automatic analysis. Natural Language
Processing(NLP) techniques developed to process EMRs are
effective for variety of tasks, they often fail to preserve the
semantics of original information expressed in EMRs, partic-
ularly in complex scenarios. This paper illustrates the com-
plexity of the problems involved and deals with conflicts cre-
ated due to the shortcomings of NLP techniques and demon-
strates where domain specific knowledge bases can come to
rescue in resolving conflicts that can significantly improve
the semantic annotation and structured information extrac-
tion. We discuss various insights gained from our study on
real world dataset.
Categories and Subject Descriptors
I.7.m [Computing Methodologies]: Document and Text
ProcessingMiscellaneous
Keywords
Natural Language Processing, Negation Detection, Knowl-
edge Base
Permission to make digital or hard copies of all or part of this work for personal or
classroom use is granted without fee provided that copies are not made or distributed
for profit or commercial advantage and that copies bear this notice and the full cita-
tion on the first page. Copyrights for components of this work owned by others than
ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or re-
publish, to post on servers or to redistribute to lists, requires prior specific permission
and/or a fee. Request permissions from Permissions@acm.org.
DARE’13, Nov. 1, 2013, San Francisco, CA, USA.
Copyright 2013 ACM 978-1-4503-2425-0/13/11$15.00
http://dx.doi.org/10.1145/2512410.2512427 ...$15.00.
1. INTRODUCTION
The Patient Protection and Affordable Care Act has re-
cently introduced a number of new regulations and guide-
lines with the goal of improving healthcare outcomes by
2015. A key initiative has been the promotion of manage-
ment and exploitation of Electronic Medical Records(EMRs).
However, processing EMRs accurately requires addressing
and overcoming several technical and non-technical chal-
lenges. Specifically, the large volume of data that resides
in hospital databases, 80% of which is unstructured, should
be converted into machine understandable form preserving
semantics, to enable automatic analysis.
Conversion of unstructured components of EMRs, referred
to here as clinical notes, to a machine understandable form
requires multiple tasks including: 1) identification of the
domain entities (i.e., disorders, symptoms, procedures, find-
ings and medications), 2) annotation of domain entities with
standard vocabularies, 3) understanding different linguistic
and semantic components of the sentence construction such
as negation, conditioning and uncertainty, 4) detection of
whether the document discusses present condition1
or past
condition of the patient, 5) detection of whether the docu-
ment discusses patient’s condition or his/her family condi-
tion.
Natural Language Processing(NLP) techniques have been
developed to perform these tasks with reasonable accuracy.
For example, cTAKES[5], MedLEE[3] and MetaMap[1] are
among the major NLP engines tuned to perform the above
mentioned tasks for clinical notes. Even though NLP engines
that perform the above mentioned tasks have significantly
improved their outputs2
, they still exhibit significant short-
comings. For example, consider an EMR document that has
the following two sentences3
1
A condition includes disorders and symptoms
2
NLP output is machine understandable structured output,
e.g., an XML document
3
All examples are from real world EMR documents
1. He is not having any symptoms of chest pain or exer-
tional syncope or dizziness.
2. I advised him that if he experiences chest pain, short-
ness of breath with exertion or dizziness or syncopal
episodes to let us know and we can do appropriate
workup.
The first sentence clearly states that the patient does not
have chest pain, exertional syncope and dizziness and NLP
engine understands this correctly. The second sentence rec-
ommends actions if he experiences these conditions. But
unfortunately, current NLP engines do not understand that
the second sentence has a conditional statement and incor-
rectly outputs that the patient has chest pain, shortness of
breath, dizziness and syncope. This leads to a conflicting
scenario since the NLP output states both the presence and
absence of the same condition for the same patient at the
same time. This conflict must be resolved before a high-
fidelity and meaningful analysis and inference can be made
for clinical applications such as Clinical Document Improve-
ment(CDI) and Computer Assisted Coding(CAC).
We analyzed many conflicting instances such as the one
identified above with the clinical domain and health in-
formatics experts and recognized that contextual informa-
tion(such as the medication taken by the patient and other
conditions that the patient has) can be used to resolve these
conflicts. For example, the absence of relevant medications
for chest pain in the above example provides evidence that
the patient does not have chest pain while the presence of
such medications suggests the opposite. Motivated by these
observations we try to imitate the thought process of domain
experts to resolve the conflicts in the NLP output. This ap-
proach requires a reasonable amount of domain knowledge
in order to attempt to partially imitate the reasoning of
a domain expert in resolving or avoiding the conflicts. In
order to factor that in, we built a comprehensive knowl-
edge base by mining EMR documents and by leveraging the
open knowledge available in different formats. Our eval-
uation demonstrates that the knowledge bases can play a
crucial role in understanding EMRs by complementing NLP
algorithms and overcoming some of their limitations.
The rest of the paper is organized as follows, Section 2
discusses the problem in detail with real world examples.
Section 3 describes the knowledge base in detail. The im-
plementation is discussed in Section 4 and the evaluation is
presented in Section 5. Section 6 concludes with a summary
of our results and intended future work.
2. THE PROBLEM
EMRs use complex natural language constructs that are
easy to write and interpret by humans with common sense
reasoning and working knowledge of the domain, but hard
to process and make sense of automatically. That is, people
have less problem understanding what is stated in the EMRs
and can resolve apparent conflicts easily, but machines have
a hard time understanding the stated facts especially in the
presence of ambiguity. The following list contains linguis-
tic and semantic components of the natural language state-
ments that should be detected in order to understand the
EMRs.
• Negated statements - The statements that convey the
absence of domain concepts.
e.g., Patient has no angina, no orthopnea, PND, or
lower extremity edema.
• Conditional statements - The statements that convey
possible future status.
e.g., I advised him that if he experiences any chest
pain, shortness of breath with exertion or dizziness or
syncopal episodes to let us know and we can do appro-
priate workup.
• Uncertain statements - The statements that convey
some doubt about the patient status.
e.g., She is not sure if she is just depressed or not.
• Statements about patient’s medical history.
e.g., When the patient was last seen in the office,
he was complaining of fairly significant chest discom-
fort.
• Statements about patient’s family.
e.g., Also, several family members with high blood
pressure.
Current NLP engines cannot properly understand the above
exemplified components of the sentences and associate them
with relevant domain entities(i.e., no with angina, orthop-
nea, PND and lower extremity edema in the first sentence).
These shortcomings cause conflicts to be present in the out-
put of NLP engines that are widely used for processing clin-
ical notes.
Here we will demonstrate a number of scenarios where
current negation detection algorithms fail to understand the
semantics of the sentences. As we will show with the exam-
ples, although some of these shortcomings are due to pure
NLP issues, there are complex situations where it needs a
much more sophisticated solution that complements current
NLP techniques.
Example 1:
The original sentence: “the patient denies any chest discom-
fort, shortness of breath, orthopnea, paroxysmal nocturnal
dyspnea, palpitations, or lower extremity edema.”
According to the above sentence, the patient has none of
the conditions mentioned, but NLP output states that the
patient has palpitations and lower extremity edema. This is
mainly because the negation detection algorithm is designed
in a way that the effect of negation is propagated to prede-
fined number of entities to the right and to the left side of
negation indicating term(‘denies’).
Example 2 and Example 3 illustrate two instances where
the NLP algorithm incorrectly associates the negation indi-
cation term to the domain entity. The first example misin-
terprets the semantics, while the second example ultimately
turns out to be correct.
Example 2:
The original sentence: “I do not have an explanation for this
dyspnea.”
The sentence conveys that doctor was unable to find an
explanation for dyspnea, hence implicitly says the patient
has dyspnea. But the NLP algorithm incorrectly associates
‘not’ with dyspnea. This example shows that the algorithm
needs to have a deeper understanding of the words in order
to derive implicit semantics.
Example 3:
The original sentence: “there was no evidence of ischemia.”
The term “no” is associated with “evidence”. The current
NLP algorithm incorrectly associates “no” with ischemia,
but it turns out to interpret the semantics of this particular
example properly because the doctor interprets the absence
of evidence as absence of such condition(according to closed
world reasoning).
The first example showed that the NLP algorithm was not
able to associate the negation indicator with the correct enti-
ties, while second example shows that even if the algorithm
associates the negation indicator with the proper term, it
does not interpret the semantics of the sentence properly.
Hence, it requires a more sophisticated solution to accom-
pany NLP to understand the semantics of such sentences.
Although the above examples illustrate only the com-
plex scenarios for negation detection algorithms, similar in-
stances exist for other aspects of the sentences as well(e.g.,
conditional and uncertainty). The failure to understand the
semantics of the sentences leads to incorrect interpretations
of patient status. It is hard to detect such misinterpreta-
tions if there is only one mention of a particular condition
in the EMR. But when there are multiple mentions of the
same condition within the document and NLP misunder-
stands one such instance, it leads to conflicting scenarios
and NLP output reflects this conflict as illustrated earlier.
We have found 620 such conflicts within 3172 EMR docu-
ments parsed by cTAKES engine. The corresponding error
for applications that consume the output of an NLP en-
gine(e.g., CDI or CAC) would be considered unacceptable
in most cases. Our objective is to resolve these conflicts
created due to different reasons by using contextual infor-
mation available in the EMR document. Specifically, we
will demonstrate that these conflicts can be resolved with
the help of domain specific knowledge bases.
3. THE KNOWLEDGE BASE
We built a comprehensive knowledge base for the health-
care domain by mining EMR documents and leveraging the
open knowledge available in different formats. The knowl-
edge base is expressed in the Resource Description Frame-
work Schema(RDFS) language. The primary knowledge source
for the knowledge base is the Unified Medical Language Sys-
tem(UMLS)[2]. The UMLS consists of key domain concepts
that we are interested in and their hierarchical relationships.
We used the hierarchical relationships present in the UMLS
for disorders, symptoms and findings. However, the medi-
cation hierarchy is not explicitly present in UMLS. Instead,
UMLS contains tradename of relationships which links dif-
ferent brands of the same medication to its generic medica-
tion. Hence, we used tradename of relationship in order to
build the hierarchical relationships among the medications.
Specifically, all the different brands of the same medication
are added as sub-classes of the generic medication.
Our approach requires knowledge about the relationships
between disorders and symptoms (‘symptom A’ is symptom of
‘disorder B’) and the relationship between disorders/symptoms
and medications(‘disorder B’ is treated with ‘medication C’)
to resolve conflicts in NLP output4
. While UMLS pro-
vides the required coverage in terms of domain concepts
and their hierarchical relationships, it only contains a rela-
tionship between disorders/symptoms and medications (i.e.,
4
The experiments in this paper use only the relationships
between disorder/symptom and medications
Figure 1: A Snippet from Knowledge Base
Concept Type Number of Concepts
Condition(disorder or symp-
tom)
125778
Medication 298993
Findings 172230
Procedures 262360
Table 1: Number of Concepts from each type of
Domain Entities
Relationship Type Number of Relation-
ships
is symptom of 8299
is treated with 41182
Table 2: Number of Relationships among Concepts
is treated with). It lacks, for example, the knowledge about
the associations of disorders with the symptoms. Hence, we
developed a technique to mine the relationship between dis-
orders and symptoms from EMRs[4]. Effectively, we added
the is treated with relationship to our knowledge base from
UMLS and the is symptom of relationship by mining EMR
documents. Our knowledge base was further enriched by
using two Web resources5
. We decided to add the knowl-
edge in these two Web resources after conducting a careful
assessment about the quality of stated facts. Table 1 and
Table 2 contains the statistics about a recent version of our
knowledge base, which is subject to grow further. Figure 1
shows a snippet from the knowledge base.
4. IMPLEMENTATION
We choose cTAKES[5], an open source NLP engine under
the Apache License, as our NLP engine to parse unstruc-
tured EMRs. The input to cTAKES is an unstructured
EMR document and it outputs an XML document where
nodes in the XML document represent different semantic
types. Figure 2 shows an XML element condition6
which
describes a disorder found in the EMR. As shown in the ex-
ample, cTAKES annotates each entity found in EMR with
UMLS and SNOMED vocabularies via ‘code’ and ‘cui’ at-
tributes and assigns values to the following attributes asso-
ciated with each entity:
• polarity - This attribute can take values 0 or -1. -1
indicates that the entity is negated and 0 indicates it
is not negated.
• uncertainty - This attribute can take four values7
. Value
0 indicates patient’s current status, value 1 indicates
patient’s history, value 2 indicates the patient’s family
history and value 3 indicates the uncertain status.
• conditional - This attribute can take values either true
or false. If the sentence discusses the entity in a condi-
tional sense(e.g., ‘if pain persist, take tylenol’), it takes
value true, else it takes value false.
The annotations of Figure 2 are interpreted for the disor-
der “atrial fibrillation” as indicating a non-conditional (con-
ditional=“false”), the presence of disorder (polarity=“0”) right
now(uncertainty=“0”).
To identify the conflicts, we take the XML document as
the input to our method and search for condition nodes with
the same ‘cui’ value, and ‘0’ as the uncertainty value but
with different polarity values. These instances are identified
as conflicts, because they state that the patient has partic-
ular condition and does not have it at the same time. We
then collect the contextual information about that condi-
tion to assist with the disambiguation. Currently we col-
lect only related medications. Although EMR documents
5
The two Web resources are,
1. http://www.healthline.com
2. http://www.druglib.com
6
A conditional element can contain either disorder or symp-
tom.
7
Although this attribute is supposed to indicate the uncer-
tainty of the concept, only one value indicates about un-
certainty. The other 3 values indicate whether the sentence
discuss patient’s current condition or his/her medical history
or his/her family history.
Figure 2: XML element describes Atrial Fibrillation
in cTAKES output
contain a lot of unstructured text, it has a semi-structured
section, which contains information about the medications
taken by the patient in a bulleted list. Since this section
is a bulleted list and does not contain sentences, NLP en-
gines can identify medications and populate their attributes
(polarity, uncertainty, and conditional values) with great ac-
curacy. This motivates us to use medications as reliable
contextual information. We use our knowledge base to asso-
ciate and confirm the applicability of medication to treat a
particular condition, to resolve the conflict. We leverage the
hierarchical relationships that exist in the knowledge base
to identify the related medications for the particular disor-
der/symptom. For example, a particular patient might be
taking zocor which is a trade name of simvastatin (zocor is a
child of simvastatin in the knowledge base) and simavastatin
is a type of statin. Statins are used to treat high blood choles-
terol levels. Further, this relationship (high blood cholesterol
level is treated with statin) is present in our knowledge base.
Since we can infer that zocor is a statin by using the hierar-
chical relationships, we can associate zocor with high blood
cholesterol level. If we find such associations between med-
ications in EMR and conflicted disorder/symptom, we can
resolve the conflict by assigning ‘0’ to the polarity value,
otherwise we assign ‘-1’ to the polarity value.
5. A USE CASE
This section elaborates our approach with two real EMR
documents. The first document illustrates how we resolve
the conflict when a patient actually has the disorder, and the
second document illustrates how we resolve the conflict when
the patient does not have the disorder. The first scenario is
created by shortcomings associated with negation detection
in the NLP engine, while the second scenario is due to the
failure of the NLP engine to understand that the sentence
discusses the possibility of having a disorder/symptom and
is uncertain.
The selected EMR for the first illustration has a conflict
on coronary artery disease. The patient has coronary artery
disease and it is mentioned clearly in his diagnosis list. How-
ever, the EMR also has the following sentence in an unstruc-
tured portion that the NLP engine is unable to interpret
correctly, and outputs ‘-1’ for the polarity of coronary artery
disease.
“Send for carotid duplex to rule out carotid artery steno-
sis given his risk factors and underlying coronary artery dis-
ease.”
According to the semantics of the sentence, the patient has
coronary artery disease and the term ‘rule out’ is applied to
carotid artery stenosis, but NLP engine associates ‘rule out’
with coronary artery disease resulting in an incorrect value
for the polarity attribute. Hence the generated XML docu-
ment contains two condition elements one with polarity ‘0’
and the other with ‘-1’ and both have ‘0’ as their uncertainty
value. This triggers our method to collect the context for
coronary artery disease within the EMR.
The following set of medications were found in the EMR
and identified as treating the coronary artery disease with
the help of the knowledge base.
• Ramipril
• Nitroglycerin
• Enalapril
• Zocor
• Warfarin
• Aspirin
This set of medications helps us to confirm that the pa-
tient is more likely to have coronary artery disease.
The second EMR document has a conflict on lower ex-
tremity edema. It contains the following two sentences.
“Extremities : Warm and dry. No clubbing or cyanosis. No
lower extremity edema.”
“I have advised the patient on the side effect of potential
lower extremity edema.”
Clearly the first sentence states that the patient does not
have lower extremity edema and the second sentence dis-
cusses the possibility of lower extremity edema as a side
effect. But the NLP engine does not understand the se-
mantics of the second sentence and interprets it as patient
currently has lower extremity edema. Further examination
of the EMR document with the help of the knowledge base
did not indicate any medication that may treat lower ex-
tremity edema. This observation suggests that the patient
is unlikely to have lower extremity edema as stated in the
EMR.
6. EVALUATION
We provide a preliminary evaluation of our method by
using real world data sources to see if it holds the expected
promise. Information presented here is within the constraints
and guidelines of the IRB protocol involving the use of the
limited data set. We selected 25 documents from our cor-
pus for the evaluation which have conflicts in their gener-
ated structured versions(XML documents). The XML doc-
uments were generated by parsing EMR documents through
cTAKES NLP engine. There were 32 conflicting instances
within these 25 documents. Table 3 summarizes the results
of conflict resolution using contextual information.
The accuracy of the experiment is defined as,
accuracy =
TP + TN
TP + TN + FP + FN
where TP is true positives, TN is true negatives, FP is
false positives and FN is false negatives.
The accuracy for this experiment was 71.87%. We will
discuss three important observations we can make from our
results.
Predicted Class
Positive Negative
Actual Class
Positive 18 6
Negative 3 5
Table 3: Results of Conflict Resolution. Label ‘pos-
itive’ means patient has particular condition and
‘negative’ means patient does not have particular
condition.
(1) False negatives are for common symptoms:
The 4 out of 6 instances categorized incorrectly as nega-
tive(false negative) are instances where our approach tries
to resolve conflicts involving common symptoms. Within
those 4 instances, headache and obesity appear once each
and shortness of breath appears twice. Doctors may not al-
ways prescribe medications to cure these kinds of symptoms
as they are very generic in nature and can co-occur with
many diseases. As such, doctors prefer to rather cure the
condition associated with these symptoms using very spe-
cific medications addressing the ultimate cause. Hence our
method was not able to categorize these instances as belong-
ing to the positive class even though the patient has these
symptoms.
(2) False positives are based on common medica-
tions:
We noticed that aspirin is the only medication that ap-
peared in collected contextual information for all 3 instances
incorrectly categorized as positive(false positives). So our
method categorizes these negative instances as positive based
on the evidence that the patient takes aspirin. Aspirin is
very a generic medication and it can provide minimal con-
clusive information about the patient’s conditions. This ob-
servation illustrates the need for ranking of the domain rela-
tionships based on specificity, i.e., the relationship between
metoprolol and hypertension is more specific than the rela-
tionship between aspirin and hypertension, because if the
patient takes metoprolol, it provides strong evidence that
the patient is a hypertensive patient than a patient who
takes aspirin even though both medications are prescribed
to hypertensive patients. This type of ranking will help to
ignore evidences which provide minimal conclusive informa-
tion about the patient’s condition and help to improve the
results by eliminating false positives.
(3) Conflicts over major disorders can be resolved
with good accuracy:
Our approach has been able to resolve conflicts over major
disorders with good accuracy. There were 17 conflicting in-
stances within the total of 32 on major disorders and the
following list contains the frequency of each disorder.
• 5 instances of coronary artery disease.
• 4 instances of atrial fibrillation.
• 2 instances of peripheral vascular disease.
• 1 instance each from ischemia, cardiomyopathies, coro-
nary heart disease, arthritis, ischemic cardiomyopathy,
and aortic valve stenosis.
We were able to resolve 15 out of 17 conflicts correctly.
Since these are major conditions and patients will have to
face more serious problems if they do not treat these condi-
tions well, they will be taking medications to cure or control
the condition. This fact is reflected in the EMRs and helps
us in resolving conflicts.
7. CONCLUSION AND FUTURE WORK
This paper demonstrates shortcomings of the state-of-the-
art NLP algorithms used to understand unstructured EMRs
and shows the need for complementary techniques for accu-
rate interpretation of documents. As shown in the examples,
there are instances which require much deeper understand-
ing of the words and some common sense reasoning for bet-
ter interpretation. Also, we proposed a method to resolve
conflicting instances created due to shortcomings of NLP
engines for clinical data processing with the help of com-
prehensive knowledge base. Our evaluation shows that this
method can resolve the conflicting instances over major dis-
orders with great accuracy and provide insights to improve
the approach.
While we have demonstrated results of the proposed method
in conflict resolution, we believe the knowledge-based ap-
proach can be used to detect and address many parsing is-
sues in NLP algorithms. For example, although NLP al-
gorithms detect that a patient does not have a particular
disorder/symptom(not a conflicting instance), if the contex-
tual information seems to strongly suggest otherwise, this
can trigger a need for better analysis of the original sen-
tences.
As the next step, we will work on ranking the relation-
ships based on their specificity. We believe it can be done
by analyzing the number of relationships each concept par-
ticipates in and the nature of the participating concepts. A
more comprehensive evaluation will also be shared in near
future.
8. REFERENCES
[1] Alan R Aronson. Metamap: Mapping text to the umls
metathesaurus. Bethesda, MD: NLM, NIH, DHHS,
2006.
[2] Olivier Bodenreider. The unified medical language
system (umls): integrating biomedical terminology.
Nucleic acids research, 32(suppl 1):D267–D270, 2004.
[3] Carol Friedman, Philip O Alderson, John HM Austin,
James J Cimino, and Stephen B Johnson. A general
natural-language text processor for clinical radiology.
Journal of the American Medical Informatics
Association, 1(2):161–174, 1994.
[4] Sujan Perera, Cory Henson, Krishnaprasad
Thirunarayan, Amit Sheth, and Suhas Nair. Data
driven knowledge acquisition method for domain
knowledge enrichment in the healthcare. In IEEE
International Conference on Bioinformatics and
Biomedicine, pages 1–8, October 2012.
[5] Guergana K Savova, James J Masanz, Philip V Ogren,
Jiaping Zheng, Sunghwan Sohn, Karin C
Kipper-Schuler, and Christopher G Chute. Mayo
clinical text analysis and knowledge extraction system
(ctakes): architecture, component evaluation and
applications. Journal of the American Medical
Informatics Association, 17(5):507–513, 2010.

Contenu connexe

Similaire à Challenges in understanding clinical notes: Why NLP Engines Fall Short

Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_SymptomsCentral_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Michelle Phan (L.I.O.N.)
 
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_SymptomsCentral_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Tiffany Nealy
 
Information GovernanceTitleCourse NameTo.docx
Information GovernanceTitleCourse NameTo.docxInformation GovernanceTitleCourse NameTo.docx
Information GovernanceTitleCourse NameTo.docx
dirkrplav
 
Risk management in obstetric & gynaecology
Risk management in obstetric &     gynaecologyRisk management in obstetric &     gynaecology
Risk management in obstetric & gynaecology
drmcbansal
 

Similaire à Challenges in understanding clinical notes: Why NLP Engines Fall Short (20)

cognitive computing for electronic medical record
cognitive computing for electronic medical record cognitive computing for electronic medical record
cognitive computing for electronic medical record
 
Natural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health RecordsNatural Language Processing to Curate Unstructured Electronic Health Records
Natural Language Processing to Curate Unstructured Electronic Health Records
 
EHR Implementation PPT.pptx
EHR Implementation PPT.pptxEHR Implementation PPT.pptx
EHR Implementation PPT.pptx
 
Essential Biology E5 (HL) The Human Brain
Essential Biology E5 (HL) The Human BrainEssential Biology E5 (HL) The Human Brain
Essential Biology E5 (HL) The Human Brain
 
NueWaveBusinessPlan
NueWaveBusinessPlanNueWaveBusinessPlan
NueWaveBusinessPlan
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
 
IRJET- Techniques for Analyzing Job Satisfaction in Working Employees – A...
IRJET-  	  Techniques for Analyzing Job Satisfaction in Working Employees – A...IRJET-  	  Techniques for Analyzing Job Satisfaction in Working Employees – A...
IRJET- Techniques for Analyzing Job Satisfaction in Working Employees – A...
 
Machine Learning and the Value of Health Technologies
Machine Learning and the Value of Health TechnologiesMachine Learning and the Value of Health Technologies
Machine Learning and the Value of Health Technologies
 
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_SymptomsCentral_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
 
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_SymptomsCentral_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
Central_Analytics_Treating_the_Cause_Not_Just_the_Symptoms
 
ARTIFICIAL INTELLIGENCE BASED DATA GOVERNANCE FOR CHINESE ELECTRONIC HEALTH R...
ARTIFICIAL INTELLIGENCE BASED DATA GOVERNANCE FOR CHINESE ELECTRONIC HEALTH R...ARTIFICIAL INTELLIGENCE BASED DATA GOVERNANCE FOR CHINESE ELECTRONIC HEALTH R...
ARTIFICIAL INTELLIGENCE BASED DATA GOVERNANCE FOR CHINESE ELECTRONIC HEALTH R...
 
IJSRED-V2I3P26
IJSRED-V2I3P26IJSRED-V2I3P26
IJSRED-V2I3P26
 
Information GovernanceTitleCourse NameTo.docx
Information GovernanceTitleCourse NameTo.docxInformation GovernanceTitleCourse NameTo.docx
Information GovernanceTitleCourse NameTo.docx
 
Must know Secrets For Easier EHR Documentation
Must know Secrets For Easier EHR DocumentationMust know Secrets For Easier EHR Documentation
Must know Secrets For Easier EHR Documentation
 
Artificial Intelligence in oral radiology .pptx
Artificial Intelligence in oral radiology .pptxArtificial Intelligence in oral radiology .pptx
Artificial Intelligence in oral radiology .pptx
 
Psychology of Informatics Integration in Healthcare
Psychology of Informatics Integration in HealthcarePsychology of Informatics Integration in Healthcare
Psychology of Informatics Integration in Healthcare
 
Risk management in obstetric & gynaecology
Risk management in obstetric &     gynaecologyRisk management in obstetric &     gynaecology
Risk management in obstetric & gynaecology
 
Building a Better Regional Anesthesia Note (on paper or in an EHR)
Building a Better Regional Anesthesia Note (on paper or in an EHR)Building a Better Regional Anesthesia Note (on paper or in an EHR)
Building a Better Regional Anesthesia Note (on paper or in an EHR)
 
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
Apache Spark NLP for Healthcare: Lessons Learned Building Real-World Healthca...
 
Health Informatics- Module 1-Chapter 2.pptx
Health Informatics- Module 1-Chapter 2.pptxHealth Informatics- Module 1-Chapter 2.pptx
Health Informatics- Module 1-Chapter 2.pptx
 

Dernier

Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
adilkhan87451
 
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
mahaiklolahd
 

Dernier (20)

Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
Coimbatore Call Girls in Coimbatore 7427069034 genuine Escort Service Girl 10...
 
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
Russian Call Girls Lucknow Just Call 👉👉7877925207 Top Class Call Girl Service...
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
Manyata Tech Park ( Call Girls ) Bangalore ✔ 6297143586 ✔ Hot Model With Sexy...
 
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7Call Girls in Gagan Vihar (delhi) call me [🔝  9953056974 🔝] escort service 24X7
Call Girls in Gagan Vihar (delhi) call me [🔝 9953056974 🔝] escort service 24X7
 
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...Top Rated  Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
Top Rated Hyderabad Call Girls Erragadda ⟟ 9332606886 ⟟ Call Me For Genuine ...
 
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
Saket * Call Girls in Delhi - Phone 9711199012 Escorts Service at 6k to 50k a...
 
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service AvailableTrichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
Trichy Call Girls Book Now 9630942363 Top Class Trichy Escort Service Available
 
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...Russian Call Girls Service  Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
Russian Call Girls Service Jaipur {8445551418} ❤️PALLAVI VIP Jaipur Call Gir...
 
Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
Call Girls Kolkata Kalikapur 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
 
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls  * UPA...
Call Girl in Indore 8827247818 {LowPrice} ❤️ (ahana) Indore Call Girls * UPA...
 
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service AvailableCall Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
Call Girls Rishikesh Just Call 9667172968 Top Class Call Girl Service Available
 
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Hosur Just Call 9630942363 Top Class Call Girl Service Available
 
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
Premium Call Girls In Jaipur {8445551418} ❤️VVIP SEEMA Call Girl in Jaipur Ra...
 
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service AvailableCall Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
Call Girls Madurai Just Call 9630942363 Top Class Call Girl Service Available
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 9332606886 𖠋 Will You Mis...
 
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
9630942363 Genuine Call Girls In Ahmedabad Gujarat Call Girls Service
 
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
Premium Bangalore Call Girls Jigani Dail 6378878445 Escort Service For Hot Ma...
 

Challenges in understanding clinical notes: Why NLP Engines Fall Short

  • 1. Challenges in Understanding Clinical Notes: Why NLP Engines Fall Short and Where Background Knowledge Can Help Sujan Perera Ohio Center of Excellence in Knowledge-enabled Computing (Kno.e.sis) Wright State University, Dayton OH, USA sujan@knoesis.org Amit Sheth Ohio Center of Excellence in Knowledge-enabled Computing (Kno.e.sis) Wright State University, Dayton OH, USA amit@knoesis.org Krishnaprasad Thirunarayan Ohio Center of Excellence in Knowledge-enabled Computing (Kno.e.sis) Wright State University, Dayton OH, USA tkprasad@knoesis.org Suhas Nair ezDI, LLC suhas.nair@ezdi.us Neil Shah ezDI, LLC neil@ezdi.us ABSTRACT Understanding of Electronic Medical Records(EMRs) plays a crucial role in improving healthcare outcomes. However, the unstructured nature of EMRs poses several technical challenges for structured information extraction from clini- cal notes leading to automatic analysis. Natural Language Processing(NLP) techniques developed to process EMRs are effective for variety of tasks, they often fail to preserve the semantics of original information expressed in EMRs, partic- ularly in complex scenarios. This paper illustrates the com- plexity of the problems involved and deals with conflicts cre- ated due to the shortcomings of NLP techniques and demon- strates where domain specific knowledge bases can come to rescue in resolving conflicts that can significantly improve the semantic annotation and structured information extrac- tion. We discuss various insights gained from our study on real world dataset. Categories and Subject Descriptors I.7.m [Computing Methodologies]: Document and Text ProcessingMiscellaneous Keywords Natural Language Processing, Negation Detection, Knowl- edge Base Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full cita- tion on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or re- publish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from Permissions@acm.org. DARE’13, Nov. 1, 2013, San Francisco, CA, USA. Copyright 2013 ACM 978-1-4503-2425-0/13/11$15.00 http://dx.doi.org/10.1145/2512410.2512427 ...$15.00. 1. INTRODUCTION The Patient Protection and Affordable Care Act has re- cently introduced a number of new regulations and guide- lines with the goal of improving healthcare outcomes by 2015. A key initiative has been the promotion of manage- ment and exploitation of Electronic Medical Records(EMRs). However, processing EMRs accurately requires addressing and overcoming several technical and non-technical chal- lenges. Specifically, the large volume of data that resides in hospital databases, 80% of which is unstructured, should be converted into machine understandable form preserving semantics, to enable automatic analysis. Conversion of unstructured components of EMRs, referred to here as clinical notes, to a machine understandable form requires multiple tasks including: 1) identification of the domain entities (i.e., disorders, symptoms, procedures, find- ings and medications), 2) annotation of domain entities with standard vocabularies, 3) understanding different linguistic and semantic components of the sentence construction such as negation, conditioning and uncertainty, 4) detection of whether the document discusses present condition1 or past condition of the patient, 5) detection of whether the docu- ment discusses patient’s condition or his/her family condi- tion. Natural Language Processing(NLP) techniques have been developed to perform these tasks with reasonable accuracy. For example, cTAKES[5], MedLEE[3] and MetaMap[1] are among the major NLP engines tuned to perform the above mentioned tasks for clinical notes. Even though NLP engines that perform the above mentioned tasks have significantly improved their outputs2 , they still exhibit significant short- comings. For example, consider an EMR document that has the following two sentences3 1 A condition includes disorders and symptoms 2 NLP output is machine understandable structured output, e.g., an XML document 3 All examples are from real world EMR documents
  • 2. 1. He is not having any symptoms of chest pain or exer- tional syncope or dizziness. 2. I advised him that if he experiences chest pain, short- ness of breath with exertion or dizziness or syncopal episodes to let us know and we can do appropriate workup. The first sentence clearly states that the patient does not have chest pain, exertional syncope and dizziness and NLP engine understands this correctly. The second sentence rec- ommends actions if he experiences these conditions. But unfortunately, current NLP engines do not understand that the second sentence has a conditional statement and incor- rectly outputs that the patient has chest pain, shortness of breath, dizziness and syncope. This leads to a conflicting scenario since the NLP output states both the presence and absence of the same condition for the same patient at the same time. This conflict must be resolved before a high- fidelity and meaningful analysis and inference can be made for clinical applications such as Clinical Document Improve- ment(CDI) and Computer Assisted Coding(CAC). We analyzed many conflicting instances such as the one identified above with the clinical domain and health in- formatics experts and recognized that contextual informa- tion(such as the medication taken by the patient and other conditions that the patient has) can be used to resolve these conflicts. For example, the absence of relevant medications for chest pain in the above example provides evidence that the patient does not have chest pain while the presence of such medications suggests the opposite. Motivated by these observations we try to imitate the thought process of domain experts to resolve the conflicts in the NLP output. This ap- proach requires a reasonable amount of domain knowledge in order to attempt to partially imitate the reasoning of a domain expert in resolving or avoiding the conflicts. In order to factor that in, we built a comprehensive knowl- edge base by mining EMR documents and by leveraging the open knowledge available in different formats. Our eval- uation demonstrates that the knowledge bases can play a crucial role in understanding EMRs by complementing NLP algorithms and overcoming some of their limitations. The rest of the paper is organized as follows, Section 2 discusses the problem in detail with real world examples. Section 3 describes the knowledge base in detail. The im- plementation is discussed in Section 4 and the evaluation is presented in Section 5. Section 6 concludes with a summary of our results and intended future work. 2. THE PROBLEM EMRs use complex natural language constructs that are easy to write and interpret by humans with common sense reasoning and working knowledge of the domain, but hard to process and make sense of automatically. That is, people have less problem understanding what is stated in the EMRs and can resolve apparent conflicts easily, but machines have a hard time understanding the stated facts especially in the presence of ambiguity. The following list contains linguis- tic and semantic components of the natural language state- ments that should be detected in order to understand the EMRs. • Negated statements - The statements that convey the absence of domain concepts. e.g., Patient has no angina, no orthopnea, PND, or lower extremity edema. • Conditional statements - The statements that convey possible future status. e.g., I advised him that if he experiences any chest pain, shortness of breath with exertion or dizziness or syncopal episodes to let us know and we can do appro- priate workup. • Uncertain statements - The statements that convey some doubt about the patient status. e.g., She is not sure if she is just depressed or not. • Statements about patient’s medical history. e.g., When the patient was last seen in the office, he was complaining of fairly significant chest discom- fort. • Statements about patient’s family. e.g., Also, several family members with high blood pressure. Current NLP engines cannot properly understand the above exemplified components of the sentences and associate them with relevant domain entities(i.e., no with angina, orthop- nea, PND and lower extremity edema in the first sentence). These shortcomings cause conflicts to be present in the out- put of NLP engines that are widely used for processing clin- ical notes. Here we will demonstrate a number of scenarios where current negation detection algorithms fail to understand the semantics of the sentences. As we will show with the exam- ples, although some of these shortcomings are due to pure NLP issues, there are complex situations where it needs a much more sophisticated solution that complements current NLP techniques. Example 1: The original sentence: “the patient denies any chest discom- fort, shortness of breath, orthopnea, paroxysmal nocturnal dyspnea, palpitations, or lower extremity edema.” According to the above sentence, the patient has none of the conditions mentioned, but NLP output states that the patient has palpitations and lower extremity edema. This is mainly because the negation detection algorithm is designed in a way that the effect of negation is propagated to prede- fined number of entities to the right and to the left side of negation indicating term(‘denies’). Example 2 and Example 3 illustrate two instances where the NLP algorithm incorrectly associates the negation indi- cation term to the domain entity. The first example misin- terprets the semantics, while the second example ultimately turns out to be correct. Example 2: The original sentence: “I do not have an explanation for this dyspnea.” The sentence conveys that doctor was unable to find an explanation for dyspnea, hence implicitly says the patient has dyspnea. But the NLP algorithm incorrectly associates ‘not’ with dyspnea. This example shows that the algorithm needs to have a deeper understanding of the words in order to derive implicit semantics. Example 3: The original sentence: “there was no evidence of ischemia.”
  • 3. The term “no” is associated with “evidence”. The current NLP algorithm incorrectly associates “no” with ischemia, but it turns out to interpret the semantics of this particular example properly because the doctor interprets the absence of evidence as absence of such condition(according to closed world reasoning). The first example showed that the NLP algorithm was not able to associate the negation indicator with the correct enti- ties, while second example shows that even if the algorithm associates the negation indicator with the proper term, it does not interpret the semantics of the sentence properly. Hence, it requires a more sophisticated solution to accom- pany NLP to understand the semantics of such sentences. Although the above examples illustrate only the com- plex scenarios for negation detection algorithms, similar in- stances exist for other aspects of the sentences as well(e.g., conditional and uncertainty). The failure to understand the semantics of the sentences leads to incorrect interpretations of patient status. It is hard to detect such misinterpreta- tions if there is only one mention of a particular condition in the EMR. But when there are multiple mentions of the same condition within the document and NLP misunder- stands one such instance, it leads to conflicting scenarios and NLP output reflects this conflict as illustrated earlier. We have found 620 such conflicts within 3172 EMR docu- ments parsed by cTAKES engine. The corresponding error for applications that consume the output of an NLP en- gine(e.g., CDI or CAC) would be considered unacceptable in most cases. Our objective is to resolve these conflicts created due to different reasons by using contextual infor- mation available in the EMR document. Specifically, we will demonstrate that these conflicts can be resolved with the help of domain specific knowledge bases. 3. THE KNOWLEDGE BASE We built a comprehensive knowledge base for the health- care domain by mining EMR documents and leveraging the open knowledge available in different formats. The knowl- edge base is expressed in the Resource Description Frame- work Schema(RDFS) language. The primary knowledge source for the knowledge base is the Unified Medical Language Sys- tem(UMLS)[2]. The UMLS consists of key domain concepts that we are interested in and their hierarchical relationships. We used the hierarchical relationships present in the UMLS for disorders, symptoms and findings. However, the medi- cation hierarchy is not explicitly present in UMLS. Instead, UMLS contains tradename of relationships which links dif- ferent brands of the same medication to its generic medica- tion. Hence, we used tradename of relationship in order to build the hierarchical relationships among the medications. Specifically, all the different brands of the same medication are added as sub-classes of the generic medication. Our approach requires knowledge about the relationships between disorders and symptoms (‘symptom A’ is symptom of ‘disorder B’) and the relationship between disorders/symptoms and medications(‘disorder B’ is treated with ‘medication C’) to resolve conflicts in NLP output4 . While UMLS pro- vides the required coverage in terms of domain concepts and their hierarchical relationships, it only contains a rela- tionship between disorders/symptoms and medications (i.e., 4 The experiments in this paper use only the relationships between disorder/symptom and medications Figure 1: A Snippet from Knowledge Base Concept Type Number of Concepts Condition(disorder or symp- tom) 125778 Medication 298993 Findings 172230 Procedures 262360 Table 1: Number of Concepts from each type of Domain Entities Relationship Type Number of Relation- ships is symptom of 8299 is treated with 41182 Table 2: Number of Relationships among Concepts
  • 4. is treated with). It lacks, for example, the knowledge about the associations of disorders with the symptoms. Hence, we developed a technique to mine the relationship between dis- orders and symptoms from EMRs[4]. Effectively, we added the is treated with relationship to our knowledge base from UMLS and the is symptom of relationship by mining EMR documents. Our knowledge base was further enriched by using two Web resources5 . We decided to add the knowl- edge in these two Web resources after conducting a careful assessment about the quality of stated facts. Table 1 and Table 2 contains the statistics about a recent version of our knowledge base, which is subject to grow further. Figure 1 shows a snippet from the knowledge base. 4. IMPLEMENTATION We choose cTAKES[5], an open source NLP engine under the Apache License, as our NLP engine to parse unstruc- tured EMRs. The input to cTAKES is an unstructured EMR document and it outputs an XML document where nodes in the XML document represent different semantic types. Figure 2 shows an XML element condition6 which describes a disorder found in the EMR. As shown in the ex- ample, cTAKES annotates each entity found in EMR with UMLS and SNOMED vocabularies via ‘code’ and ‘cui’ at- tributes and assigns values to the following attributes asso- ciated with each entity: • polarity - This attribute can take values 0 or -1. -1 indicates that the entity is negated and 0 indicates it is not negated. • uncertainty - This attribute can take four values7 . Value 0 indicates patient’s current status, value 1 indicates patient’s history, value 2 indicates the patient’s family history and value 3 indicates the uncertain status. • conditional - This attribute can take values either true or false. If the sentence discusses the entity in a condi- tional sense(e.g., ‘if pain persist, take tylenol’), it takes value true, else it takes value false. The annotations of Figure 2 are interpreted for the disor- der “atrial fibrillation” as indicating a non-conditional (con- ditional=“false”), the presence of disorder (polarity=“0”) right now(uncertainty=“0”). To identify the conflicts, we take the XML document as the input to our method and search for condition nodes with the same ‘cui’ value, and ‘0’ as the uncertainty value but with different polarity values. These instances are identified as conflicts, because they state that the patient has partic- ular condition and does not have it at the same time. We then collect the contextual information about that condi- tion to assist with the disambiguation. Currently we col- lect only related medications. Although EMR documents 5 The two Web resources are, 1. http://www.healthline.com 2. http://www.druglib.com 6 A conditional element can contain either disorder or symp- tom. 7 Although this attribute is supposed to indicate the uncer- tainty of the concept, only one value indicates about un- certainty. The other 3 values indicate whether the sentence discuss patient’s current condition or his/her medical history or his/her family history. Figure 2: XML element describes Atrial Fibrillation in cTAKES output contain a lot of unstructured text, it has a semi-structured section, which contains information about the medications taken by the patient in a bulleted list. Since this section is a bulleted list and does not contain sentences, NLP en- gines can identify medications and populate their attributes (polarity, uncertainty, and conditional values) with great ac- curacy. This motivates us to use medications as reliable contextual information. We use our knowledge base to asso- ciate and confirm the applicability of medication to treat a particular condition, to resolve the conflict. We leverage the hierarchical relationships that exist in the knowledge base to identify the related medications for the particular disor- der/symptom. For example, a particular patient might be taking zocor which is a trade name of simvastatin (zocor is a child of simvastatin in the knowledge base) and simavastatin is a type of statin. Statins are used to treat high blood choles- terol levels. Further, this relationship (high blood cholesterol level is treated with statin) is present in our knowledge base. Since we can infer that zocor is a statin by using the hierar- chical relationships, we can associate zocor with high blood cholesterol level. If we find such associations between med- ications in EMR and conflicted disorder/symptom, we can resolve the conflict by assigning ‘0’ to the polarity value, otherwise we assign ‘-1’ to the polarity value. 5. A USE CASE This section elaborates our approach with two real EMR documents. The first document illustrates how we resolve the conflict when a patient actually has the disorder, and the second document illustrates how we resolve the conflict when the patient does not have the disorder. The first scenario is created by shortcomings associated with negation detection in the NLP engine, while the second scenario is due to the failure of the NLP engine to understand that the sentence discusses the possibility of having a disorder/symptom and is uncertain. The selected EMR for the first illustration has a conflict on coronary artery disease. The patient has coronary artery disease and it is mentioned clearly in his diagnosis list. How- ever, the EMR also has the following sentence in an unstruc- tured portion that the NLP engine is unable to interpret correctly, and outputs ‘-1’ for the polarity of coronary artery disease. “Send for carotid duplex to rule out carotid artery steno- sis given his risk factors and underlying coronary artery dis- ease.” According to the semantics of the sentence, the patient has coronary artery disease and the term ‘rule out’ is applied to
  • 5. carotid artery stenosis, but NLP engine associates ‘rule out’ with coronary artery disease resulting in an incorrect value for the polarity attribute. Hence the generated XML docu- ment contains two condition elements one with polarity ‘0’ and the other with ‘-1’ and both have ‘0’ as their uncertainty value. This triggers our method to collect the context for coronary artery disease within the EMR. The following set of medications were found in the EMR and identified as treating the coronary artery disease with the help of the knowledge base. • Ramipril • Nitroglycerin • Enalapril • Zocor • Warfarin • Aspirin This set of medications helps us to confirm that the pa- tient is more likely to have coronary artery disease. The second EMR document has a conflict on lower ex- tremity edema. It contains the following two sentences. “Extremities : Warm and dry. No clubbing or cyanosis. No lower extremity edema.” “I have advised the patient on the side effect of potential lower extremity edema.” Clearly the first sentence states that the patient does not have lower extremity edema and the second sentence dis- cusses the possibility of lower extremity edema as a side effect. But the NLP engine does not understand the se- mantics of the second sentence and interprets it as patient currently has lower extremity edema. Further examination of the EMR document with the help of the knowledge base did not indicate any medication that may treat lower ex- tremity edema. This observation suggests that the patient is unlikely to have lower extremity edema as stated in the EMR. 6. EVALUATION We provide a preliminary evaluation of our method by using real world data sources to see if it holds the expected promise. Information presented here is within the constraints and guidelines of the IRB protocol involving the use of the limited data set. We selected 25 documents from our cor- pus for the evaluation which have conflicts in their gener- ated structured versions(XML documents). The XML doc- uments were generated by parsing EMR documents through cTAKES NLP engine. There were 32 conflicting instances within these 25 documents. Table 3 summarizes the results of conflict resolution using contextual information. The accuracy of the experiment is defined as, accuracy = TP + TN TP + TN + FP + FN where TP is true positives, TN is true negatives, FP is false positives and FN is false negatives. The accuracy for this experiment was 71.87%. We will discuss three important observations we can make from our results. Predicted Class Positive Negative Actual Class Positive 18 6 Negative 3 5 Table 3: Results of Conflict Resolution. Label ‘pos- itive’ means patient has particular condition and ‘negative’ means patient does not have particular condition. (1) False negatives are for common symptoms: The 4 out of 6 instances categorized incorrectly as nega- tive(false negative) are instances where our approach tries to resolve conflicts involving common symptoms. Within those 4 instances, headache and obesity appear once each and shortness of breath appears twice. Doctors may not al- ways prescribe medications to cure these kinds of symptoms as they are very generic in nature and can co-occur with many diseases. As such, doctors prefer to rather cure the condition associated with these symptoms using very spe- cific medications addressing the ultimate cause. Hence our method was not able to categorize these instances as belong- ing to the positive class even though the patient has these symptoms. (2) False positives are based on common medica- tions: We noticed that aspirin is the only medication that ap- peared in collected contextual information for all 3 instances incorrectly categorized as positive(false positives). So our method categorizes these negative instances as positive based on the evidence that the patient takes aspirin. Aspirin is very a generic medication and it can provide minimal con- clusive information about the patient’s conditions. This ob- servation illustrates the need for ranking of the domain rela- tionships based on specificity, i.e., the relationship between metoprolol and hypertension is more specific than the rela- tionship between aspirin and hypertension, because if the patient takes metoprolol, it provides strong evidence that the patient is a hypertensive patient than a patient who takes aspirin even though both medications are prescribed to hypertensive patients. This type of ranking will help to ignore evidences which provide minimal conclusive informa- tion about the patient’s condition and help to improve the results by eliminating false positives. (3) Conflicts over major disorders can be resolved with good accuracy: Our approach has been able to resolve conflicts over major disorders with good accuracy. There were 17 conflicting in- stances within the total of 32 on major disorders and the following list contains the frequency of each disorder. • 5 instances of coronary artery disease. • 4 instances of atrial fibrillation. • 2 instances of peripheral vascular disease. • 1 instance each from ischemia, cardiomyopathies, coro- nary heart disease, arthritis, ischemic cardiomyopathy, and aortic valve stenosis. We were able to resolve 15 out of 17 conflicts correctly. Since these are major conditions and patients will have to
  • 6. face more serious problems if they do not treat these condi- tions well, they will be taking medications to cure or control the condition. This fact is reflected in the EMRs and helps us in resolving conflicts. 7. CONCLUSION AND FUTURE WORK This paper demonstrates shortcomings of the state-of-the- art NLP algorithms used to understand unstructured EMRs and shows the need for complementary techniques for accu- rate interpretation of documents. As shown in the examples, there are instances which require much deeper understand- ing of the words and some common sense reasoning for bet- ter interpretation. Also, we proposed a method to resolve conflicting instances created due to shortcomings of NLP engines for clinical data processing with the help of com- prehensive knowledge base. Our evaluation shows that this method can resolve the conflicting instances over major dis- orders with great accuracy and provide insights to improve the approach. While we have demonstrated results of the proposed method in conflict resolution, we believe the knowledge-based ap- proach can be used to detect and address many parsing is- sues in NLP algorithms. For example, although NLP al- gorithms detect that a patient does not have a particular disorder/symptom(not a conflicting instance), if the contex- tual information seems to strongly suggest otherwise, this can trigger a need for better analysis of the original sen- tences. As the next step, we will work on ranking the relation- ships based on their specificity. We believe it can be done by analyzing the number of relationships each concept par- ticipates in and the nature of the participating concepts. A more comprehensive evaluation will also be shared in near future. 8. REFERENCES [1] Alan R Aronson. Metamap: Mapping text to the umls metathesaurus. Bethesda, MD: NLM, NIH, DHHS, 2006. [2] Olivier Bodenreider. The unified medical language system (umls): integrating biomedical terminology. Nucleic acids research, 32(suppl 1):D267–D270, 2004. [3] Carol Friedman, Philip O Alderson, John HM Austin, James J Cimino, and Stephen B Johnson. A general natural-language text processor for clinical radiology. Journal of the American Medical Informatics Association, 1(2):161–174, 1994. [4] Sujan Perera, Cory Henson, Krishnaprasad Thirunarayan, Amit Sheth, and Suhas Nair. Data driven knowledge acquisition method for domain knowledge enrichment in the healthcare. In IEEE International Conference on Bioinformatics and Biomedicine, pages 1–8, October 2012. [5] Guergana K Savova, James J Masanz, Philip V Ogren, Jiaping Zheng, Sunghwan Sohn, Karin C Kipper-Schuler, and Christopher G Chute. Mayo clinical text analysis and knowledge extraction system (ctakes): architecture, component evaluation and applications. Journal of the American Medical Informatics Association, 17(5):507–513, 2010.