Automatic ICD-10 Code Assignment to Consultations

Automatic ICD-10 Code
Assignment to Tele-Consultations
Nirav Kumar, Director of Data Science
Joinal Ahmed, Data Scientist - II

1. Introduction to Halodoc.
2. Machine Learning @ Halodoc.
3. Insurance Claim Adjudication
a. How ICD-10 is useful in claim processing?
b. ICD-10 code and anatomy.
c. How AI can solve this problem?
4. Building a Solution
a. Gathering the Data
b. Leveraging Deep Learning for Multi Label multi class
classification.
Agenda

The #1 Digital
Healthcare Hub
We are the #1 health-tech
company in Indonesia that
provides services to meet user’s
various Healthcare needs.
1M
IOS
7,6M
Android
Install
18M
Active Users
38M
Active Viewers

Our Services
Chat with Doctor
Online consultation with trusted certified
doctors, anytime and anywhere
20,000+ GPs & Specialists
Digital prescriptions
Health Store
Buy medicine, supplements, and healthcare
needs delivered straight to your door.
Partnered with 8,000+ pharmacies
Visit Hospital
Skip the queues at hospitals with online
Registration service to your preferred hospital
Partnered with 2,500+ Network Hospitals
Lab Test
Lab services at the comfort of your own
home (jabodetabek)
Partnered with PRODIA
Indonesia’s biggest lab provider
Link Insurance
Seamless and cashless transaction,
instant deduction from insurance plan
Working together with 15 top insurance providers
Articles
Reliable and trusted articles covering
a wide range of topic
Created and/or reviewed by doctors

Nationwide Access to Healthcare with Halodoc
Expanded access to online consultation with general practitioners and specialists throughout Indonesia.
With Halodoc

Halodoc
Committed in making an impact

7
Machine Learning @ Halodoc
● Accessing Quality of Care
● Recommendation Systems
● Order Fulfillment
● Identifying User Cohorts
● Personalization
● Insurance Adjudication
● OCR
● Filtering Obscenity
● Upselling / Cross Selling

8
Why ICD-10 for Claim Adjudication?
● Traditional Claim Adjudication process is time consuming.
○ Turnaround time of 24+ hours
○ Needs a manual coder to go through the claim and assign
code
● ICD-10 code can describe the ailment accurately
○ can be used up understand the patients needs
○ build a health profile*
○ can be used to give a personalised experience

What is ICD-10 code ?
International Classification of Diseases(ICD) - 10, is a standardized way of indicating diagnoses and procedures that
were performed during the consultation. ICD-10 codes have a variety of uses, ranging from Billing, Insurance claims to
predictive modeling of patient’s conditions with over 140,000 codes combined in the ICD-10-CM and ICD-10-PCS
taxonomies.

Proposed Solution : Leveraging AI to tag consultations
12

Gathering Data at Scale
13
● Consultations are labeled with
diagnostic codes by doctors and
analysts.
● Raw data from RDS is dumped to S3
buckets and later processed and
populated in Data Lake using Apache
Spark and Hudi on Amazon EMR.

Model Architecture: Deep Neural Nets with Attention
Embedding BiLSTM Label Attention Layer Output Layer

Embedding Generation - Leveraging Word2Vec

Embedding Model : CBOW Word2Vec
We utilize CBOW Word2Vec architecture for creating word embeddings that uses 𝑛 future words as well as 𝑛 past
words to create a word embedding.
The objective function for CBOW is:
Using Text data from doctor notes and ICD-10 descriptions in bahasa, the Word2Vec
model is trained with embedding size=256.
This Word2Vec model is used to generate text embeddings for input doctor notes
while Training the model and making predictions while inference

BiLSTM Layer
We use a BiLSTM architecture to capture contextual information across input words in D. In particular,
we use the BiLSTM to learn latent feature vectors representing input words from a sequence ew1 :wn of
vectors ew1 , ew2 , ..., ewn . We compute the hidden states of the LSTMs corresponding to the ith word (i ∈
{1,...,n}) as:
The dimensionality of the LSTM hidden states is set to u, resulting in the
size of the latent vectors hi at 2u. All the hidden state vectors of words in
D are concatenated to formulate a matrix H = [h1, h2, ..., hn] ∈ R2u×n.

Label Attention Layer
As the clinical documents have different lengths and each document has multi-labels, our goal
is to transform H (input from embedding layer) into label-specific vectors. We achieve that goal
by proposing a label attention mechanism. Our label attention mechanism takes H as the input
and output |L| label-specific vectors representing the input doctor note D .

Label Classification Layer
Given the vector document representation vℓ, we compute a probability for label ℓ using another linear layer and a sigmoid
transformation:
where 𝜷ℓ s a vector of prediction weights, and 𝑏ℓ is a scalar offset.

Training
The training procedure minimizes the binary cross-entropy loss,
plus the L2 norm of the model weights, using the adam optimizer.

Results
accuracy precision recall
Training 0.93 0.93 0.87
Validation 0.96 0.96 0.93

22
Join Us
Scalability, reliability and maintainability are the three pillars that govern what we build at Halodoc
Tech. We are always looking out for top engineering talent across all roles for our tech team, and if
solving hard problems with challenging requirements is your forte, please reach out to us with your
resumé at careers.india@halodoc.com.
Read more about our work at blogs.halodoc.io

5th Floor Shubha Ram Complex, 144, Mahatma Gandhi Rd, Craig Park Layout, Ashok Nagar,
Bangalore, Karnataka 560001

Feedback
Your feedback is important to us.
Don’t forget to rate and review the sessions.

Appendix
Hierarchical Structure

26
For each input document D, the model firstly produces the prediction for the first level of the ICD codes’ first three
characters (i.e. normalized codes). The predicted output of the first level “normalization” is embedded into a
vector with the projection size p. The vector is then concatenated with each label-specific vector of the second
level of the “raw” ICD codes before being fed into the feed-forward network to produce the final prediction. The
model is trained by minimizing the sum of the binary cross-entropy losses of the “normalization” and “raw” levels.

Automatic ICD-10 Code Assignment to Consultations

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à Automatic ICD-10 Code Assignment to Consultations

Similaire à Automatic ICD-10 Code Assignment to Consultations (20)

Plus de Databricks

Plus de Databricks (20)

Dernier

Dernier (20)

Automatic ICD-10 Code Assignment to Consultations