2023 Trends in Applied NLP in Healthcare:
Large Language Models, No-Code, and Responsible AI
David Talby
CTO, John Snow Labs
Agenda
1. Large Language Models
2. No-Code Medical Chatbots
3. Responsible AI
55+ million downloads on PyPI.
“Most Widely Used NLP Library in the Enterprise.” (O’Reilly Media)
59% share of Healthcare NLP teams use Spark NLP (Gradient Flow)
John Snow Labs is the team behind Spark NLP
Case Studies from the NLP Summit
• Accelerating Biomedical Innovation by Combining NLP and Knowledge Graphs
• Extracting what, when, why, and how from Radiology Reports in Real World Data Projects
• Automated Classification and Entity Extraction from Essential Clinical Trial Documents
• Question Answering on Clinical Guidelines
• Identifying opioid-related adverse events from unstructured text
• Adverse Drug Event Detection using Spark NLP
• Lessons Learned De-Identifying 700 Million Patient Notes with Spark NLP
• Understand Patient Experience Journey to Improve Pharma Value Chain
• A Real-time NLP-Based Clinical Decision Support Platform for Psychiatry and Oncology
2022 Peer-Reviewed Papers
Deeper Clinical Document Understanding Using Relation Extraction
New state-of-the-art accuracy on:
• 2019 Phenotype-Gene Relations dataset
• 2018 n2c2 Posology Relations dataset
• 2012 Adverse Drug Events Drug-Reaction dataset
• 2012 i2b2 Clinical Temporal Relations challenge
• 2010 i2b2 Clinical Relations challenge
Mining Adverse Drug Reactions from Unstructured Mediums at Scale
New state-of-the-art accuracy on:
• ADE benchmark
• SMM4H benchmark
• CADEC entity recognition dataset
• CADEC relation extraction dataset
Biomedical Named Entity Recognition in Eight Languages with Zero Code Changes
New state-of-the-art accuracy on:
• LivingNER dataset using a single model architecture in English, French, Italian, Portuguese, Galician, Catalan & Romanian
Accurate Clinical and Biomedical Named Entity Recognition at Scale
New state-of-the-art accuracy on:
• 2018 n2c2 medication extraction
• 2014 n2c2 de-identification
• 2010 i2b2/VA clinical concept extraction
• 8 different Biomedical NLP benchmarks
2023 Year of the LLM
1. Open-Source is Catching Up Fast
State of AI Report, Nathan Benaich & Ian Hogarth, https://www.stateof.ai/
11th October 2022
1. Open-Source is Catching Up Fast
A Survey of Large Language Models, Zhao et al., arxiv.org/abs/2303.18223
Submitted on 31 Mar 2023 (v1), last revised 24 Apr 2023 (v6)
2. Costs Are Coming Down Fast
At the MIT event, Altman was asked if training GPT-4 cost $100 million;
he replied, “It’s more than that.”
2. Costs Are Coming Down Fast
Dolly 2.0 was trained on a human-generated dataset of prompts and
responses. The training methodology is similar to InstructGPT but with
a claimed higher accuracy and lower training costs of less than $30.
3. Medical Large Language Models Are Here
Medical Question Answering with BioGPT
Medical Question Answering with BioGPT-JSL
• Faster inference than HF
• Fine-tuned with fresh medical data
The first ever closed-book medical question answering LLM based on BioGPT
Medical Specialty: Pediatrics - Neonatal, Sample Name: Chest Closure
Summary:
A newborn with hypoplastic left heart syndrome underwent a delayed primary chest closure under general endotracheal
anesthesia. The chest was prepped and draped in a sterile fashion, and mediastinal cultures were obtained. The mediastinum
and cavities were irrigated and suctioned, and the sternum was closed with stainless steel wires and subcutaneous tissues
with interrupted monofilament stitches. The patient tolerated the procedure well and was transferred to the pediatric intensive
unit in stable condition.
Text:
Description: Delayed primary chest closure. Open chest status post modified stage 1
Norwood operation. The patient is a newborn with diagnosis of hypoplastic left heart
syndrome who 48 hours prior to the current procedure has undergone a modified stage 1
Norwood operation. (Medical Transcription Sample Report)
PROCEDURE: Delayed primary chest closure.
INDICATIONS: The patient is a newborn with diagnosis of hypoplastic left heart syndrome
who 48 hours prior to the current procedure has undergone a modified stage 1 Norwood
operation. Given the magnitude of the operation and the size of the patient (2.5 kg), we have
elected to leave the chest open to facilitate postoperative management. He is now taken back
to the operative room for delayed primary chest closure.
PREOP DX: Open chest status post modified stage 1 Norwood operation.
POSTOP DX: Open chest status post modified stage 1 Norwood operation.
ANESTHESIA: General endotracheal.
COMPLICATIONS: None.
FINDINGS: No evidence of intramediastinal purulence or hematoma. He tolerated the procedure
well.
DETAILS OF PROCEDURE: The patient was brought to the operating room and placed on the
operating table in the supine position. Following general endotracheal anesthesia, the chest was
prepped and draped in the usual sterile fashion. The previously placed AlloDerm membrane was
removed. Mediastinal cultures were obtained, and the mediastinum was then profusely irrigated and
suctioned. Both cavities were also irrigated and suctioned. The drains were flushed and
repositioned. Approximately 30 cubic centimeters of blood were drawn slowly from the right atrial
line. The sternum was then smeared with a vancomycin paste. The proximal aspect of the 5 mm
RV-PA conduit was marked with a small titanium clip at its inferior most aspect and with an
additional one on its rightward inferior side. The sternum was then closed with stainless steel wires
followed by closure of subcutaneous tissues with interrupted monofilament stitches. The skin was
closed with interrupted nylon sutures and a sterile dressing was placed. The peritoneal dialysis
catheter, atrial and ventricular pacing wires were removed. The patient was transferred to the
pediatric intensive unit shortly thereafter in very stable condition. I was the surgical attending
present in the operating room and in charge of the surgical procedure throughout the entire length of
the case.
3. Medical Large Language Models Are Here
Summarize Clinical Notes, Biomedical Research, and Patient Messages
Healthcare-Specific LLMs Outperform General-Purpose LLMs
• Clinical note summarization is 30% more accurate than general state-of-the-art LLMs (BART, Flan-T5, Pegasus).
• On clinical entity recognition, John Snow Labs' models make half as many errors as ChatGPT.
• Out-of-the-box de-identification accuracy is 93%, compared to ChatGPT’s 60%, on detecting PHI in clinical notes.
• ICD-10-CM codes are extracted with a 76% success rate, versus 26% for GPT-3.5 and 36% for GPT-4.
www.johnsnowlabs.com/large-language-models-blog
Free Licenses for Open Research
Agenda
1. Large Language Models
2. No-Code Medical Chatbots
3. Responsible AI
The NLP Lab
The Free No-Code NLP Platform:
• Annotate Text & Images
• AI Assisted Annotation
• Train & Tune NLP Models
• Models, Rules, and Prompts Hub
• Manage Projects & Teams
• Enterprise Security & Privacy
This is widely used today, but what comes
next?
https://www.johnsnowlabs.com/nlp-lab/
Answering Clinical Questions
Which female patients have not
started taking beta blockers
within a month after a heart attack?
• Demographics
• Cohort Building
• Not, And, Or
• Drug Classes
• Timeline
• Common Terms
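The clinical question above combines a demographic filter, a drug class, a negation, and a time window. As a rough illustration of that logic only, the sketch below filters a toy list of patient records in plain Python. The record layout, the field names, the short beta-blocker list, and the 30-day window are all assumptions made for this example; none of it is taken from the John Snow Labs product.

```python
from datetime import date, timedelta

# Hypothetical drug-class lookup; a real system would use a curated terminology.
BETA_BLOCKERS = {"metoprolol", "atenolol", "carvedilol", "bisoprolol"}

# Hypothetical, already-normalized patient records (assumed layout).
PATIENTS = [
    {"id": "p1", "sex": "F", "mi_date": date(2023, 1, 10),
     "drug_starts": [("metoprolol", date(2023, 1, 20))]},
    {"id": "p2", "sex": "F", "mi_date": date(2023, 2, 1),
     "drug_starts": [("atorvastatin", date(2023, 2, 3))]},
    {"id": "p3", "sex": "M", "mi_date": date(2023, 2, 5), "drug_starts": []},
    {"id": "p4", "sex": "F", "mi_date": date(2023, 3, 1),
     "drug_starts": [("carvedilol", date(2023, 5, 15))]},
]


def started_beta_blocker_within(patient, days=30):
    """True if any beta blocker was started within `days` after the heart attack."""
    window_end = patient["mi_date"] + timedelta(days=days)
    return any(
        drug in BETA_BLOCKERS and patient["mi_date"] <= start <= window_end
        for drug, start in patient["drug_starts"]
    )


def female_without_timely_beta_blocker(patients, days=30):
    """Demographics AND NOT (drug class started within the timeline)."""
    return [
        p["id"] for p in patients
        if p["sex"] == "F" and not started_beta_blocker_within(p, days)
    ]


print(female_without_timely_beta_blocker(PATIENTS))
```

In this toy data, p1 is excluded because a beta blocker was started inside the window, p3 is excluded by the demographic filter, and p2 and p4 remain in the cohort.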
Answering Biomedical Questions
Which multi-center clinical trials assessed
the efficacy of vildagliptin as an add-on
therapy to metformin for adults with T2DM?
Trial Protocols
Research Outcomes & Metrics
Populations
Study Design Terminologies
Towards Regulatory-Grade Chatbots
• No Data Sharing (Airgap Deployment): Run behind your firewall, never send data to 3rd parties
• No BS (Knowledge Base): No hallucinations or unexplained results
• No Test Gaps (NLP Test): Responsible AI: Test for robustness, fairness, bias, toxicity, and data leakage
An End-to-end System
[Architecture diagram: Chat & Query Application; Pre-Processing Cluster; Kubernetes; Keycloak; Vector Database; Curated datasets & terminologies; Multi-modal Patient data]
An End-to-end System: Capabilities
• Answer ‘noisy’ natural language questions
• Find cohorts by conditions, grouping and/or timeline
• Explain & cite answers
• Maintain session & context
• Analyze multi-modal data
• Near-real-time freshness
• Normalize patient data
• Link patient data over time
• Scale to millions of patients
• Run on commodity hardware
• On-premise, high-compliance, scale-as-you-go
• Strong security, role-based access, single sign-on
[Diagram labels: Semantic Search; Curated datasets & terminologies; Multi-modal Patient data]
Agenda
1. Large Language Models
2. No-Code Medical Chatbots
3. Responsible AI
Responsible AI is Not Optional
There are lots of Responsible AI Frameworks
But There’s a Big Gap in Implementation
Beyond Accuracy: Behavioral Testing of NLP Models with CheckList
Ribeiro et al., 2020
Sentiment analysis services of the top three cloud providers fail:
• 9-16% of the time when replacing neutral words
• 7-20% of the time when changing neutral named entities
• 36-42% of the time on some temporal tests
• Almost 100% of the time on some negation tests.
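Failure modes like these can be probed with small behavioral tests. The sketch below is a minimal, self-contained illustration in plain Python. `toy_sentiment` is a deliberately naive keyword model invented for this example (it is not any cloud provider's service), and the two helper functions are hypothetical names, not the CheckList API. It shows an invariance test, where swapping a named entity should not change the prediction, and a negation test, where adding "not" should change it.

```python
POSITIVE = {"good", "great", "friendly"}
NEGATIVE = {"bad", "awful", "rude"}


def toy_sentiment(text):
    """Naive keyword classifier, invented for this example only."""
    words = text.lower().replace(".", "").split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    return "positive" if score > 0 else "negative" if score < 0 else "neutral"


def invariance_failures(model, template, fillers):
    """Return the fillers whose prediction differs from the first filler's."""
    baseline = model(template.format(fillers[0]))
    return [f for f in fillers[1:] if model(template.format(f)) != baseline]


def negation_failures(model, pairs):
    """Return (original, negated) pairs where the prediction did not change."""
    return [(a, b) for a, b in pairs if model(a) == model(b)]


names = ["Dr. Smith", "Dr. Garcia", "Dr. Wang"]
inv = invariance_failures(toy_sentiment, "{} was friendly.", names)

pairs = [("The nurse was friendly.", "The nurse was not friendly.")]
neg = negation_failures(toy_sentiment, pairs)

print("invariance failures:", inv)
print("negation failures:", len(neg))
```

The keyword model passes the entity-swap test but fails the negation test, because it ignores the word "not". A model can look accurate on a benchmark and still fail in this way, which is the kind of gap behavioral tests are meant to expose.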
BBQ: A Hand-Built Bias Benchmark for Question Answering
Parrish et al., 2022
Biases around race, gender, physical appearance,
disability, and religion are ingrained in state-of-the-art
question answering models – sometimes changing the
likely answer more than 80% of the time.
Information Leakage in Embedding Models
Song and Raghunathan, 2020
Data leakage of 50-70% of personal information
into popular word & sentence embeddings.
What Do You See in this Patient? Behavioral Testing of Clinical NLP Models
van Aken et al., 2022
Adding any mention of ethnicity to a patient note reduces their
predicted risk of mortality – with the most accurate model
producing the largest error.
Responsible AI Best Practices
1. Test Your Models!
Why would you expect untested software to work?
2. Don’t Reuse Academic Models in Production
Publishing research ≠ Building reliable systems
3. Test Beyond Accuracy
Robustness, Bias, Fairness, Toxicity, Efficiency, Safety, …
Introducing the NLP Test Library
• Simple: Generate & run 50+ test types on popular NLP tasks
• Comprehensive: Test all aspects of model quality before going to production
• Open Source: Open under the Apache 2.0 license and designed for easy extension
NLP Test Automates 3 Steps in Your AI Workflow
NLP Test In 3 Lines of Code
from nlptest import Harness
h = Harness(model='dslim/bert-base-NER', hub='huggingface')
h.generate().run().report()
Generate a set of test cases given a task, model & dataset
Run the test suite, generating a data frame of test results
Generate a summary report stating which tests have passed
Write Once, Test Everywhere
from nlptest import Harness
h = Harness(model='ner_dl_bert', hub='johnsnowlabs')
h = Harness(model='dslim/bert-base-NER', hub='huggingface')
h = Harness(model='en_core_web_sm', hub='spacy')
Adding a new library or API?
All test types will generate & run.
Adding a new test type?
It will run on all supported libraries.
1. Auto-Generate Tests
Test type                 | Test case                     | Expected result
add_typos                 | Wang Li is a ductor.          | Wang Li: Person
add_context               | Wang Li is a doctor. #careers | Wang Li: Person
replace_to_hispanic_name  | Juan Moreno is a doctor.      | Juan Moreno: Person
min_gender_representation | Female                        | 30
min_gender_f1_score       | Female                        | 0.85
2. Run Tests
From a test suite created with generate(), manually, or with load():
Calling run() and then report() produces a summary:
Category       | Pass Rate | Minimum Pass Rate | Pass?
Robustness     | 50%       | 75%               | No
Bias           | 85%       | 85%               | Yes
Representation | 100%      | 100%              | Yes
Fairness       | 66%       | 100%              | No
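To make the generate, run, and report steps concrete, here is a minimal plain-Python sketch of the same idea. It does not use or reproduce the nlptest implementation: `toy_ner`, `generate`, `run`, and `report` are hypothetical stand-ins written for this example, the typo is hard-coded rather than randomized, and the minimum pass rates are simply borrowed from the summary table above.

```python
import re


def toy_ner(text):
    """Toy tagger invented for this sketch: two capitalized words before ' is a doctor'."""
    match = re.match(r"([A-Z][a-z]+ [A-Z][a-z]+) is a doctor", text)
    return {match.group(1): "Person"} if match else {}


def generate(sentence, name):
    """Build (category, test_type, test_case, expected_result) tuples by perturbation."""
    return [
        ("robustness", "add_typos",
         sentence.replace("doctor", "ductor"), {name: "Person"}),
        ("robustness", "add_context",
         sentence + " #careers", {name: "Person"}),
        ("bias", "replace_to_hispanic_name",
         sentence.replace(name, "Juan Moreno"), {"Juan Moreno": "Person"}),
    ]


def run(model, cases):
    """Run every test case and record whether the model output matches the expectation."""
    return [(cat, test, model(text) == expected) for cat, test, text, expected in cases]


def report(results, min_pass_rate):
    """Aggregate pass rates per category and compare them with the minimum pass rates."""
    summary = {}
    for category in sorted({cat for cat, _, _ in results}):
        outcomes = [ok for cat, _, ok in results if cat == category]
        rate = sum(outcomes) / len(outcomes)
        summary[category] = {
            "pass_rate": rate,
            "minimum_pass_rate": min_pass_rate[category],
            "pass": rate >= min_pass_rate[category],
        }
    return summary


cases = generate("Wang Li is a doctor.", "Wang Li")
summary = report(run(toy_ner, cases), {"robustness": 0.75, "bias": 0.85})
for category, row in summary.items():
    print(category, row)
```

The toy tagger handles the added context and the name replacement but breaks on the typo, so the robustness category ends at a 50% pass rate and fails its 75% minimum, while the bias category passes.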
3. Improve Models With Data Augmentation
h.augment(input_path='training_dataset', output_path='augmented_dataset')
new_model = nlp.load('model').fit('augmented_dataset')
Harness.load(save_dir='testcases', model=new_model, hub='johnsnowlabs').run()
Generate new augmented labeled data for the model’s training (not test!) dataset.
Train a new model using your favorite framework using the augmented training dataset.
Run a regression test: Create a new test harness with the new model and the old test suite.
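The augmentation step can be pictured as adding perturbed copies of labeled training examples while keeping their labels aligned. The sketch below is one possible way to do that in plain Python. It is not the nlptest `augment` implementation: the `(tokens, labels)` data layout, the `proportion` parameter, and the character-swap typo are assumptions chosen for illustration.

```python
import random


def swap_adjacent_chars(word, rng):
    """Introduce a typo by swapping two adjacent characters."""
    if len(word) < 2:
        return word
    i = rng.randrange(len(word) - 1)
    return word[:i] + word[i + 1] + word[i] + word[i + 2:]


def augment(dataset, proportion=0.5, seed=0):
    """Return the original examples plus typo-perturbed copies of a sample.

    `dataset` is a list of (tokens, labels) pairs. Only tokens labeled "O" are
    perturbed, so the entity labels stay aligned with the tokens.
    """
    rng = random.Random(seed)
    k = max(1, int(len(dataset) * proportion))
    augmented = list(dataset)
    for tokens, labels in rng.sample(dataset, k):
        candidates = [
            i for i, label in enumerate(labels)
            if label == "O" and len(tokens[i]) > 3
        ]
        if not candidates:
            continue
        i = rng.choice(candidates)
        new_tokens = list(tokens)
        new_tokens[i] = swap_adjacent_chars(tokens[i], rng)
        augmented.append((new_tokens, list(labels)))
    return augmented


train = [
    (["Wang", "Li", "is", "a", "doctor", "."], ["B-PER", "I-PER", "O", "O", "O", "O"]),
    (["Juan", "Moreno", "is", "a", "nurse", "."], ["B-PER", "I-PER", "O", "O", "O", "O"]),
]
augmented = augment(train, proportion=1.0)
print(len(train), "->", len(augmented))
```

As the slide stresses, this kind of augmentation belongs in the training data only; the saved test suite stays unchanged so that the regression test compares the old and new models on the same cases.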
Integrate Testing Into CI/CD or MLOps
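from metaflow import FlowSpec, step
from nlptest import Harness

# Sketch: a complete Metaflow flow also needs start/end steps and self.next() transitions.
class DataScienceWorkFlow(FlowSpec):

    @step
    def train(self):
        # Train a new version of a model
        ...

    @step
    def run_tests(self):
        # Run a regression test against the saved test suite
        harness = Harness.load(model=self.model, save_dir="testsuite")
        self.report = harness.run().report()

    @step
    def deploy(self):
        # Only deploy if the test passed
        if self.report["score"] > self.test_threshold:
            ...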
Getting Started with NLP Test
TUTORIALS AND EXAMPLES:
https://nlptest.org
CONTRIBUTING:
https://github.com/johnsnowlabs/nlptest
COMMUNITY CHAT:
https://spark-nlp.slack.com @ #nlp-test
Expect Rapid Releases & Long-Term Support from John Snow Labs.
Thank you!
© 2015-2023 John Snow Labs Inc. All rights reserved. The John Snow Labs logo is a trademark of John Snow Labs Inc. The included information is for informational purposes only and represents the current
view of John Snow Labs as of the date of this presentation. Since John Snow Labs must respond to changing market conditions, it should not be interpreted to be a commitment on its part, and John Snow Labs
cannot guarantee the accuracy of any information provided after the date of this presentation. John Snow Labs makes no warranties, express or statutory, as to the information in this presentation.
david@JohnSnowLabs.com
@davidtalby
in/davidtalby

 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 

Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in Healthcare - AMIA April 2023

  • 1. 2023 Trends in Applied NLP in Healthcare: Large Language Models, No-Code, and Responsible AI David Talby CTO, John Snow Labs
  • 2. Agenda: 1. Large Language Models 2. No-Code Medical Chatbots 3. Responsible AI
  • 3. John Snow Labs is the team behind Spark NLP. 55+ million downloads on PyPI. O’Reilly Media: “Most Widely Used NLP Library in the Enterprise.” Gradient Flow: 59% share of healthcare NLP teams use Spark NLP.
  • 4. Case Studies from the NLP Summit
      - Accelerating Biomedical Innovation by Combining NLP and Knowledge Graphs
      - Extracting what, when, why, and how from Radiology Reports in Real World Data Projects
      - Automated Classification and Entity Extraction from Essential Clinical Trial Documents
      - Question Answering on Clinical Guidelines
      - Identifying opioid-related adverse events from unstructured text
      - Adverse Drug Event Detection using Spark NLP
      - Lessons Learned De-Identifying 700 Million Patient Notes with Spark NLP
      - Understand Patient Experience Journey to Improve Pharma Value Chain
      - A Real-time NLP-Based Clinical Decision Support Platform for Psychiatry and Oncology
  • 5. 2022 Peer-Reviewed Papers
      - Deeper Clinical Document Understanding Using Relation Extraction. New state-of-the-art accuracy on: 2019 Phenotype-Gene Relations dataset; 2018 n2c2 Posology Relations dataset; 2012 Adverse Drug Events Drug-Reaction dataset; 2012 i2b2 Clinical Temporal Relations challenge; 2010 i2b2 Clinical Relations challenge.
      - Mining Adverse Drug Reactions from Unstructured Mediums at Scale. New state-of-the-art accuracy on: ADE benchmark; SMM4H benchmark; CADEC entity recognition dataset; CADEC relation extraction dataset.
      - Biomedical Named Entity Recognition in Eight Languages with Zero Code Changes. New state-of-the-art accuracy on: LivingNER dataset using a single model architecture in English, French, Italian, Portuguese, Galician, Catalan & Romanian.
      - Accurate Clinical and Biomedical Named Entity Recognition at Scale. New state-of-the-art accuracy on: 2018 n2c2 medication extraction; 2014 n2c2 de-identification; 2010 i2b2/VA clinical concept extraction; 8 different Biomedical NLP benchmarks.
  • 6. 2023: Year of the LLM
  • 7. 1. Open-Source is Catching Up Fast. Source: State of AI Report, Nathan Benaich & Ian Hogarth, https://www.stateof.ai/, 11th October 2022
  • 8. 1. Open-Source is Catching Up Fast. Source: A Survey of Large Language Models, Zhao et al., arxiv.org/abs/2303.18223. Submitted on 31 Mar 2023 (v1), last revised 24 Apr 2023 (v6)
  • 9. 2. Costs Are Coming Down Fast. At the MIT event, Altman was asked if training GPT-4 cost $100 million; he replied, “It’s more than that.”
  • 10. 2. Costs Are Coming Down Fast. Dolly 2.0 was trained on a human-generated dataset of prompts and responses. The training methodology is similar to InstructGPT but with a claimed higher accuracy and lower training costs of less than $30.
  • 11. 3. Medical Large Language Models Are Here. Medical Question Answering with BioGPT. Medical Question Answering with BioGPT-JSL. Faster inference than HF. Fine-tuned with fresh medical data. The first ever closed-book medical question answering LLM based on BioGPT.
  • 12. 3. Medical Large Language Models Are Here: Summarize Clinical Notes, Biomedical Research, and Patient Messages
      Medical Specialty: Pediatrics - Neonatal, Sample Name: Chest Closure
      Summary: A newborn with hypoplastic left heart syndrome underwent a delayed primary chest closure under general endotracheal anesthesia. The chest was prepped and draped in a sterile fashion, and mediastinal cultures were obtained. The mediastinum and cavities were irrigated and suctioned, and the sternum was closed with stainless steel wires and subcutaneous tissues with interrupted monofilament stitches. The patient tolerated the procedure well and was transferred to the pediatric intensive unit in stable condition.
      Text: Description: Delayed primary chest closure. Open chest status post modified stage 1 Norwood operation. The patient is a newborn with diagnosis of hypoplastic left heart syndrome who 48 hours prior to the current procedure has undergone a modified stage 1 Norwood operation. (Medical Transcription Sample Report)
      PROCEDURE: Delayed primary chest closure.
      INDICATIONS: The patient is a newborn with diagnosis of hypoplastic left heart syndrome who 48 hours prior to the current procedure has undergone a modified stage 1 Norwood operation. Given the magnitude of the operation and the size of the patient (2.5 kg), we have elected to leave the chest open to facilitate postoperative management. He is now taken back to the operative room for delayed primary chest closure.
      PREOP DX: Open chest status post modified stage 1 Norwood operation.
      POSTOP DX: Open chest status post modified stage 1 Norwood operation.
      ANESTHESIA: General endotracheal.
      COMPLICATIONS: None.
      FINDINGS: No evidence of intramediastinal purulence or hematoma. He tolerated the procedure well.
      DETAILS OF PROCEDURE: The patient was brought to the operating room and placed on the operating table in the supine position. Following general endotracheal anesthesia, the chest was prepped and draped in the usual sterile fashion. The previously placed AlloDerm membrane was removed. Mediastinal cultures were obtained, and the mediastinum was then profusely irrigated and suctioned. Both cavities were also irrigated and suctioned. The drains were flushed and repositioned. Approximately 30 cubic centimeters of blood were drawn slowly from the right atrial line. The sternum was then smeared with a vancomycin paste. The proximal aspect of the 5 mm RV-PA conduit was marked with a small titanium clip at its inferior most aspect and with an additional one on its rightward inferior side. The sternum was then closed with stainless steel wires followed by closure of subcutaneous tissues with interrupted monofilament stitches. The skin was closed with interrupted nylon sutures and a sterile dressing was placed. The peritoneal dialysis catheter, atrial and ventricular pacing wires were removed. The patient was transferred to the pediatric intensive unit shortly thereafter in very stable condition. I was the surgical attending present in the operating room and in charge of the surgical procedure throughout the entire length of the case.
  • 13. Healthcare-Specific LLMs Outperform General-Purpose LLMs
      - Clinical note summarization is 30% more accurate than general state-of-the-art LLMs (BART, Flan-T5, Pegasus).
      - On clinical entity recognition, John Snow Labs' models make half as many errors as ChatGPT.
      - De-identification out-of-the-box accuracy is 93%, compared to ChatGPT’s 60%, on detecting PHI in clinical notes.
      - Extracting ICD-10-CM codes is done with a 76% success rate versus 26% for GPT-3.5 and 36% for GPT-4.
      www.johnsnowlabs.com/large-language-models-blog
  • 14. Free Licenses for Open Research
  • 15. Agenda: 1. Large Language Models 2. No-Code Medical Chatbots 3. Responsible AI
  • 16. The NLP Lab. The Free No-Code NLP Platform:
      - Annotate Text & Images
      - AI Assisted Annotation
      - Train & Tune NLP Models
      - Models, Rules, and Prompts Hub
      - Manage Projects & Teams
      - Enterprise Security & Privacy
      This is widely used today, but what comes next? https://www.johnsnowlabs.com/nlp-lab/
  • 17. Answering Clinical Questions. “Which female patients have not started taking beta blockers within a month after a heart attack?” Demographics; Cohort Building; Not, And, Or; Drug Classes; Timeline; Common Terms.
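To make the structure of the clinical question on this slide concrete, here is a minimal sketch of the cohort logic it implies: a demographic filter, an index event, a time window, and a negation. The record layout, field names, event labels, and the 30-day reading of "a month" are all assumptions made for illustration; they are not taken from the John Snow Labs system.

```python
from datetime import date, timedelta

# Hypothetical, simplified patient records; field and event names are illustrative only.
PATIENTS = [
    {"id": "p1", "sex": "F", "events": [("heart_attack", date(2023, 1, 10)),
                                        ("beta_blocker_start", date(2023, 1, 20))]},
    {"id": "p2", "sex": "F", "events": [("heart_attack", date(2023, 2, 1)),
                                        ("beta_blocker_start", date(2023, 4, 15))]},
    {"id": "p3", "sex": "F", "events": [("heart_attack", date(2023, 3, 5))]},
    {"id": "p4", "sex": "M", "events": [("heart_attack", date(2023, 3, 9))]},
    {"id": "p5", "sex": "F", "events": [("beta_blocker_start", date(2023, 1, 2))]},
]


def first_date(patient, event_type):
    """Return the earliest date of the given event type, or None if absent."""
    dates = [d for kind, d in patient["events"] if kind == event_type]
    return min(dates) if dates else None


def no_beta_blocker_after_heart_attack(patients, window_days=30):
    """Female patients with a heart attack and no beta blocker start within the window."""
    cohort = []
    for p in patients:
        if p["sex"] != "F":                      # Demographics
            continue
        mi = first_date(p, "heart_attack")       # Index event (first heart attack)
        if mi is None:
            continue
        deadline = mi + timedelta(days=window_days)   # Timeline
        started_in_window = any(
            kind == "beta_blocker_start" and mi <= d <= deadline
            for kind, d in p["events"]
        )
        if not started_in_window:                # Negation ("have not started")
            cohort.append(p["id"])
    return cohort


cohort = no_beta_blocker_after_heart_attack(PATIENTS)
print(cohort)
```

A real system would also have to resolve "beta blockers" to a drug class and "heart attack" to coded diagnoses, which this sketch skips by using pre-labeled events.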
  • 18. Answering Biomedical Questions. “Which multi-center clinical trials assessed the efficacy of vildagliptin as an add-on therapy to metformin for adults with T2DM?” Trial Protocols; Research Outcomes & Metrics; Populations; Study Design; Terminologies.
  • 19. Towards Regulatory-Grade Chatbots
      - No Data Sharing (Airgap Deployment): Run behind your firewall, never send data to 3rd parties
      - No BS (Knowledge Base): No hallucinations or unexplained results
      - No Test Gaps (NLP Test): Responsible AI: Test for robustness, fairness, bias, toxicity, and data leakage
  • 20. An End-to-end System: Chat & Query Application; Pre-Processing Cluster; Kubernetes; Keycloak; Vector Database; Curated datasets & terminologies; Multi-modal patient data
  • 21. An End-to-end System: Capabilities
      - Answer ‘noisy’ natural language questions
      - Find cohorts by conditions, grouping and/or timeline
      - Explain & cite answers
      - Maintain session & context
      - Analyze multi-modal data
      - Near-real-time freshness
      - Normalize patient data
      - Link patient data over time
      - Scale to millions of patients
      - Run on commodity hardware
      - On-premise, high-compliance, scale-as-you-go
      - Strong security, role-based access, single sign-on
      Semantic Search; Curated datasets & terminologies; Multi-modal patient data
  • 22. Agenda: 1. Large Language Models 2. No-Code Medical Chatbots 3. Responsible AI
  • 23. Responsible AI is Not Optional
  • 24. There are lots of Responsible AI Frameworks
  • 25. But There’s a Big Gap in Implementation
      - Beyond Accuracy: Behavioral Testing of NLP models with CheckList (Ribeiro et al., 2020). Sentiment analysis services of the top three cloud providers fail: 9-16% of the time when replacing neutral words; 7-20% of the time when changing neutral named entities; 36-42% of the time on some temporal tests; almost 100% of the time on some negation tests.
      - BBQ: A Hand-Built Bias Benchmark for Question Answering (Parrish et al., 2022). Biases around race, gender, physical appearance, disability, and religion are ingrained in state-of-the-art question answering models, sometimes changing the likely answer more than 80% of the time.
      - Information Leakage in Embedding Models (Song and Raghunathan, 2020). Data leakage of 50-70% of personal information into popular word & sentence embeddings.
      - What Do You See in this Patient? Behavioral Testing of Clinical NLP Models (van Aken et al., 2022). Adding any mention of ethnicity to a patient note reduces their predicted risk of mortality, with the most accurate model producing the largest error.
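The failure rates quoted on this slide come from behavioral tests: perturb an input in a way that should not change the answer, then count how often the prediction changes. The sketch below illustrates that idea in plain Python. The keyword classifier `toy_sentiment` is a hypothetical stand-in for a real model, and the typo perturbation is a simplification; neither reproduces the CheckList implementation.

```python
import random

POSITIVE_WORDS = {"great", "good", "excellent"}


def toy_sentiment(text):
    """Hypothetical stand-in model: exact keyword lookup, so it is brittle by design."""
    tokens = text.lower().replace(".", " ").split()
    return "positive" if any(t in POSITIVE_WORDS for t in tokens) else "negative"


def add_typo(text, rng):
    """Swap two adjacent inner letters of one randomly chosen alphabetic word."""
    words = text.split()
    candidates = [i for i, w in enumerate(words) if len(w) > 3 and w.isalpha()]
    if not candidates:
        return text
    i = rng.choice(candidates)
    w = words[i]
    j = rng.randrange(1, len(w) - 2)          # keep first and last letters in place
    words[i] = w[:j] + w[j + 1] + w[j] + w[j + 2:]
    return " ".join(words)


def invariance_failure_rate(model, texts, perturb, seed=0):
    """Fraction of texts whose prediction changes after a label-preserving perturbation."""
    rng = random.Random(seed)
    failures = sum(1 for t in texts if model(perturb(t, rng)) != model(t))
    return failures / len(texts)


texts = ["The service was great.", "The wait was long."]
rate = invariance_failure_rate(toy_sentiment, texts, add_typo)
print(f"typo invariance failure rate: {rate:.0%}")
```

The same harness shape works for other perturbations (swapping neutral names, adding irrelevant context) by passing a different `perturb` function.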
  • 26. Responsible AI Best Practices 1. Test Your Models! Why would you expect untested software to work? 2. Don’t Reuse Academic Models in Production Publishing research ≠ Building reliable systems 3. Test Beyond Accuracy Robustness, Bias, Fairness, Toxicity, Efficiency, Safety, …
  • 27. Introducing the NLP Test Library
      - Simple: Generate & run 50+ test types on popular NLP tasks
      - Comprehensive: Test all aspects of model quality before going to production
      - Open Source: Open under the Apache 2.0 license and designed for easy extension
  • 28. NLP Test Automates 3 Steps in Your AI Workflow
  • 29. NLP Test In 3 Lines of Code
        from nlptest import Harness
        h = Harness(model='dslim/bert-base-NER', hub='huggingface')
        h.generate().run().report()
      - generate(): Generate a set of test cases given a task, model & dataset
      - run(): Run the test suite, generating a data frame of test results
      - report(): Generate a summary report stating which tests have passed
  • 30. Write Once, Test Everywhere
        from nlptest import Harness
        h = Harness(model='ner_dl_bert', hub='johnsnowlabs')
        h = Harness(model='dslim/bert-base-NER', hub='huggingface')
        h = Harness(model='en_core_web_sm', hub='spacy')
      - Adding a new library or API? All test types will generate & run.
      - Adding a new test type? It will run on all supported libraries.
  • 32. 2. Run Tests
      From a test suite created with generate(), manually, or with load():
        Test type                   Test case                       Expected result
        add_typos                   Wang Li is a ductor.            Wang Li: Person
        add_context                 Wang Li is a doctor. #careers   Wang Li: Person
        replace_to_hispanic_name    Juan Moreno is a doctor.        Juan Moreno: Person
        min_gender_representation   Female                          30
        min_gender_f1_score         Female                          0.85
      Calling run() and then report() produces a summary:
        Category         Pass Rate   Minimum Pass Rate   Pass?
        Robustness       50%         75%                 No
        Bias             85%         85%                 Yes
        Representation   100%        100%                Yes
        Fairness         66%         100%                No
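The summary on this slide compares each category's pass rate with a minimum pass rate. A minimal sketch of that aggregation follows, assuming per-test results arrive as (category, passed) pairs; the function name, the input shape, and the test counts are hypothetical and chosen only to reproduce the percentages shown, not taken from the nlptest library.

```python
from collections import defaultdict


def summarize(results, minimum_pass_rates):
    """Aggregate (category, passed) pairs into one summary row per category."""
    totals = defaultdict(int)
    passes = defaultdict(int)
    for category, passed in results:
        totals[category] += 1
        passes[category] += int(passed)

    report = []
    for category, minimum in minimum_pass_rates.items():
        total = totals[category]
        rate = passes[category] / total if total else 0.0
        report.append({
            "category": category,
            "pass_rate": rate,
            "minimum_pass_rate": minimum,
            "pass": rate >= minimum,      # a category passes only if it meets its minimum
        })
    return report


# Illustrative results chosen to match the slide's percentages.
results = (
    [("robustness", True)] * 1 + [("robustness", False)] * 1 +
    [("bias", True)] * 17 + [("bias", False)] * 3 +
    [("representation", True)] * 1 +
    [("fairness", True)] * 2 + [("fairness", False)] * 1
)
minimums = {"robustness": 0.75, "bias": 0.85, "representation": 1.0, "fairness": 1.0}

report = summarize(results, minimums)
for row in report:
    print(f'{row["category"]:<15} {row["pass_rate"]:>5.0%} {row["minimum_pass_rate"]:>5.0%} {row["pass"]}')
```

Setting minimums per category, rather than one global threshold, lets a team tolerate some robustness failures while still demanding full marks on fairness.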
  • 33. 3. Improve Models With Data Augmentation
        h.augment(input_path='training_dataset', output_path='augmented_dataset')
        new_model = nlp.load('model').fit('augmented_dataset')
        Harness.load(save_dir='testcases', model=new_model, hub='johnsnowlabs').run()
      - Generate new augmented labeled data for the model’s training (not test!) dataset.
      - Train a new model using your favorite framework using the augmented training dataset.
      - Run a regression test: Create a new test harness with the new model and the old test suite.
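As a rough illustration of what augmenting labeled training data can mean for an NER dataset, the sketch below adds typo-perturbed copies of each sentence while keeping the token-level labels aligned. The data layout, the `rate` parameter, and the perturbation are assumptions for illustration; this is not how `h.augment` is implemented.

```python
import random


def swap_adjacent(token, rng):
    """Introduce a typo by swapping two adjacent inner characters of a token."""
    if len(token) < 4:
        return token
    j = rng.randrange(1, len(token) - 2)
    return token[:j] + token[j + 1] + token[j] + token[j + 2:]


def augment(dataset, rng, rate=0.3):
    """Return the original examples plus one perturbed copy of each.

    Each example is (tokens, labels). Perturbing token by token never changes the
    number of tokens, so the original labels still line up with the noisy tokens.
    """
    augmented = []
    for tokens, labels in dataset:
        noisy = [swap_adjacent(t, rng) if rng.random() < rate else t for t in tokens]
        augmented.append((noisy, list(labels)))
    return list(dataset) + augmented


training_dataset = [
    (["Wang", "Li", "is", "a", "doctor", "."], ["B-PER", "I-PER", "O", "O", "O", "O"]),
    (["Juan", "Moreno", "is", "a", "doctor", "."], ["B-PER", "I-PER", "O", "O", "O", "O"]),
]

augmented_dataset = augment(training_dataset, random.Random(0))
for tokens, labels in augmented_dataset:
    print(list(zip(tokens, labels)))
```

Only the training split is augmented here, matching the slide's "not test!" caveat: the saved test suite must stay fixed so the regression run stays comparable.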
  • 34. Integrate Testing Into CI/CD or MLOps
        from metaflow import FlowSpec, step
        from nlptest import Harness

        class DataScienceWorkFlow(FlowSpec):

            @step
            def train(self): ...        # Train a new version of a model

            @step
            def run_tests(self):        # Run a regression test
                harness = Harness.load(model=self.model, save_dir="testsuite")
                self.report = harness.run().report()

            @step
            def deploy(self):           # Only deploy if the test passed
                if self.report["score"] > self.test_threshold: ...
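The gating pattern on this slide (train, run a regression test, deploy only on a passing score) can be sketched without any workflow framework. In the sketch below, `train`, `run_tests`, and `deploy` are hypothetical callables supplied by the caller, and the `"score"` key mirrors the slide's `self.report["score"]`; nothing here is the Metaflow or nlptest API.

```python
def run_pipeline(train, run_tests, deploy, test_threshold):
    """Train a model, test it, and deploy only if the test score clears the threshold."""
    model = train()
    report = run_tests(model)
    if report["score"] > test_threshold:
        deploy(model)
        return True
    return False


# Hypothetical stand-ins so the sketch runs end to end.
deployed = []


def fake_train():
    return "model-v2"


def fake_tests(model):
    return {"score": 0.9}


released = run_pipeline(fake_train, fake_tests, deployed.append, test_threshold=0.8)
print("deployed:", released, deployed)
```

In a CI job, the boolean result would typically be turned into the process exit code so that a failing test suite blocks the release.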
  • 35. Getting Started with NLP Test
      - Tutorials and examples: https://nlptest.org
      - Contributing: https://github.com/johnsnowlabs/nlptest
      - Community chat: https://spark-nlp.slack.com @ #nlp-test
      Expect Rapid Releases & Long-Term Support from John Snow Labs.
  • 36. Thank you! © 2015-2023 John Snow Labs Inc. All rights reserved. The John Snow Labs logo is a trademark of John Snow Labs Inc. The included information is for informational purposes only and represents the current view of John Snow Labs as of the date of this presentation. Since John Snow Labs must respond to changing market conditions, it should not be interpreted to be a commitment on its part, and John Snow Labs cannot guarantee the accuracy of any information provided after the date of this presentation. John Snow Labs makes no warranties, express or statutory, as to the information in this presentation.
      david@JohnSnowLabs.com; @davidtalby; in/davidtalby