SlideShare une entreprise Scribd logo
1  sur  7
In Research and Development
October 15th
Machine Learning
Why Machine Learning?
We want the machine to do something:
1. we can’t do
2. we prefer not to do
3. we can’t do at scale
2
Machine Learning Use Cases
3
Counting
Parasites
Classifying
Documents
Coding Doctors
Notes
Use Case 1 - Counting Parasites
4
The Problem
Counting the parasites for our animal
health products is labor-intensive,
tedious, cumbersome and error-prone
1. Evaluate the practical feasibility of endo-parasites (worms) counting from videos
2. Create a prototype of a model that will count worms from a video of a single
segment of a Petri dish with at least 90% precision
Our Goal
• Detected between 70% - 95% of parasites across different petri dish segments
• Additional work required to move and count the entire petri dish
• Did not detect dead worms
Initial Results
Use Case 2 – Classifying Clinical Documents
5
The Problem
Building the Trial Master File (TMF) is a
laborious task that requires a lot of
time and manpower. Mislabeled
documents may take time to locate
during auditing
Evaluate the feasibility of machine learning for classifying and labeling documents,
especially when documents are scanned and partially handwritten
Our Goal
• Accurately classified 90% of documents to their labels after applying OCR
• Additional work required to integrate the machine learning model with the product
we use for TMF
Initial Results
Use Case 3 – Coding Doctor’s Notes
6
The Problem
Creating population health metrics
from certain global EMR systems is
difficult since they are not coded and
only have unstructured doctors notes
1. Evaluate the feasibility of mapping unstructured doctors notes to ICD-10 codes
2. Achieve > 95% accuracy that can be verified by physicians and specialists
Our Goal
• Initial machine learning approach achieved very poor results
• 75% - 95% accuracy was achieved through a rules-based approach with regular
expressions.
Initial Results
Chief Complaint:
Agitation
The patient complains of
feeling agitated most of
time. Associated symptoms
include anxiety.
Patient is not currently
medication.
Lessons Learned
• Finding your Precision vs Recall needs
• Language Handling
• Durability of results
• Trust & Confidence in results
• Vendor software integration
7

Contenu connexe

Similaire à Pistoia Alliance USA Conference 2016

KineMatik November 2010
KineMatik November 2010KineMatik November 2010
KineMatik November 2010
Michael Price
 
Quality Analyst_5+ years exp
Quality Analyst_5+ years expQuality Analyst_5+ years exp
Quality Analyst_5+ years exp
sivarani M
 

Similaire à Pistoia Alliance USA Conference 2016 (20)

zero defect
 zero defect zero defect
zero defect
 
What Doctors Can Teach Us on Continuous Learning
What Doctors Can Teach Us on Continuous LearningWhat Doctors Can Teach Us on Continuous Learning
What Doctors Can Teach Us on Continuous Learning
 
KineMatik November 2010
KineMatik November 2010KineMatik November 2010
KineMatik November 2010
 
Testing Your Software Testers
Testing Your Software TestersTesting Your Software Testers
Testing Your Software Testers
 
Implementing Metrics & Completeness Reporting in TMF Management​
Implementing Metrics & Completeness Reporting in TMF Management​Implementing Metrics & Completeness Reporting in TMF Management​
Implementing Metrics & Completeness Reporting in TMF Management​
 
Software Testing
Software TestingSoftware Testing
Software Testing
 
Basics in software testing
Basics in software testingBasics in software testing
Basics in software testing
 
Five Key Findings: Foolproof Computerized Examination
Five Key Findings: Foolproof Computerized ExaminationFive Key Findings: Foolproof Computerized Examination
Five Key Findings: Foolproof Computerized Examination
 
Automate test-oim
Automate test-oimAutomate test-oim
Automate test-oim
 
GMP Module 123123- Laboratory Controls.pptx
GMP Module 123123- Laboratory Controls.pptxGMP Module 123123- Laboratory Controls.pptx
GMP Module 123123- Laboratory Controls.pptx
 
CV-LAJ
CV-LAJCV-LAJ
CV-LAJ
 
Quality Analyst_5+ years exp
Quality Analyst_5+ years expQuality Analyst_5+ years exp
Quality Analyst_5+ years exp
 
How to Apply NLP to Analyze Clinical Trials
How to Apply NLP to Analyze Clinical TrialsHow to Apply NLP to Analyze Clinical Trials
How to Apply NLP to Analyze Clinical Trials
 
Modern quality systems in pharmaceutical education and industries
Modern quality systems in pharmaceutical education and industriesModern quality systems in pharmaceutical education and industries
Modern quality systems in pharmaceutical education and industries
 
Speed Up Software Releasing with Automated Testing
Speed Up Software Releasing with Automated TestingSpeed Up Software Releasing with Automated Testing
Speed Up Software Releasing with Automated Testing
 
What is Software Quality and how to measure it?
What is Software Quality and how to measure it?What is Software Quality and how to measure it?
What is Software Quality and how to measure it?
 
1.1.pptx
1.1.pptx1.1.pptx
1.1.pptx
 
Automation testing interview pdf org
Automation testing interview pdf orgAutomation testing interview pdf org
Automation testing interview pdf org
 
Fundamentals of testing - Testing & Implementations
Fundamentals of testing - Testing & ImplementationsFundamentals of testing - Testing & Implementations
Fundamentals of testing - Testing & Implementations
 
3. quality.ppt
3. quality.ppt3. quality.ppt
3. quality.ppt
 

Plus de Pistoia Alliance

Plus de Pistoia Alliance (20)

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirti
 
Fair by design
Fair by designFair by design
Fair by design
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIR
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinar
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata management
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcare
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the future
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoE
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019
 

Dernier

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
PirithiRaju
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
NazaninKarimi6
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
levieagacer
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdfPests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
Pests of cotton_Borer_Pests_Binomics_Dr.UPR.pdf
 
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verifiedSector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
Sector 62, Noida Call girls :8448380779 Model Escorts | 100% verified
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRLKochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
Kochi ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Kochi ESCORT SERVICE❤CALL GIRL
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 

Pistoia Alliance USA Conference 2016

  • 1. In Research and Development October 15th Machine Learning
  • 2. Why Machine Learning? We want the machine to do something: 1. we can’t do 2. we prefer not to do 3. we can’t do at scale 2
  • 3. Machine Learning Use Cases 3 Counting Parasites Classifying Documents Coding Doctors Notes
  • 4. Use Case 1 - Counting Parasites 4 The Problem Counting the parasites for our animal health products is labor-intensive, tedious, cumbersome and error-prone 1. Evaluate the practical feasibility of endo-parasites (worms) counting from videos 2. Create a prototype of a model that will count worms from a video of a single segment of a Petri dish with at least 90% precision Our Goal • Detected between 70% - 95% of parasites across different petri dish segments • Additional work required to move and count the entire petri dish • Did not detect dead worms Initial Results
  • 5. Use Case 2 – Classifying Clinical Documents 5 The Problem Building the Trial Master File (TMF) is a laborious task that requires a lot of time and manpower. Mislabeled documents may take time to locate during auditing Evaluate the feasibility of machine learning for classifying and labeling documents, especially when documents are scanned and partially handwritten Our Goal • Accurately classified 90% of documents to their labels after applying OCR • Additional work required to integrate the machine learning model with the product we use for TMF Initial Results
  • 6. Use Case 3 – Coding Doctor’s Notes 6 The Problem Creating population health metrics from certain global EMR systems is difficult since they are not coded and only have unstructured doctors notes 1. Evaluate the feasibility of mapping unstructured doctors notes to ICD-10 codes 2. Achieve > 95% accuracy that can be verified by physicians and specialists Our Goal • Initial machine learning approach achieved very poor results • 75% - 95% accuracy was achieved through a rules-based approach with regular expressions. Initial Results Chief Complaint: Agitation The patient complains of feeling agitated most of time. Associated symptoms include anxiety. Patient is not currently medication.
  • 7. Lessons Learned • Finding your Precision vs Recall needs • Language Handling • Durability of results • Trust & Confidence in results • Vendor software integration 7

Notes de l'éditeur

  1. Counting Parasites Classifying Documents Coding Doctor Notes Parasite detection in videos of petri dishes Document Categorization for clinical trials EMR Doctors Note translation
  2. More features