SlideShare une entreprise Scribd logo
1  sur  42
Towards automated phenotypic cell profiling
with high-content imaging
Ola Spjuth
Department of Pharmaceutical Biosciences, Uppsala University
Scaleout Systems AB
Automation and robotics will be increasingly
important in biological labs
AI will have high impact in experimental design
and hypothesis testing/generation
Use of live/temporal
profiling will increase
Use of live/temporal profiling will
increase
Data velocity will increase
Automated, continuous
analytics and AI will be needed
Who are we?
• Academic research group at Uppsala University
• Background in computational pharmacology (data science, AI/ML)
• Good at e-infrastructure, big data (data engineering)
• Setting up an high-content imaging lab for cell profiling
Research group website: http://pharmb.io
Accelerate drug discovery using AI,
automation and intelligent design
of experiments
• Predict safety concerns
• Explain drug mechanisms
• Screen for new drugs
Research objective
Hypothesis
revise
Insight
• Iterative
• Flexible
• Mostly manual
• Slow
Experiments
Analysis and interpretation
Traditional hypothesis testing
• Retrospective analysis
• Hopefully predictive
• Expensive
• Limited for hypothesis
testing
more
Predictive modeling
Database
Data generation
Traditional Processing Stream Processing
Data
Data Query
request
response
Real- T ime
Analytics
Data Results
ModelPrediction
Modeling and prediction
Data-driven science
Data
Hypothesis
Scientist
Data
aditional Processing Stream Processing
Data
Repository
a Query
request
response
Real- T ime
Analytics
Data Results
Current fact finding
Analyze data in motion – before it is stored
Low latency paradigm, push model
Data driven: bring data to the analytics
al fact finding
d analyze information stored on disk
aradigm, pull model
driven: submits queries to static data
Model
Insights
Considerations for “the next experiment”
• Quality over Quantity: Better data is often more useful than simply
more data
• Data collection may be expensive
• Cost of time and materials for an experiment
• Cheap vs. expensive data
• Raw images vs. annotated images
• Want to collect best data at minimal cost
• Can machines (AI) learn with fewer training instances if they ask the
right questions?
Intelligently designing experiments
• Plan experiment under
constrained resources
• Vary factors and study response
• Seek optimal design
• DoE (Design of Experiments)
• Example: Select X combinations
of drugs to test (cannot measure
all combinations due to costs,
time etc.)
”…DECREASE, an efficient machine learning model that
requires only a limited set of pairwise dose–response
measurements for accurate prediction of drug combination
synergy in a given sample.”
Active learning: Which experiment should
be done next?
Nature Model
Data Passive
Learning
Nature ModelResponse
Active
Learning
Query
• Exploration: Could lead to better predictions in future
• Exploitation: Make best predictions given current data
• Tradeoff!
Our vision:
Closed-loop (autonomous) experimentation
Automation Informatics
Data
essing Stream Processing
Query
esponse
Real- T ime
Analytics
Data Results
Current fact finding
Analyze data in motion – before it is storedstored on disk
Continuous AI
Results
Intelligent design of
experiments Experiments
Scientist External data
Automation in life science
• Varying degrees of automation!
• Automated instrument: working with a microplate (or stack of microplates)
• Robot: Liquid handling robot
• Automated lab
• A set of instruments, each working with microplates
• A plate handling robot serving multiple instruments
Robot scientist
1. King, R. D. et al. Functional genomic hypothesis generation and experimentation by a robot scientist. Nature 427, 247–252 (2004)
2. King, RD et al. "The Automation of Science". (2009) Science. 324 (5923): 85–89
3. Williams, K. et al."Cheaper faster drug development validated by the repositioning of drugs against neglected tropical diseases". (2015) Journal of the
Royal Society Interface. 12 (104): 20141289.
Adam1,2 is able to perform
independent experiments to test
hypotheses and interpret findings
without human guidance:
• hypothesizing to explain
observations
• devising experiments to test these
hypotheses
• physically running the
experiments using laboratory
robotics
• interpreting the results from the
experiments
• repeating the cycle as required
Let’s go back to cell profiling with high
content imaging!
High-throughput
biology
ThousandsThousandsDozens
Robots vs. Disease:
”Tackling World Health
Problems by Analyzing
Cell Images”
Anne E. Carpenter, PhD
IMAGING
PLATFORM
Genetic or
chemical
perturbations
Experiments
in multi-
well plates
Imaging Features Hypotheses
Convolutional Neural Network
Predictions
Cell painting: Imaging with multiplexed dyes
Bray et al. (2016). “Cell Painting, a High-Content Image-Based Assay for Morphological
Profiling Using Multiplexed Fluorescent Dyes.” Nature Protocols 11 (9): 1757–74.
Holographic live cell imaging
• Quantitative phase-contrast microscopy
• Holographic phase-shift imaging
• Label-free, live cell imaging
• Used inside incubator
HoloMonitor system
Protein degradation Cholesterol-lowering DNA replication
Microtubule stabilizer Actin disruptor Kinase inhibitor
Classify images into biological
mechanisms
Kensert A, Harrison PJ, Spjuth O.
Transfer learning with deep convolutional neural network for classifying cellular morphological changes.
SLAS DISCOVERY: Advancing Life Sciences R&D. 24, 4 (2019)
•Fluorescent LNPs (lipids)
•Fluorescent Cargo (mRNA)
•Fluorescent Product (protein)
No
LNPs
Partial LNP
uptake
LNP uptake and mRNA
decoding
Make predictions
using available
data
External data
Data warehouse
Design new
experimentsAI
Modeling
Publish data and models
Manual wet lab
Hypothesis
Verify using
external
protocol
Automated lab
Carry out
new
experiments
Analysis pipeline
Aim: Intelligent system for
drug/chemical profiling
Hypothesis
Hypothesis
test
generate
Fully automated cell painting
• Facilities, environmental control, ventilation
• Instruments and control software
• Automation system (dynamic scheduling)
• Lab protocol
• Compute and storage resources
• Analysis pipelines
Automating our cell-based lab
Fixed setup (version 1)
• ImageXpress XLS (Molecular Devices)
• Plate robot (Preciseflex)
• Plate incubator (Liconic), barcode reader
• BioMek 4000 liquid handling (Beckman
Coulter)
• Green Button Go lab automation software
(Biosero)
Observations:
• Quick to get up and running
• Suitable for fixed protocols
• Dependent on vendors to
solve problems
• Not easy to expand or
configure for us
Our priorities:
• Flexibility to expand/adapt
• Open source or good APIs
• Low cost, serviceable by us
• Configurable by us
Opentrons OT-2
Biotek MultiFlo FX, Multi-Mode Dispenser
Open source lab automation
Universal Robots UR10e
Biotek 405 LS, Washer
Collaborators wanted!
Dealing with large scale data
• High volume, relatively high velocity
• Continuously process data, train
models, serve models
• Embrace scalable virtual
infrastructures (cloud) and
microservices (containers)
GPU cluster
CPU server
Storage
Cloud
HPC
Online processing
Robotized lab
images
Automating our data processing
ImageDBImage viewer
File system
Metadata Files (images)
https://github.com/pharmbio/imagedb
Cold storage
Hot storage
Online,
intelligent
processing
Cell profilesQC workflows Interestingness models
HASTE CORE and Cell Profiler Pipeline
https://github.com/HASTE-project/cellprofiler-pipeline
Avoid storing
uninteresting data
Robotized lab
Data scientists
Empowering our data scientists
ImageDB
File system
Metadata Files (images)
Models
CPU/GPU/HPC cloud
Notebooks
Data
Models
External
users
Services
Public services
Publish
Data is not static!
• Public databases
• Batch/continuous updates
• In-house data
• Batch/continuous updates
 Need to continuously re-train models.
AI modeling life cycle
Model Development
ML studio
ML workflow
automation
Package & Deploy Models Model Serving
Model
management
Model
serving
Monitoring
Explore Data and
Develop Models
Train at scale
Register Model
and Metadata for
Serving
Package and
Publish Run in
operations Monitor
LoggingIntegrate
Data
scientist
Data
Engineer
Data
Engineer
Promote
Model
Ship
Model
In collaboration with:
https://github.com/leanaiorg/leanaistack
Lean AI Stack
http://haste.research.it.uu.se/
Carolina Wählby Ola Spjuth Andreas
Hellander
Relevant software we develop (and others
could use)
• Virtual Infrastructure with Kubernetes (IaaC)
• Portable, scalable, resilient
• ImageDB (projects, images, results etc.)
• ImageViewer
• Batch and Continuous Cell Profiler pipelines
• Deep Learning notebooks
• Open source lab automation system (in progress)
• Design cell-based experiments (in progress)
• Construct steering protocols for robotized lab
• Compound annotation project (to be started)
ImageDBImage viewer
https://github.com/pharmbio
Integrate with our other AI services
Site-of-metabolism and reaction types
http://ptp.service.pharmb.io/
https://metpred.service.pharmb.io/draw/
Target (safety) profiles
Data-driven science
Data
Hypothesis
Scientist
Data
aditional Processing Stream Processing
Data
Repository
a Query
request
response
Real- T ime
Analytics
Data Results
Current fact finding
Analyze data in motion – before it is stored
Low latency paradigm, push model
Data driven: bring data to the analytics
al fact finding
d analyze information stored on disk
aradigm, pull model
driven: submits queries to static data
Insights
Stream Processing
Real- T ime
Analytics
Data Results
Current fact finding
Analyze data in motion – before it is stored
Low latency paradigm, push model
Data driven: bring data to the analytics
isk
ata
Data
Repository
Data Query
request
response
Real- T ime
Analytics
Data Results
Current fact finding
Analyze data in motion – before it is stored
Low latency paradigm, push model
Data driven: bring data to the analytics
Historical fact finding
Find and analyze information stored on disk
Batch paradigm, pull model
Query-driven: submits queries to static data
Some ongoing projects
• Cell painting on combinations of 2-3 environmental compounds
• Cell painting on 380 kinase inhibitors on U2OS and MCF7 cell lines
• Cell painting and holographic imaging of 120 GPCR drugs
• Exploring dynamics of drug delivery using LNPs via imaging
• Deep Learning (CNN, RNN) and Cell Profiler features
• Comparing Cell Morphology with Gene Expression (public data)
• Several other projects in the pipeline…
We believe in Open Science
• All source code (software, notebooks) published online:
https://github.com/pharmbio
• Protocols published online
• https://protocol-delivery.protocols.opentrons.com/protocol/1494-uppsala-
university
• All data will be made available online
Collaborations and funding
IT-dept/UU
Andreas Hellander
Salman Toor
Carolina Wählby
Ida-Maria Sintorn
MedSci/UU
Kim Kultima
Stephanie Herman
Payam Emami
NGI/UGC
Adam Ameur
UUH/Clinical Genetics
Lucia Cavelier
AstraZeneca/Stena Line
Lars Carlsson
Ernst Ahlberg
Prosilico AB
Urban Fagerholm
Sven Hellberg
Karolinska
Institutet/MEB
Juni Palmgren
Martin Eklund
Jordi Carreras Puigvert
Karolinska
Institutet/IMM
Roland Grafström
Pekka Kohonen
SciLifeLab Data center
Johan Rung
Hanna Kultima
Funding:Consortia and involvements:
- Thank you -
Email: ola.spjuth@farmbio.uu.se
Web: https://pharmb.io

Contenu connexe

Tendances

Scientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible researchScientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible researchPeter van Heusden
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theoryC. Tobin Magle
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...Carole Goble
 
Reproducible research: First steps.
Reproducible research: First steps. Reproducible research: First steps.
Reproducible research: First steps. Richard Layton
 
Developing a Research Case Study
Developing a Research Case StudyDeveloping a Research Case Study
Developing a Research Case StudyJulie Goldman
 
Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Carole Goble
 
Session ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcSession ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcUSD Bioinformatics
 
Building a flexible infrastructure with Bioclipse, open source, and federated...
Building a flexible infrastructure with Bioclipse, open source, and federated...Building a flexible infrastructure with Bioclipse, open source, and federated...
Building a flexible infrastructure with Bioclipse, open source, and federated...Ola Spjuth
 
Too good to be true? How validate your data
Too good to be true? How validate your dataToo good to be true? How validate your data
Too good to be true? How validate your dataAlex Henderson
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceRaul Palma
 
Drug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDrug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDatabricks
 
Is one enough? Data warehousing for biomedical research
Is one enough? Data warehousing for biomedical researchIs one enough? Data warehousing for biomedical research
Is one enough? Data warehousing for biomedical researchGreg Landrum
 
Initial steps towards a production platform for DNA sequence analysis on the ...
Initial steps towards a production platform for DNA sequence analysis on the ...Initial steps towards a production platform for DNA sequence analysis on the ...
Initial steps towards a production platform for DNA sequence analysis on the ...Barbera van Schaik
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsSean Ekins
 

Tendances (20)

Scientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible researchScientific Workflow Systems for accessible, reproducible research
Scientific Workflow Systems for accessible, reproducible research
 
Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theory
 
NETTAB 2013
NETTAB 2013NETTAB 2013
NETTAB 2013
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
ISMB/ECCB 2013 Keynote Goble Results may vary: what is reproducible? why do o...
 
OpenTox Europe 2013
OpenTox Europe 2013OpenTox Europe 2013
OpenTox Europe 2013
 
Reproducible research: First steps.
Reproducible research: First steps. Reproducible research: First steps.
Reproducible research: First steps.
 
Developing a Research Case Study
Developing a Research Case StudyDeveloping a Research Case Study
Developing a Research Case Study
 
CSHALS 2013
CSHALS 2013CSHALS 2013
CSHALS 2013
 
A biologist in e-Science
A biologist in e-ScienceA biologist in e-Science
A biologist in e-Science
 
Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014Results may vary: Collaborations Workshop, Oxford 2014
Results may vary: Collaborations Workshop, Oxford 2014
 
Session ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmcSession ii g2 overview chemical modeling mmc
Session ii g2 overview chemical modeling mmc
 
Building a flexible infrastructure with Bioclipse, open source, and federated...
Building a flexible infrastructure with Bioclipse, open source, and federated...Building a flexible infrastructure with Bioclipse, open source, and federated...
Building a flexible infrastructure with Bioclipse, open source, and federated...
 
Too good to be true? How validate your data
Too good to be true? How validate your dataToo good to be true? How validate your data
Too good to be true? How validate your data
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth Science
 
Resume 2016 detailed
Resume 2016 detailedResume 2016 detailed
Resume 2016 detailed
 
Drug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge GraphsDrug Repurposing using Deep Learning on Knowledge Graphs
Drug Repurposing using Deep Learning on Knowledge Graphs
 
Is one enough? Data warehousing for biomedical research
Is one enough? Data warehousing for biomedical researchIs one enough? Data warehousing for biomedical research
Is one enough? Data warehousing for biomedical research
 
Initial steps towards a production platform for DNA sequence analysis on the ...
Initial steps towards a production platform for DNA sequence analysis on the ...Initial steps towards a production platform for DNA sequence analysis on the ...
Initial steps towards a production platform for DNA sequence analysis on the ...
 
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning ModelsMining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
Mining 'Bigger' Datasets to Create, Validate and Share Machine Learning Models
 

Similaire à Towards automated phenotypic cell profiling with high-content imaging

Towards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery LabsTowards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery LabsOla Spjuth
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchEuropean Bioinformatics Institute
 
Automating cell-based screening with open source, robotics and AI
Automating cell-based screening with open source, robotics and AIAutomating cell-based screening with open source, robotics and AI
Automating cell-based screening with open source, robotics and AIOla Spjuth
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Ola Spjuth
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker, Inc.
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataPhilip Cheung
 
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsRamil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsGigaScience, BGI Hong Kong
 
Continuous modeling - automating model building on high-performance e-Infrast...
Continuous modeling - automating model building on high-performance e-Infrast...Continuous modeling - automating model building on high-performance e-Infrast...
Continuous modeling - automating model building on high-performance e-Infrast...Ola Spjuth
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for ScienceIan Foster
 
The case for cloud computing in Life Sciences
The case for cloud computing in Life SciencesThe case for cloud computing in Life Sciences
The case for cloud computing in Life SciencesOla Spjuth
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Machine Learning in Modern Medicine with Erin LeDell at Stanford Med
Machine Learning in Modern Medicine with Erin LeDell at Stanford MedMachine Learning in Modern Medicine with Erin LeDell at Stanford Med
Machine Learning in Modern Medicine with Erin LeDell at Stanford MedSri Ambati
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Spark Summit
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...DataScienceConferenc1
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesBastian Greshake
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsmikaelhuss
 

Similaire à Towards automated phenotypic cell profiling with high-content imaging (20)

Towards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery LabsTowards Automated AI-guided Drug Discovery Labs
Towards Automated AI-guided Drug Discovery Labs
 
Advanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven ResearchAdvanced Bioinformatics for Genomics and BioData Driven Research
Advanced Bioinformatics for Genomics and BioData Driven Research
 
Automating cell-based screening with open source, robotics and AI
Automating cell-based screening with open source, robotics and AIAutomating cell-based screening with open source, robotics and AI
Automating cell-based screening with open source, robotics and AI
 
Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...Automating the process of continuously prioritising data, updating and deploy...
Automating the process of continuously prioritising data, updating and deploy...
 
Docker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce HoffDocker in Open Science Data Analysis Challenges by Bruce Hoff
Docker in Open Science Data Analysis Challenges by Bruce Hoff
 
BioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadataBioAssay Express: Creating and exploiting assay metadata
BioAssay Express: Creating and exploiting assay metadata
 
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientistsRamil Mauleon: Galaxy: bioinformatics for rice scientists
Ramil Mauleon: Galaxy: bioinformatics for rice scientists
 
CV_10/17
CV_10/17CV_10/17
CV_10/17
 
Cv long
Cv longCv long
Cv long
 
Continuous modeling - automating model building on high-performance e-Infrast...
Continuous modeling - automating model building on high-performance e-Infrast...Continuous modeling - automating model building on high-performance e-Infrast...
Continuous modeling - automating model building on high-performance e-Infrast...
 
Collins seattle-2014-final
Collins seattle-2014-finalCollins seattle-2014-final
Collins seattle-2014-final
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
The case for cloud computing in Life Sciences
The case for cloud computing in Life SciencesThe case for cloud computing in Life Sciences
The case for cloud computing in Life Sciences
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Machine Learning in Modern Medicine with Erin LeDell at Stanford Med
Machine Learning in Modern Medicine with Erin LeDell at Stanford MedMachine Learning in Modern Medicine with Erin LeDell at Stanford Med
Machine Learning in Modern Medicine with Erin LeDell at Stanford Med
 
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
Escaping Flatland: Interactive High-Dimensional Data Analysis in Drug Discove...
 
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
[DSC Europe 23][DigiHealth] Vesna Pajic - Machine Learning Techniques for omi...
 
openSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association StudiesopenSNP - Crowdsourcing Genome Wide Association Studies
openSNP - Crowdsourcing Genome Wide Association Studies
 
Data analysis & integration challenges in genomics
Data analysis & integration challenges in genomicsData analysis & integration challenges in genomics
Data analysis & integration challenges in genomics
 
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...Activities at the Royal Society of Chemistry to gather, extract and analyze b...
Activities at the Royal Society of Chemistry to gather, extract and analyze b...
 

Plus de Ola Spjuth

Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets
Combining Prediction Intervals on Multi-Source Non-Disclosed Regression DatasetsCombining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets
Combining Prediction Intervals on Multi-Source Non-Disclosed Regression DatasetsOla Spjuth
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Ola Spjuth
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudOla Spjuth
 
Storage and Analysis of Sensitive Large-Scale Biomedical Data in Sweden
Storage and Analysis of Sensitive Large-Scale Biomedical Data in SwedenStorage and Analysis of Sensitive Large-Scale Biomedical Data in Sweden
Storage and Analysis of Sensitive Large-Scale Biomedical Data in SwedenOla Spjuth
 
Enabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceEnabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceOla Spjuth
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Ola Spjuth
 
Interoperability and scalability with microservices in science
Interoperability and scalability with microservices in scienceInteroperability and scalability with microservices in science
Interoperability and scalability with microservices in scienceOla Spjuth
 
Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)
Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)
Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)Ola Spjuth
 
Accessing and scripting CDK from Bioclipse
Accessing and scripting CDK from BioclipseAccessing and scripting CDK from Bioclipse
Accessing and scripting CDK from BioclipseOla Spjuth
 

Plus de Ola Spjuth (9)

Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets
Combining Prediction Intervals on Multi-Source Non-Disclosed Regression DatasetsCombining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets
Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
Data-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and CloudData-intensive bioinformatics on HPC and Cloud
Data-intensive bioinformatics on HPC and Cloud
 
Storage and Analysis of Sensitive Large-Scale Biomedical Data in Sweden
Storage and Analysis of Sensitive Large-Scale Biomedical Data in SwedenStorage and Analysis of Sensitive Large-Scale Biomedical Data in Sweden
Storage and Analysis of Sensitive Large-Scale Biomedical Data in Sweden
 
Enabling Translational Medicine with e-Science
Enabling Translational Medicine with e-ScienceEnabling Translational Medicine with e-Science
Enabling Translational Medicine with e-Science
 
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
Analyzing Big Data in Medicine with Virtual Research Environments and Microse...
 
Interoperability and scalability with microservices in science
Interoperability and scalability with microservices in scienceInteroperability and scalability with microservices in science
Interoperability and scalability with microservices in science
 
Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)
Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)
Chemical decision support in toxicology and pharmacology (OpenToxEU 2013)
 
Accessing and scripting CDK from Bioclipse
Accessing and scripting CDK from BioclipseAccessing and scripting CDK from Bioclipse
Accessing and scripting CDK from Bioclipse
 

Dernier

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 

Dernier (20)

Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 

Towards automated phenotypic cell profiling with high-content imaging

  • 1. Towards automated phenotypic cell profiling with high-content imaging Ola Spjuth Department of Pharmaceutical Biosciences, Uppsala University Scaleout Systems AB
  • 2. Automation and robotics will be increasingly important in biological labs
  • 3. AI will have high impact in experimental design and hypothesis testing/generation
  • 5. Use of live/temporal profiling will increase Data velocity will increase Automated, continuous analytics and AI will be needed
  • 6.
  • 7. Who are we? • Academic research group at Uppsala University • Background in computational pharmacology (data science, AI/ML) • Good at e-infrastructure, big data (data engineering) • Setting up an high-content imaging lab for cell profiling Research group website: http://pharmb.io
  • 8. Accelerate drug discovery using AI, automation and intelligent design of experiments • Predict safety concerns • Explain drug mechanisms • Screen for new drugs Research objective
  • 9. Hypothesis revise Insight • Iterative • Flexible • Mostly manual • Slow Experiments Analysis and interpretation Traditional hypothesis testing • Retrospective analysis • Hopefully predictive • Expensive • Limited for hypothesis testing more Predictive modeling Database Data generation Traditional Processing Stream Processing Data Data Query request response Real- T ime Analytics Data Results ModelPrediction Modeling and prediction
  • 10. Data-driven science Data Hypothesis Scientist Data aditional Processing Stream Processing Data Repository a Query request response Real- T ime Analytics Data Results Current fact finding Analyze data in motion – before it is stored Low latency paradigm, push model Data driven: bring data to the analytics al fact finding d analyze information stored on disk aradigm, pull model driven: submits queries to static data Model Insights
  • 11. Considerations for “the next experiment” • Quality over Quantity: Better data is often more useful than simply more data • Data collection may be expensive • Cost of time and materials for an experiment • Cheap vs. expensive data • Raw images vs. annotated images • Want to collect best data at minimal cost • Can machines (AI) learn with fewer training instances if they ask the right questions?
  • 12. Intelligently designing experiments • Plan experiment under constrained resources • Vary factors and study response • Seek optimal design • DoE (Design of Experiments) • Example: Select X combinations of drugs to test (cannot measure all combinations due to costs, time etc.) ”…DECREASE, an efficient machine learning model that requires only a limited set of pairwise dose–response measurements for accurate prediction of drug combination synergy in a given sample.”
  • 13. Active learning: Which experiment should be done next? Nature Model Data Passive Learning Nature ModelResponse Active Learning Query • Exploration: Could lead to better predictions in future • Exploitation: Make best predictions given current data • Tradeoff!
  • 14. Our vision: Closed-loop (autonomous) experimentation Automation Informatics Data essing Stream Processing Query esponse Real- T ime Analytics Data Results Current fact finding Analyze data in motion – before it is storedstored on disk Continuous AI Results Intelligent design of experiments Experiments Scientist External data
  • 15.
  • 16. Automation in life science • Varying degrees of automation! • Automated instrument: working with a microplate (or stack of microplates) • Robot: Liquid handling robot • Automated lab • A set of instruments, each working with microplates • A plate handling robot serving multiple instruments
  • 17. Robot scientist 1. King, R. D. et al. Functional genomic hypothesis generation and experimentation by a robot scientist. Nature 427, 247–252 (2004) 2. King, RD et al. "The Automation of Science". (2009) Science. 324 (5923): 85–89 3. Williams, K. et al."Cheaper faster drug development validated by the repositioning of drugs against neglected tropical diseases". (2015) Journal of the Royal Society Interface. 12 (104): 20141289. Adam1,2 is able to perform independent experiments to test hypotheses and interpret findings without human guidance: • hypothesizing to explain observations • devising experiments to test these hypotheses • physically running the experiments using laboratory robotics • interpreting the results from the experiments • repeating the cycle as required
  • 18. Let’s go back to cell profiling with high content imaging!
  • 19. High-throughput biology ThousandsThousandsDozens Robots vs. Disease: ”Tackling World Health Problems by Analyzing Cell Images” Anne E. Carpenter, PhD IMAGING PLATFORM
  • 20. Genetic or chemical perturbations Experiments in multi- well plates Imaging Features Hypotheses Convolutional Neural Network Predictions Cell painting: Imaging with multiplexed dyes Bray et al. (2016). “Cell Painting, a High-Content Image-Based Assay for Morphological Profiling Using Multiplexed Fluorescent Dyes.” Nature Protocols 11 (9): 1757–74.
  • 21. Holographic live cell imaging • Quantitative phase-contrast microscopy • Holographic phase-shift imaging • Label-free, live cell imaging • Used inside incubator HoloMonitor system
  • 22. Protein degradation Cholesterol-lowering DNA replication Microtubule stabilizer Actin disruptor Kinase inhibitor Classify images into biological mechanisms Kensert A, Harrison PJ, Spjuth O. Transfer learning with deep convolutional neural network for classifying cellular morphological changes. SLAS DISCOVERY: Advancing Life Sciences R&D. 24, 4 (2019)
  • 23. •Fluorescent LNPs (lipids) •Fluorescent Cargo (mRNA) •Fluorescent Product (protein) No LNPs Partial LNP uptake LNP uptake and mRNA decoding
  • 24.
  • 25. Make predictions using available data External data Data warehouse Design new experimentsAI Modeling Publish data and models Manual wet lab Hypothesis Verify using external protocol Automated lab Carry out new experiments Analysis pipeline Aim: Intelligent system for drug/chemical profiling Hypothesis Hypothesis test generate
  • 26. Fully automated cell painting • Facilities, environmental control, ventilation • Instruments and control software • Automation system (dynamic scheduling) • Lab protocol • Compute and storage resources • Analysis pipelines
  • 27. Automating our cell-based lab Fixed setup (version 1) • ImageXpress XLS (Molecular Devices) • Plate robot (Preciseflex) • Plate incubator (Liconic), barcode reader • BioMek 4000 liquid handling (Beckman Coulter) • Green Button Go lab automation software (Biosero) Observations: • Quick to get up and running • Suitable for fixed protocols • Dependent on vendors to solve problems • Not easy to expand or configure for us Our priorities: • Flexibility to expand/adapt • Open source or good APIs • Low cost, serviceable by us • Configurable by us
  • 28. Opentrons OT-2 Biotek MultiFlo FX, Multi-Mode Dispenser Open source lab automation Universal Robots UR10e Biotek 405 LS, Washer Collaborators wanted!
  • 29.
  • 30. Dealing with large scale data • High volume, relatively high velocity • Continuously process data, train models, serve models • Embrace scalable virtual infrastructures (cloud) and microservices (containers) GPU cluster CPU server Storage Cloud HPC Online processing
  • 31. Robotized lab images Automating our data processing ImageDBImage viewer File system Metadata Files (images) https://github.com/pharmbio/imagedb Cold storage Hot storage Online, intelligent processing Cell profilesQC workflows Interestingness models HASTE CORE and Cell Profiler Pipeline https://github.com/HASTE-project/cellprofiler-pipeline Avoid storing uninteresting data
  • 32. Robotized lab Data scientists Empowering our data scientists ImageDB File system Metadata Files (images) Models CPU/GPU/HPC cloud Notebooks Data Models External users Services Public services Publish
  • 33. Data is not static! • Public databases • Batch/continuous updates • In-house data • Batch/continuous updates  Need to continuously re-train models.
  • 34. AI modeling life cycle Model Development ML studio ML workflow automation Package & Deploy Models Model Serving Model management Model serving Monitoring Explore Data and Develop Models Train at scale Register Model and Metadata for Serving Package and Publish Run in operations Monitor LoggingIntegrate Data scientist Data Engineer Data Engineer Promote Model Ship Model In collaboration with: https://github.com/leanaiorg/leanaistack Lean AI Stack
  • 36. Relevant software we develop (and others could use) • Virtual Infrastructure with Kubernetes (IaaC) • Portable, scalable, resilient • ImageDB (projects, images, results etc.) • ImageViewer • Batch and Continuous Cell Profiler pipelines • Deep Learning notebooks • Open source lab automation system (in progress) • Design cell-based experiments (in progress) • Construct steering protocols for robotized lab • Compound annotation project (to be started) ImageDBImage viewer https://github.com/pharmbio
  • 37. Integrate with our other AI services Site-of-metabolism and reaction types http://ptp.service.pharmb.io/ https://metpred.service.pharmb.io/draw/ Target (safety) profiles
  • 38. Data-driven science Data Hypothesis Scientist Data aditional Processing Stream Processing Data Repository a Query request response Real- T ime Analytics Data Results Current fact finding Analyze data in motion – before it is stored Low latency paradigm, push model Data driven: bring data to the analytics al fact finding d analyze information stored on disk aradigm, pull model driven: submits queries to static data Insights Stream Processing Real- T ime Analytics Data Results Current fact finding Analyze data in motion – before it is stored Low latency paradigm, push model Data driven: bring data to the analytics isk ata Data Repository Data Query request response Real- T ime Analytics Data Results Current fact finding Analyze data in motion – before it is stored Low latency paradigm, push model Data driven: bring data to the analytics Historical fact finding Find and analyze information stored on disk Batch paradigm, pull model Query-driven: submits queries to static data
  • 39. Some ongoing projects • Cell painting on combinations of 2-3 environmental compounds • Cell painting on 380 kinase inhibitors on U2OS and MCF7 cell lines • Cell painting and holographic imaging of 120 GPCR drugs • Exploring dynamics of drug delivery using LNPs via imaging • Deep Learning (CNN, RNN) and Cell Profiler features • Comparing Cell Morphology with Gene Expression (public data) • Several other projects in the pipeline…
  • 40. We believe in Open Science • All source code (software, notebooks) published online: https://github.com/pharmbio • Protocols published online • https://protocol-delivery.protocols.opentrons.com/protocol/1494-uppsala- university • All data will be made available online
  • 41. Collaborations and funding IT-dept/UU Andreas Hellander Salman Toor Carolina Wählby Ida-Maria Sintorn MedSci/UU Kim Kultima Stephanie Herman Payam Emami NGI/UGC Adam Ameur UUH/Clinical Genetics Lucia Cavelier AstraZeneca/Stena Line Lars Carlsson Ernst Ahlberg Prosilico AB Urban Fagerholm Sven Hellberg Karolinska Institutet/MEB Juni Palmgren Martin Eklund Jordi Carreras Puigvert Karolinska Institutet/IMM Roland Grafström Pekka Kohonen SciLifeLab Data center Johan Rung Hanna Kultima Funding:Consortia and involvements:
  • 42. - Thank you - Email: ola.spjuth@farmbio.uu.se Web: https://pharmb.io