SlideShare une entreprise Scribd logo
1  sur  56
Oxford Nanopore SmidgION
DNA-IoT Interdiction:
● Epidemics
● Poaching/Smuggling
● Acute Lethal
Infections
DeepDream (wikipedia)
is a computer vision program
created by Google which uses a
convolutional neural network to
find and enhance patterns in
images via algorithmic
pareidolia[1], thus creating a
dream-like hallucinogenic
appearance in the deliberately
over-processed images.
A late-stage DeepDream processed photograph of three men in a pool.
[1]Pareidolia is a psychological phenomenon in which the mind responds to a stimulus (an image or a sound) by
perceiving a familiar pattern where none exists.
Allen Day, PhD // Science Advocate // @allenday // #genomics #ml #datascience
GOOGLE CONFIDENTIAL
Google Cloud
Run your apps on the same system as Google
Table of Contents
Introduction Precision Medicine: an Informed Opinion
Section 1 Deep Learning Concepts
Section 2 Deep Learning @ Genomic Analysis
Section 3 Deep Learning @ Precision Agriculture
➤ ➤
➤
➤
Genetic
Optimization
(Breeding)
Organism Context
(Environment)
Optimization
Today’s Focus: Learn these Functions
Deep Neural Networks: Algorithms that Learn
● Modernization of artificial neural networks
● Made of of simple mathematical units,
organized in layers, that together can
compute some (arbitrary) function
● more layers = deeper = more general
● Learn from raw, heterogeneous data
* Human Performance
based on analysis done
by Andrej Karpathy.
More details here.
Image understanding is (getting) better than human level
ImageNet Challenge: Given
an image, predict one of
1000+ of classes
%errors
“Given an image,
predict one of
1000+ of classes”
Image credit:
360phot0.blogspot.com
ImageNet
Challenge
Transfer Learning
Quickly able to Learn New Concepts
“t-rex”“quidditch”
Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images 2015
Style Transfer
Learn features from one dataset, apply them to another
Can be done within domain:
Image Labels => New Image Classes
And between domains:
Image Features => Image Filters
Image Labels + Language Model => Image Captions
Show and Tell: A
Neural Image Caption
Generator 2015
Style Transfer
https://magenta.tensorflow.org/
Released in Nov. 2015
#1
repository
for “machine learning”
category on GitHub
TensorFlow
Genetic
Optimization
(Breeding)
Marker Assisted Breeding
Google Cloud Platform
Marker-Assisted Breeding Rapidly Increases Frequency of
Favorable Genes
https://www.slideshare.net/finance28/monsanto-082305a
Yield needs to increase by
3% per year
to match GDP growth
Marker-assisted selection for quantitative traits
https://www.sec.gov/Archives/edgar/data/1110783/0000950134
02011773/c71992exv99w2.htm
Select & Recombine
Identify
desirable individuals
Grow
Select & Recombine
Grow
Generate Marker Fingerprint
Sample tissue
Extract DNAModel Data & Identify
desirable carriers
Marker-Assisted Breeding Rapidly Increases Frequency of
Favorable Genes
Genomics & Genetics Problems:
How to Start Applying DNNs?
Must-haves for deep learning:
● Lots of data: >50k examples, >1M examples ideal
● High-quality input and labels for training
● Label ~ F(data) unknown but certainly function exists
● High-quality prev. efforts so we know that DNNs are key
○ i.e. hard to solve with classical statistical
approaches
SNP and indel calling from NGS data
Verily | Confidential & Proprietary
Calling genetic variation may seem easy...
Verily | Confidential & Proprietary
... but lots of places in the genome are difficult
Creating a universal SNP and small indel
variant caller with deep neural networks
Ryan Poplin, Cory McLean, Dan Newburger, Jojo Dijamco, Nam Nguyen, Dion Loy,
Sam Gross, Madeleine Cule, Peyton Greenside, Justin Zook, Marc Salit, Mark
DePristo, Verily Life Sciences, October 2016
DNN (Inception V3) Predicts True Genotype from Pileup Images
{ 0.001, 0.994, 0.005 }
{ 0.001, 0.990, 0.009 }
{ 0.000, 0.001, 0.999 }
{ 0.600, 0.399, 0.001 }
Output:
Probability of diploid
genotype states
{ HOM_REF, HET, HOM_VAR }
Raw pixels
Input:
Millions of labeled pileup
images from gold standard
samples
Verily | Confidential & Proprietary
Using deep learning for ultra-accurate mutation detection
Input:
Millions of labeled
pileup image
stacks from gold
standard sample
Raw pixels
{ 0.001, 0.994, 0.005 }
{ 0.001, 0.990, 0.009 }
{ 0.000, 0.001, 0.999 }
{ 0.600, 0.399, 0.001 }
Output:
Probability distribution
over the three diploid
genotype states
{ HOM_REF, HET, HOM_VAR }
31
Verily | Confidential & Proprietary
Example DNA read pileup “images”
true snps true indels false variants
red = {A,C,G,T}. green = {quality score}. blue = {read strand}.
alpha = {matches ref genome}.
Verily | Confidential & Proprietary
PrecisionFDA: unique opportunity with blinded truth sample
NA12878
t
log($-1
)
reads writes edits
Select & Recombine
Grow
Generate Marker Fingerprint
Sample tissue
Extract DNAModel Data & Identify
desirable carriers
Marker-Assisted Breeding Rapidly Increases Frequency of
Favorable Genes
DNA sequencing is no
longer the bottleneck...
Select & Recombine
Grow
Generate Marker Fingerprint
Sample tissue
Extract DNAModel Data & Identify
desirable carriers
Marker-Assisted Breeding Rapidly Increases Frequency of
Favorable Genes
Leading to increased
investment in
machine learning DNA sequencing is no
longer the bottleneck...
Select & Recombine
Grow
Generate Marker Fingerprint
Sample tissue
Extract DNAModel Data & Identify
desirable carriers
Marker-Assisted Breeding Rapidly Increases Frequency of
Favorable Genes
Increased investment
in machine
learning…
...requires more data and other data types
Organism Context
(Environment)
Optimization
Gene/Environment Harmonization
anezconsulting.com/precision-agronomy/
Agronometric Integration
● Satellite & UAV
Images
● Geological Data
● Meteorological
& Sensor Data
● Cultivar Data
● Other GIS Data
● Yield Data
TensorFlow
https://cloudplatform.googleblog.com/2015/11/startup-spotlight-Descartes-Labs-monitors-planet-Earths-resources-with-Google-Compute-Engine.html
Open Source Software
&
Open Access Data
Bootstrapping a Virtuous Cycle
● Increased profit (from risk modeling) leads to increased investment
and risk reduction in the form of:
● More accurate forecasting / engineering of climate
○ Collect & model more meteorological data
● Development of crop varieties to complement future terrestrial /
climate conditions
● High-precision placement and monitoring of individual plants
○ Autonomous planting
○ remote sensing
+ =
+
Tractors are
Geospatial Printers
+
Tractors are
Geospatial Printers
Micro-environment optimized cultivars
Mapping the Diversity of Maize Races in Mexico
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0114657
Why Cannabis?
● Intellectual Property - No patented genes or strains… yet
● Update Mar 18, 2017: US PTO issues trademark for Gorilla Glue #4
● Production - Breeding is highly fragmented… for now
● However, unclear that breeding will centralize due to cheap DNA
sequencing and digital phenotyping
● Distribution (Growing) - Most likely to centralize due to economies of
scale (e.g. multi-tenant greenhouses), and already crowded, wtf?
● Market Access - Unclear that this is a viable segment of supply chain
(see GG#4 above). Also self-replication property of plants...
Why Cannabis?
● Intellectual Property - No patented genes or strains… yet
● Update Mar 18, 2017: US PTO issues trademark for Gorilla Glue #4
● Production - Breeding is highly fragmented… for now
● However, unclear that breeding will centralize due to cheap DNA
sequencing and digital phenotyping
● Distribution (Growing) - Most likely to centralize due to economies of
scale (e.g. multi-tenant greenhouses), and already crowded, wtf?
● Market Access - Unclear that this is a viable segment of supply chain
(see GG#4 above). Also self-replication property of plants...
● Threat: does Cannabis become like Yogurt starter kits?
Cannabis Genomics @ Google Cloud
https://cloud.google.com/bigquery/public-data/1000-cannabis
Build What’s Next
Thank You!
Allen Day, PhD // Science Advocate // @allenday // #genomics #ml #datascience

Contenu connexe

Tendances

Introduction to R for Data Mining
Introduction to R for Data MiningIntroduction to R for Data Mining
Introduction to R for Data Mining
Revolution Analytics
 
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Spark Summit
 
Genetic programming with clojure.spec and Beyond
Genetic programming with clojure.spec and BeyondGenetic programming with clojure.spec and Beyond
Genetic programming with clojure.spec and Beyond
Carin Meier
 

Tendances (20)

Machine learning in the life sciences with knime
Machine learning in the life sciences with knimeMachine learning in the life sciences with knime
Machine learning in the life sciences with knime
 
2016 bergen-sars
2016 bergen-sars2016 bergen-sars
2016 bergen-sars
 
2016 davis-biotech
2016 davis-biotech2016 davis-biotech
2016 davis-biotech
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
2015 aem-grs-keynote
2015 aem-grs-keynote2015 aem-grs-keynote
2015 aem-grs-keynote
 
2016 davis-plantbio
2016 davis-plantbio2016 davis-plantbio
2016 davis-plantbio
 
2015 balti-and-bioinformatics
2015 balti-and-bioinformatics2015 balti-and-bioinformatics
2015 balti-and-bioinformatics
 
Is one enough? Data warehousing for biomedical research
Is one enough? Data warehousing for biomedical researchIs one enough? Data warehousing for biomedical research
Is one enough? Data warehousing for biomedical research
 
2015 genome-center
2015 genome-center2015 genome-center
2015 genome-center
 
李育杰/The Growth of a Data Scientist
李育杰/The Growth of a Data Scientist李育杰/The Growth of a Data Scientist
李育杰/The Growth of a Data Scientist
 
2014 sage-talk
2014 sage-talk2014 sage-talk
2014 sage-talk
 
2014 aus-agta
2014 aus-agta2014 aus-agta
2014 aus-agta
 
Introduction to R for Data Mining
Introduction to R for Data MiningIntroduction to R for Data Mining
Introduction to R for Data Mining
 
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆巨量與開放資料之創新機會與關鍵挑戰-曾新穆
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
 
Machine Learning in Healthcare Diagnostics
Machine Learning in Healthcare DiagnosticsMachine Learning in Healthcare Diagnostics
Machine Learning in Healthcare Diagnostics
 
VariantSpark a library for genomics by Lynn Langit
VariantSpark a library for genomics by Lynn LangitVariantSpark a library for genomics by Lynn Langit
VariantSpark a library for genomics by Lynn Langit
 
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
Finding Needles in Genomic Haystacks with “Wide” Random Forest: Spark Summit ...
 
Genetic programming with clojure.spec and Beyond
Genetic programming with clojure.spec and BeyondGenetic programming with clojure.spec and Beyond
Genetic programming with clojure.spec and Beyond
 
Reproducibility for IR evaluation
Reproducibility for IR evaluationReproducibility for IR evaluation
Reproducibility for IR evaluation
 
2014 bangkok-talk
2014 bangkok-talk2014 bangkok-talk
2014 bangkok-talk
 

Similaire à 20170428 - Look to Precision Agriculture to Bootstrap Precision Medicine - Culver City - DataScience.com

II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical ResearchII-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
Dr. Haxel Consult
 

Similaire à 20170428 - Look to Precision Agriculture to Bootstrap Precision Medicine - Culver City - DataScience.com (20)

20170402 Crop Innovation and Business - Amsterdam
20170402 Crop Innovation and Business - Amsterdam20170402 Crop Innovation and Business - Amsterdam
20170402 Crop Innovation and Business - Amsterdam
 
Cloud Accelerated Genomics
Cloud Accelerated GenomicsCloud Accelerated Genomics
Cloud Accelerated Genomics
 
20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix
20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix
20170315 Cloud Accelerated Genomics - Tel Aviv / Phoenix
 
Deep Dive Into Deep Learning : How AI is Powering the Future of Endpoint Secu...
Deep Dive Into Deep Learning : How AI is Powering the Future of Endpoint Secu...Deep Dive Into Deep Learning : How AI is Powering the Future of Endpoint Secu...
Deep Dive Into Deep Learning : How AI is Powering the Future of Endpoint Secu...
 
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical ResearchII-SDV 2017: The Next Era: Deep Learning for Biomedical Research
II-SDV 2017: The Next Era: Deep Learning for Biomedical Research
 
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdfMachine_Learning_with_MATLAB_Seminar_Latest.pdf
Machine_Learning_with_MATLAB_Seminar_Latest.pdf
 
Edge-based Discovery of Training Data for Machine Learning
Edge-based Discovery of Training Data for Machine LearningEdge-based Discovery of Training Data for Machine Learning
Edge-based Discovery of Training Data for Machine Learning
 
Next generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciencesNext generation genomics: Petascale data in the life sciences
Next generation genomics: Petascale data in the life sciences
 
building intelligent systems with large scale deep learning
building intelligent systems with large scale deep learningbuilding intelligent systems with large scale deep learning
building intelligent systems with large scale deep learning
 
emerging trends.pdf
emerging trends.pdfemerging trends.pdf
emerging trends.pdf
 
AI Cybersecurity: Pros & Cons. AI is reshaping cybersecurity
AI Cybersecurity: Pros & Cons. AI is reshaping cybersecurityAI Cybersecurity: Pros & Cons. AI is reshaping cybersecurity
AI Cybersecurity: Pros & Cons. AI is reshaping cybersecurity
 
Spark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van HamSpark Summit EU talk by Erwin Datema and Roeland van Ham
Spark Summit EU talk by Erwin Datema and Roeland van Ham
 
Measuring Relevance in the Negative Space
Measuring Relevance in the Negative SpaceMeasuring Relevance in the Negative Space
Measuring Relevance in the Negative Space
 
Machine learning_ Replicating Human Brain
Machine learning_ Replicating Human BrainMachine learning_ Replicating Human Brain
Machine learning_ Replicating Human Brain
 
Randy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUE
Randy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUERandy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUE
Randy Goebel for the KIEF 2018. FROM DATA TO ECONOMIC VALUE
 
Introduction to Artificial Intelligence and Machine Learning: Ecosystem and T...
Introduction to Artificial Intelligence and Machine Learning: Ecosystem and T...Introduction to Artificial Intelligence and Machine Learning: Ecosystem and T...
Introduction to Artificial Intelligence and Machine Learning: Ecosystem and T...
 
Webinar trends in machine learning ce adar july 9 2020 susan mckeever
Webinar trends in machine learning ce adar july 9 2020 susan mckeeverWebinar trends in machine learning ce adar july 9 2020 susan mckeever
Webinar trends in machine learning ce adar july 9 2020 susan mckeever
 
Slima explainable deep learning using fuzzy logic human ist u fribourg ver 17...
Slima explainable deep learning using fuzzy logic human ist u fribourg ver 17...Slima explainable deep learning using fuzzy logic human ist u fribourg ver 17...
Slima explainable deep learning using fuzzy logic human ist u fribourg ver 17...
 
Machine Learning for Domain Experts
Machine Learning for Domain ExpertsMachine Learning for Domain Experts
Machine Learning for Domain Experts
 
Paper
PaperPaper
Paper
 

Plus de Allen Day, PhD

Hadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBI
Hadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBIHadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBI
Hadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBI
Allen Day, PhD
 
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
Allen Day, PhD
 
20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns
Allen Day, PhD
 
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
Allen Day, PhD
 

Plus de Allen Day, PhD (18)

Genome Analysis Pipelines with Spark and ADAM
Genome Analysis Pipelines with Spark and ADAMGenome Analysis Pipelines with Spark and ADAM
Genome Analysis Pipelines with Spark and ADAM
 
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGIHadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
Hadoop and Genomics - What you need to know - 2015.04.09 - Shenzhen - BGI
 
Hadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBI
Hadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBIHadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBI
Hadoop and Genomics - What you need to know - Cambridge - Sanger Center and EBI
 
Hadoop and Genomics - What You Need to Know - London - Viadex RCC - 2015.03.17
Hadoop and Genomics - What You Need to Know - London - Viadex RCC - 2015.03.17Hadoop and Genomics - What You Need to Know - London - Viadex RCC - 2015.03.17
Hadoop and Genomics - What You Need to Know - London - Viadex RCC - 2015.03.17
 
Hadoop as a Platform for Genomics - Strata 2015, San Jose
Hadoop as a Platform for Genomics - Strata 2015, San JoseHadoop as a Platform for Genomics - Strata 2015, San Jose
Hadoop as a Platform for Genomics - Strata 2015, San Jose
 
Genomics isn't Special
Genomics isn't SpecialGenomics isn't Special
Genomics isn't Special
 
Renaissance in Medicine - Strata - NoSQL and Genomics
Renaissance in Medicine - Strata - NoSQL and GenomicsRenaissance in Medicine - Strata - NoSQL and Genomics
Renaissance in Medicine - Strata - NoSQL and Genomics
 
2014.06.16 - BGI - Genomics BigData Workloads - Shenzhen China
2014.06.16 - BGI - Genomics BigData Workloads - Shenzhen China2014.06.16 - BGI - Genomics BigData Workloads - Shenzhen China
2014.06.16 - BGI - Genomics BigData Workloads - Shenzhen China
 
2014.06.30 - Renaissance in Medicine - Singapore Management University - Data...
2014.06.30 - Renaissance in Medicine - Singapore Management University - Data...2014.06.30 - Renaissance in Medicine - Singapore Management University - Data...
2014.06.30 - Renaissance in Medicine - Singapore Management University - Data...
 
R + Storm Moneyball - Realtime Advanced Statistics - Hadoop Summit - San Jose
R + Storm Moneyball - Realtime Advanced Statistics - Hadoop Summit - San JoseR + Storm Moneyball - Realtime Advanced Statistics - Hadoop Summit - San Jose
R + Storm Moneyball - Realtime Advanced Statistics - Hadoop Summit - San Jose
 
Human Genetics & Big Data [sans Ethics]
Human Genetics & Big Data [sans Ethics]Human Genetics & Big Data [sans Ethics]
Human Genetics & Big Data [sans Ethics]
 
Building Data Science Teams, Abbreviated
Building Data Science Teams, AbbreviatedBuilding Data Science Teams, Abbreviated
Building Data Science Teams, Abbreviated
 
Genomics Crash Course for Data Engineers
Genomics Crash Course for Data EngineersGenomics Crash Course for Data Engineers
Genomics Crash Course for Data Engineers
 
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
20140228 - Singapore - BDAS - Ensuring Hadoop Production Success
 
20131212 - Sydney - Garvan Institute - Human Genetics and Big Data
20131212 - Sydney - Garvan Institute - Human Genetics and Big Data20131212 - Sydney - Garvan Institute - Human Genetics and Big Data
20131212 - Sydney - Garvan Institute - Human Genetics and Big Data
 
2013.12.12 - Sydney - Big Data Analytics
2013.12.12 - Sydney - Big Data Analytics2013.12.12 - Sydney - Big Data Analytics
2013.12.12 - Sydney - Big Data Analytics
 
20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns20131011 - Los Gatos - Netflix - Big Data Design Patterns
20131011 - Los Gatos - Netflix - Big Data Design Patterns
 
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
 

Dernier

Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...
Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...
Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...
Sheetaleventcompany
 
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan 087776558899
 
👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...
👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...
👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...
Sheetaleventcompany
 
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Sheetaleventcompany
 
Control of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronicControl of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronic
MedicoseAcademics
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
Sheetaleventcompany
 

Dernier (20)

Shazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdfShazia Iqbal 2024 - Bioorganic Chemistry.pdf
Shazia Iqbal 2024 - Bioorganic Chemistry.pdf
 
Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...
Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...
Dehradun Call Girls Service {8854095900} ❤️VVIP ROCKY Call Girl in Dehradun U...
 
💰Call Girl In Bangalore☎️63788-78445💰 Call Girl service in Bangalore☎️Bangalo...
💰Call Girl In Bangalore☎️63788-78445💰 Call Girl service in Bangalore☎️Bangalo...💰Call Girl In Bangalore☎️63788-78445💰 Call Girl service in Bangalore☎️Bangalo...
💰Call Girl In Bangalore☎️63788-78445💰 Call Girl service in Bangalore☎️Bangalo...
 
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
Cara Menggugurkan Kandungan Dengan Cepat Selesai Dalam 24 Jam Secara Alami Bu...
 
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
Ahmedabad Call Girls Book Now 9630942363 Top Class Ahmedabad Escort Service A...
 
Kolkata Call Girls Shobhabazar 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Gir...
Kolkata Call Girls Shobhabazar  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Gir...Kolkata Call Girls Shobhabazar  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Gir...
Kolkata Call Girls Shobhabazar 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Gir...
 
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsAppMost Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
 
Kolkata Call Girls Naktala 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
Kolkata Call Girls Naktala  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Girl Se...Kolkata Call Girls Naktala  💯Call Us 🔝 8005736733 🔝 💃  Top Class Call Girl Se...
Kolkata Call Girls Naktala 💯Call Us 🔝 8005736733 🔝 💃 Top Class Call Girl Se...
 
🚺LEELA JOSHI WhatsApp Number +91-9930245274 ✔ Unsatisfied Bhabhi Call Girls T...
🚺LEELA JOSHI WhatsApp Number +91-9930245274 ✔ Unsatisfied Bhabhi Call Girls T...🚺LEELA JOSHI WhatsApp Number +91-9930245274 ✔ Unsatisfied Bhabhi Call Girls T...
🚺LEELA JOSHI WhatsApp Number +91-9930245274 ✔ Unsatisfied Bhabhi Call Girls T...
 
👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...
👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...
👉Chandigarh Call Girl Service📲Niamh 8868886958 📲Book 24hours Now📲👉Sexy Call G...
 
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
Jaipur Call Girl Service 📞9xx000xx09📞Just Call Divya📲 Call Girl In Jaipur No💰...
 
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
❤️Call Girl Service In Chandigarh☎️9814379184☎️ Call Girl in Chandigarh☎️ Cha...
 
Control of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronicControl of Local Blood Flow: acute and chronic
Control of Local Blood Flow: acute and chronic
 
Intramuscular & Intravenous Injection.pptx
Intramuscular & Intravenous Injection.pptxIntramuscular & Intravenous Injection.pptx
Intramuscular & Intravenous Injection.pptx
 
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptxANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF RESPIRATORY SYSTEM.pptx
 
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptxANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
ANATOMY AND PHYSIOLOGY OF REPRODUCTIVE SYSTEM.pptx
 
Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...
Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...
Call Girls Bangalore - 450+ Call Girl Cash Payment 💯Call Us 🔝 6378878445 🔝 💃 ...
 
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service AvailableCall Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
Call Girls Shahdol Just Call 8250077686 Top Class Call Girl Service Available
 
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
💚Call Girls In Amritsar 💯Anvi 📲🔝8725944379🔝Amritsar Call Girl No💰Advance Cash...
 
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
Bandra East [ best call girls in Mumbai Get 50% Off On VIP Escorts Service 90...
 

20170428 - Look to Precision Agriculture to Bootstrap Precision Medicine - Culver City - DataScience.com

  • 1. Oxford Nanopore SmidgION DNA-IoT Interdiction: ● Epidemics ● Poaching/Smuggling ● Acute Lethal Infections
  • 2. DeepDream (wikipedia) is a computer vision program created by Google which uses a convolutional neural network to find and enhance patterns in images via algorithmic pareidolia[1], thus creating a dream-like hallucinogenic appearance in the deliberately over-processed images. A late-stage DeepDream processed photograph of three men in a pool. [1]Pareidolia is a psychological phenomenon in which the mind responds to a stimulus (an image or a sound) by perceiving a familiar pattern where none exists.
  • 3. Allen Day, PhD // Science Advocate // @allenday // #genomics #ml #datascience
  • 4. GOOGLE CONFIDENTIAL Google Cloud Run your apps on the same system as Google
  • 5. Table of Contents Introduction Precision Medicine: an Informed Opinion Section 1 Deep Learning Concepts Section 2 Deep Learning @ Genomic Analysis Section 3 Deep Learning @ Precision Agriculture
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 13. Deep Neural Networks: Algorithms that Learn ● Modernization of artificial neural networks ● Made of of simple mathematical units, organized in layers, that together can compute some (arbitrary) function ● more layers = deeper = more general ● Learn from raw, heterogeneous data
  • 14. * Human Performance based on analysis done by Andrej Karpathy. More details here. Image understanding is (getting) better than human level ImageNet Challenge: Given an image, predict one of 1000+ of classes %errors
  • 15. “Given an image, predict one of 1000+ of classes” Image credit: 360phot0.blogspot.com ImageNet Challenge
  • 16. Transfer Learning Quickly able to Learn New Concepts “t-rex”“quidditch” Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images 2015
  • 17. Style Transfer Learn features from one dataset, apply them to another Can be done within domain: Image Labels => New Image Classes And between domains: Image Features => Image Filters Image Labels + Language Model => Image Captions Show and Tell: A Neural Image Caption Generator 2015
  • 19. Released in Nov. 2015 #1 repository for “machine learning” category on GitHub TensorFlow
  • 21. Google Cloud Platform Marker-Assisted Breeding Rapidly Increases Frequency of Favorable Genes https://www.slideshare.net/finance28/monsanto-082305a
  • 22. Yield needs to increase by 3% per year to match GDP growth
  • 23. Marker-assisted selection for quantitative traits https://www.sec.gov/Archives/edgar/data/1110783/0000950134 02011773/c71992exv99w2.htm
  • 25. Select & Recombine Grow Generate Marker Fingerprint Sample tissue Extract DNAModel Data & Identify desirable carriers Marker-Assisted Breeding Rapidly Increases Frequency of Favorable Genes
  • 26. Genomics & Genetics Problems: How to Start Applying DNNs? Must-haves for deep learning: ● Lots of data: >50k examples, >1M examples ideal ● High-quality input and labels for training ● Label ~ F(data) unknown but certainly function exists ● High-quality prev. efforts so we know that DNNs are key ○ i.e. hard to solve with classical statistical approaches SNP and indel calling from NGS data
  • 27. Verily | Confidential & Proprietary Calling genetic variation may seem easy...
  • 28. Verily | Confidential & Proprietary ... but lots of places in the genome are difficult
  • 29. Creating a universal SNP and small indel variant caller with deep neural networks Ryan Poplin, Cory McLean, Dan Newburger, Jojo Dijamco, Nam Nguyen, Dion Loy, Sam Gross, Madeleine Cule, Peyton Greenside, Justin Zook, Marc Salit, Mark DePristo, Verily Life Sciences, October 2016
  • 30. DNN (Inception V3) Predicts True Genotype from Pileup Images { 0.001, 0.994, 0.005 } { 0.001, 0.990, 0.009 } { 0.000, 0.001, 0.999 } { 0.600, 0.399, 0.001 } Output: Probability of diploid genotype states { HOM_REF, HET, HOM_VAR } Raw pixels Input: Millions of labeled pileup images from gold standard samples
  • 31. Verily | Confidential & Proprietary Using deep learning for ultra-accurate mutation detection Input: Millions of labeled pileup image stacks from gold standard sample Raw pixels { 0.001, 0.994, 0.005 } { 0.001, 0.990, 0.009 } { 0.000, 0.001, 0.999 } { 0.600, 0.399, 0.001 } Output: Probability distribution over the three diploid genotype states { HOM_REF, HET, HOM_VAR } 31
  • 32. Verily | Confidential & Proprietary Example DNA read pileup “images” true snps true indels false variants red = {A,C,G,T}. green = {quality score}. blue = {read strand}. alpha = {matches ref genome}.
  • 33. Verily | Confidential & Proprietary PrecisionFDA: unique opportunity with blinded truth sample NA12878
  • 35. Select & Recombine Grow Generate Marker Fingerprint Sample tissue Extract DNAModel Data & Identify desirable carriers Marker-Assisted Breeding Rapidly Increases Frequency of Favorable Genes DNA sequencing is no longer the bottleneck...
  • 36. Select & Recombine Grow Generate Marker Fingerprint Sample tissue Extract DNAModel Data & Identify desirable carriers Marker-Assisted Breeding Rapidly Increases Frequency of Favorable Genes Leading to increased investment in machine learning DNA sequencing is no longer the bottleneck...
  • 37. Select & Recombine Grow Generate Marker Fingerprint Sample tissue Extract DNAModel Data & Identify desirable carriers Marker-Assisted Breeding Rapidly Increases Frequency of Favorable Genes Increased investment in machine learning… ...requires more data and other data types
  • 39. anezconsulting.com/precision-agronomy/ Agronometric Integration ● Satellite & UAV Images ● Geological Data ● Meteorological & Sensor Data ● Cultivar Data ● Other GIS Data ● Yield Data
  • 42. Bootstrapping a Virtuous Cycle ● Increased profit (from risk modeling) leads to increased investment and risk reduction in the form of: ● More accurate forecasting / engineering of climate ○ Collect & model more meteorological data ● Development of crop varieties to complement future terrestrial / climate conditions ● High-precision placement and monitoring of individual plants ○ Autonomous planting ○ remote sensing
  • 43.
  • 44. + =
  • 47. Mapping the Diversity of Maize Races in Mexico http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0114657
  • 48.
  • 49.
  • 50.
  • 51.
  • 52. Why Cannabis? ● Intellectual Property - No patented genes or strains… yet ● Update Mar 18, 2017: US PTO issues trademark for Gorilla Glue #4 ● Production - Breeding is highly fragmented… for now ● However, unclear that breeding will centralize due to cheap DNA sequencing and digital phenotyping ● Distribution (Growing) - Most likely to centralize due to economies of scale (e.g. multi-tenant greenhouses), and already crowded, wtf? ● Market Access - Unclear that this is a viable segment of supply chain (see GG#4 above). Also self-replication property of plants...
  • 53. Why Cannabis? ● Intellectual Property - No patented genes or strains… yet ● Update Mar 18, 2017: US PTO issues trademark for Gorilla Glue #4 ● Production - Breeding is highly fragmented… for now ● However, unclear that breeding will centralize due to cheap DNA sequencing and digital phenotyping ● Distribution (Growing) - Most likely to centralize due to economies of scale (e.g. multi-tenant greenhouses), and already crowded, wtf? ● Market Access - Unclear that this is a viable segment of supply chain (see GG#4 above). Also self-replication property of plants... ● Threat: does Cannabis become like Yogurt starter kits?
  • 54. Cannabis Genomics @ Google Cloud https://cloud.google.com/bigquery/public-data/1000-cannabis
  • 55.
  • 56. Build What’s Next Thank You! Allen Day, PhD // Science Advocate // @allenday // #genomics #ml #datascience