SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
Applications of Machine Learning
for Materials Discovery at NREL
Caleb Phillips, Ph.D.
Data Analysis and Visualization
Computational Sciences Center
National Renewable Energy Laboratory
NREL | 2
Modern “Full Stack” Materials Science
Synthesis
Characterization
Computation
Golden, Colorado
NREL | 3
Skeptics Allowed
“I’ll admit it, there may be something to
this ‘big data’ and ‘machine learning’
thing everyone keeps talking about.”
- Anonymous cynic (2017)
What changed?
• Computational power
• Deep Neural Networks
• Cheap storage, big data
• Increasing adoption/investment
NREL | 4
Setting Realistic Expectations
Machine learning &
Deep Learning
Image source: Gartner.com, Aug. 2017
NREL | 5
Overview of the talk
Compelling examples of materials-oriented machine learning at
work at NREL:
• Improving the throughput of experimentation:
• Interpretation: Accelerate the data->knowledge path
• Automation: Replace onerous manual tasks
• Prediction: Predict properties not measured
• Augmenting or replacing DFT simulations in candidate screening
• Prediction: End-to-end deep learning on molecular and
atomistic structures
Will cover applications at a high level – talk to or email me for more info
} Do work
faster with
more insight
} Focus work,
avoid some
altogether
NREL | 6
But first, the Data
The Materials Project
http://materials.nrel.gov
http://organiceletronics.nrel.gov
http://htem.nrel.gov
{
{ {
Experimental
Theoretical
Both
NREL | 7
Example: Experimental Materials Discovery
Taylor et al. Adv. Funct. Mater. 18, 3169 (2008)
% In
Conductivity of Annealed InZnO
Goal: Make a PhD Thesis Amount of Analysis a Routine Activity
Composition
Structure
Property
Process
Slide credit: John Perkins
NREL | 8
Application Driven (High Throughput) Materials Discovery at NREL
Input:
Theoretical
calculations
Combinatorial
synthesis
Spatially resolved
characterization
Output:
Application driven
optimization
NREL | 9
Application Driven (High Throughput) Materials Discovery at NREL
Input:
Theoretical
calculations
Combinatorial
synthesis
Spatially resolved
characterization
Output:
Application driven
optimization
AI/ML
Opportunities
Improve fidelity
Guided search
Faster screening
Automation
Visualization
Property Prediction{
NREL | 10
Initial motivating problem: Accelerate Slow Analysis Tasks
880 Unanalyzed XRD Patterns (Data)
1 Structure Phase Map (Knowledge)
Slow
NREL | 11
Clustering by structure and composition
Samples
Results
Extracted Data Set
~ 1000 XRD patterns
Spectral Clustering
NNLS Decomposition
Apply Machine Learning to Determine Clusters in XRD Patterns
Fast: ~ 30 seconds on a laptop
Calculated XRD Patterns
NREL | 12
Automatic band gap calculation
Goal: replace highly subjective manual
Process with something scalable, automated,
and (more) accurate.
Combining experimental and theoretical data compare properties across a
wide landscape of materials systems and synthesis conditions.
Schwarting et al., Materials Discovery (2018)
NREL | 13
Application Driven (High Throughput) Materials Discovery at NREL
Input:
Theoretical
calculations
Combinatorial
synthesis
Spatially resolved
characterization
Output:
Application driven
optimization
AI/ML
Opportunities
Improve fidelity
Guided search
Faster screening
Automation
Visualization
Property Prediction{
NREL | 14
High throughput screening using computational results
Constraints
Molecule
Generator
Predictive
(Machine Learning)
Model
Simulation on
Supercomputer
$$$
Results
Database
OR
Best candidates
All candidates
(sequentially)
Visualization &
Analysis
Materials
Synthesis
$$$$$
Measurement
and Validation
New
Materials
Theoretical
Experimental
Training
on
Past Results
Phillips et al. CoDA (2016)
NREL | 15
Predict opto-electric properties of molecules
Support Vector Regression (SVR) performance when predicting calculated band gap. Residual
error is linear and normally distributed. Median error is effectively zero, RMSE is 0.25 EV or
less for most scenarios.
First try: learn using
molecular descriptors
(traditional feature
engineering)
2 million candidates
NREL | 16
End-to-end Learning: Skip the feature extraction
Image Recognition: Convolutional
Neural Networks (CNNs)
O
Message
Passing
Blocks
Node Recurrent Units
Node
Embedding
Layer
Graph
Output
Layer(s)
Dense
Regression
Layers
Predictions
Input
Graph
(Molecule)
Molecular Graphs: Message Passing Neural
Networks (MPNNs)Gilmer et al., CoRR (2017)
Key hypothesis: model
can learn which features
are important directly from
structure.
NREL | 17
End-to-end Learning: Skip the feature extraction
Duplicate 1 (DFT)
MachineLearningPrediction
3-5x improvement over manually engineered features.
Accuracy approaching repeated-measures accuracy of DFT.
Gap 0.90
HOMO 1.05
LUMO 0.89
Spectral overlap 1.28
Polymer HOMO 1.24
Polymer LUMO 1.03
Polymer gap 1.19
Polymer optical LUMO 1.02
!"#$("&'ℎ)*+ ,+&-*)*.)
!"#$(012 3456)'&7+8)
St. John et al. https://arxiv.org/abs/1807.10363. (2018)
NREL | 18
Transfer learning and training set size
St. John et al. https://arxiv.org/abs/1807.10363. (2018)
NREL | 19
End-to-end learning for crystalline materials
Represent crystal structure as a graph
to allow end-to-end learning.
Kamdar. 2018. NREL/US DOE CSGF.
NREL | 20
Thanks to Many Collaborators
(and many funding sources)
Theory
Stephan Lany
Vladan Stevonvic
Aaron Holder
@ LBNL
Gerd Ceder
Kristin Persson
Data
Robert White
Kristin Munch
Peter Graf
@ NIST
Zachary Trautt
Robert Hanisch
Experiment
Andriy Zakutayev
John Perkins
Philip Parilla
David Ginley
Bill Tumas
Sebastian Siol
Lauren Garten
Elisabetta Arca
Matthew Taylor
@ NIST
Martin Green
Jae Hattrick-Simpers
Nam Nguyen
@ SLAC
Apurva Mehta
@ ANL
Debbie Myers
AI/ML
Jacob Hinkle
Marcus Schwarting
Peter St. John
@ Harvard
Harshil Kamdar Slide credit: John Perkins
NREL | 21
Selected Publications
Peter C. St. John, Caleb Phillips, Travis W. Kemper, A. Nolan Wilson,
Michael F. Crowley, Mark R. Nimlos, Ross E. Larsen.
Message-passing neural networks for high-throughput polymer screening.
In submission. ArXiv preprint: https://arxiv.org/abs/1807.10363
Marcus Schwarting, Sebastian Siol, Kevin Talley, Andriy Zakutayev, Caleb Phillips.
Automated algorithms for band gap analysis from optical absorption spectra.
Materials Discovery, April 18, 2018. https://doi.org/10.1016/j.md.2018.04.003
Andriy Zakutayev, Nick Wunder, Marcus Schwarting, John Perkins, Robert White,
Kristin Munch, William Tumas, and Caleb Phillips.
An open experimental database for exploring inorganic materials.
Nature. Scientific Data. April 3, 2018. https://www.nature.com/articles/sdata201853
Caleb Phillips, Ross Larson, Kristin Munch, Nikos Kopidakis.
Guided Search for Organic Photovoltaic Materials Using Predictive Data Modeling.
Conference on Data Analysis (CoDA) 2016. March 2-4, 2016. Santa Fe, New Mexico.
www.nrel.gov
Thank you
This work was authored by the National Renewable Energy Laboratory, operated by Alliance for Sustainable Energy,
LLC, for the U.S. Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. Funding provided by U.S.
Department of Energy Office of Energy Efficiency and Renewable Energy. The views expressed in the article do not
necessarily represent the views of the DOE or the U.S. Government. The U.S. Government retains and the publisher,
by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up,
irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for
U.S. Government purposes.
caleb.phillips@nrel.gov
NREL | 23
Save work: predict not-measured properties
• Electrical conductivity prediction using random forest model
• Training variables: chemical composition, XRD peak count, deposition conditions
• Training process: 10-fold cross-validation by withdrawing 25% sample libraries
• Training set: 16K data points varying by 9-10 orders of magnitude
Predicted vs Measured
Conductivity
Prediction accuracy for
Conductivity
Prediction accuracy of
1-2 orders of
magnitude, reasonable
for semiconductors
Zakutayev et al. Scientific Data 5 180053 (2018)
NREL | 24
What’s in my database?
tSne model can group
70K samples based on
similarity of their
chemical compositions
t-distributed stochastic neighbor embedding (tSne) dimensionality reduction model
Zakutayev et al. Scientific Data 5 180053 (2018)

Contenu connexe

Tendances

Tendances (20)

Machine Learning for Molecules: Lessons and Challenges of Data-Centric Chemistry
Machine Learning for Molecules: Lessons and Challenges of Data-Centric ChemistryMachine Learning for Molecules: Lessons and Challenges of Data-Centric Chemistry
Machine Learning for Molecules: Lessons and Challenges of Data-Centric Chemistry
 
TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...TMS workshop on machine learning in materials science: Intro to deep learning...
TMS workshop on machine learning in materials science: Intro to deep learning...
 
Virtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryVirtual Screening in Drug Discovery
Virtual Screening in Drug Discovery
 
molecular docking screnning. pptx
molecular docking screnning. pptxmolecular docking screnning. pptx
molecular docking screnning. pptx
 
An Introduction to Neural Architecture Search
An Introduction to Neural Architecture SearchAn Introduction to Neural Architecture Search
An Introduction to Neural Architecture Search
 
Relational knowledge distillation
Relational knowledge distillationRelational knowledge distillation
Relational knowledge distillation
 
Graph Attention Networks.pptx
Graph Attention Networks.pptxGraph Attention Networks.pptx
Graph Attention Networks.pptx
 
Retrosynthesis tutorial v2
Retrosynthesis tutorial v2Retrosynthesis tutorial v2
Retrosynthesis tutorial v2
 
Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)Automated Machine Learning (Auto ML)
Automated Machine Learning (Auto ML)
 
Neural Architecture Search: Learning How to Learn
Neural Architecture Search: Learning How to LearnNeural Architecture Search: Learning How to Learn
Neural Architecture Search: Learning How to Learn
 
DeepSMILES
DeepSMILESDeepSMILES
DeepSMILES
 
Explicit Density Models
Explicit Density ModelsExplicit Density Models
Explicit Density Models
 
Basics of QSAR Modeling
Basics of QSAR ModelingBasics of QSAR Modeling
Basics of QSAR Modeling
 
遺伝的アルゴリズムによるNクイーン問題の解法
遺伝的アルゴリズムによるNクイーン問題の解法遺伝的アルゴリズムによるNクイーン問題の解法
遺伝的アルゴリズムによるNクイーン問題の解法
 
Transfer Learning -- The Next Frontier for Machine Learning
Transfer Learning -- The Next Frontier for Machine LearningTransfer Learning -- The Next Frontier for Machine Learning
Transfer Learning -- The Next Frontier for Machine Learning
 
Denovo
DenovoDenovo
Denovo
 
PhD CV: Postdoctoral Research
PhD CV: Postdoctoral ResearchPhD CV: Postdoctoral Research
PhD CV: Postdoctoral Research
 
High throughput screening
High throughput screeningHigh throughput screening
High throughput screening
 
Warsaw Data Science - Factorization Machines Introduction
Warsaw Data Science -  Factorization Machines IntroductionWarsaw Data Science -  Factorization Machines Introduction
Warsaw Data Science - Factorization Machines Introduction
 
Introduction to Deep Learning
Introduction to Deep LearningIntroduction to Deep Learning
Introduction to Deep Learning
 

Similaire à Applications of Machine Learning for Materials Discovery at NREL

Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
Anubhav Jain
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
aimsnist
 

Similaire à Applications of Machine Learning for Materials Discovery at NREL (20)

The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...The Materials Project: A Community Data Resource for Accelerating New Materia...
The Materials Project: A Community Data Resource for Accelerating New Materia...
 
CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...
CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...
CLIM Program: Remote Sensing Workshop, High Performance Computing and Spatial...
 
2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model2D/3D Materials screening and genetic algorithm with ML model
2D/3D Materials screening and genetic algorithm with ML model
 
Going Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCFGoing Smart and Deep on Materials at ALCF
Going Smart and Deep on Materials at ALCF
 
Physics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learningPhysics inspired artificial intelligence/machine learning
Physics inspired artificial intelligence/machine learning
 
Materials discovery through theory, computation, and machine learning
Materials discovery through theory, computation, and machine learningMaterials discovery through theory, computation, and machine learning
Materials discovery through theory, computation, and machine learning
 
Discovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials ProjectDiscovering and Exploring New Materials through the Materials Project
Discovering and Exploring New Materials through the Materials Project
 
Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...Overview of accelerated materials design efforts in the Hacking Materials res...
Overview of accelerated materials design efforts in the Hacking Materials res...
 
Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...Computational Materials Design and Data Dissemination through the Materials P...
Computational Materials Design and Data Dissemination through the Materials P...
 
Learning Systems for Science
Learning Systems for ScienceLearning Systems for Science
Learning Systems for Science
 
NIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials DesignNIST-JARVIS infrastructure for Improved Materials Design
NIST-JARVIS infrastructure for Improved Materials Design
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...The Status of ML Algorithms for Structure-property Relationships Using Matb...
The Status of ML Algorithms for Structure-property Relationships Using Matb...
 
ME Synopsis
ME SynopsisME Synopsis
ME Synopsis
 
Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...Combining density functional theory calculations, supercomputing, and data-dr...
Combining density functional theory calculations, supercomputing, and data-dr...
 
Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...Software tools, crystal descriptors, and machine learning applied to material...
Software tools, crystal descriptors, and machine learning applied to material...
 
IEEE Datamining 2016 Title and Abstract
IEEE  Datamining 2016 Title and AbstractIEEE  Datamining 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
 
Data dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNLData dissemination and materials informatics at LBNL
Data dissemination and materials informatics at LBNL
 
The Interplay of Workflow Execution and Resource Provisioning
The Interplay of Workflow Execution and Resource ProvisioningThe Interplay of Workflow Execution and Resource Provisioning
The Interplay of Workflow Execution and Resource Provisioning
 
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and ApplicationsData Mining to Discovery for Inorganic Solids: Software Tools and Applications
Data Mining to Discovery for Inorganic Solids: Software Tools and Applications
 

Plus de aimsnist

A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...
aimsnist
 
Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...
aimsnist
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
aimsnist
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
aimsnist
 
Autonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionAutonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisition
aimsnist
 
Progress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science TextProgress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science Text
aimsnist
 

Plus de aimsnist (17)

Enabling Data Science Methods for Catalyst Design and Discovery
Enabling Data Science Methods for Catalyst Design and DiscoveryEnabling Data Science Methods for Catalyst Design and Discovery
Enabling Data Science Methods for Catalyst Design and Discovery
 
A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...A Framework and Infrastructure for Uncertainty Quantification and Management ...
A Framework and Infrastructure for Uncertainty Quantification and Management ...
 
Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...Predicting local atomic structures from X-ray absorption spectroscopy using t...
Predicting local atomic structures from X-ray absorption spectroscopy using t...
 
Smart Metrics for High Performance Material Design
Smart Metrics for High Performance Material DesignSmart Metrics for High Performance Material Design
Smart Metrics for High Performance Material Design
 
Graphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials ScienceGraphs, Environments, and Machine Learning for Materials Science
Graphs, Environments, and Machine Learning for Materials Science
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
The MGI and AI
The MGI and AIThe MGI and AI
The MGI and AI
 
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
Failing Fastest: What an Effective HTE and ML Workflow Enables for Functional...
 
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
How to Leverage Artificial Intelligence to Accelerate Data Collection and Ana...
 
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses FasterCoupling AI with HiTp experiments to Discover Metallic Glasses Faster
Coupling AI with HiTp experiments to Discover Metallic Glasses Faster
 
Autonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisitionAutonomous experimental phase diagram acquisition
Autonomous experimental phase diagram acquisition
 
Pathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of MaterialsPathways Towards a Hierarchical Discovery of Materials
Pathways Towards a Hierarchical Discovery of Materials
 
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum DataAutomated Generation of High-accuracy Interatomic Potentials Using Quantum Data
Automated Generation of High-accuracy Interatomic Potentials Using Quantum Data
 
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
Polymer Genome: An Informatics Platform for Polymer Dielectrics Discovery and...
 
Materials Data in Action
Materials Data in ActionMaterials Data in Action
Materials Data in Action
 
Combinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials DiscoveryCombinatorial Experimentation and Machine Learning for Materials Discovery
Combinatorial Experimentation and Machine Learning for Materials Discovery
 
Progress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science TextProgress in Natural Language Processing of Materials Science Text
Progress in Natural Language Processing of Materials Science Text
 

Dernier

notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
MsecMca
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Kandungan 087776558899
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
Epec Engineered Technologies
 

Dernier (20)

Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects2016EF22_0 solar project report rooftop projects
2016EF22_0 solar project report rooftop projects
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Ramesh Nagar Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak HamilCara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
Cara Menggugurkan Sperma Yang Masuk Rahim Biyar Tidak Hamil
 
Employee leave management system project.
Employee leave management system project.Employee leave management system project.
Employee leave management system project.
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Work-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptxWork-Permit-Receiver-in-Saudi-Aramco.pptx
Work-Permit-Receiver-in-Saudi-Aramco.pptx
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
DC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equationDC MACHINE-Motoring and generation, Armature circuit equation
DC MACHINE-Motoring and generation, Armature circuit equation
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
Standard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power PlayStandard vs Custom Battery Packs - Decoding the Power Play
Standard vs Custom Battery Packs - Decoding the Power Play
 

Applications of Machine Learning for Materials Discovery at NREL

  • 1. Applications of Machine Learning for Materials Discovery at NREL Caleb Phillips, Ph.D. Data Analysis and Visualization Computational Sciences Center National Renewable Energy Laboratory
  • 2. NREL | 2 Modern “Full Stack” Materials Science Synthesis Characterization Computation Golden, Colorado
  • 3. NREL | 3 Skeptics Allowed “I’ll admit it, there may be something to this ‘big data’ and ‘machine learning’ thing everyone keeps talking about.” - Anonymous cynic (2017) What changed? • Computational power • Deep Neural Networks • Cheap storage, big data • Increasing adoption/investment
  • 4. NREL | 4 Setting Realistic Expectations Machine learning & Deep Learning Image source: Gartner.com, Aug. 2017
  • 5. NREL | 5 Overview of the talk Compelling examples of materials-oriented machine learning at work at NREL: • Improving the throughput of experimentation: • Interpretation: Accelerate the data->knowledge path • Automation: Replace onerous manual tasks • Prediction: Predict properties not measured • Augmenting or replacing DFT simulations in candidate screening • Prediction: End-to-end deep learning on molecular and atomistic structures Will cover applications at a high level – talk to or email me for more info } Do work faster with more insight } Focus work, avoid some altogether
  • 6. NREL | 6 But first, the Data The Materials Project http://materials.nrel.gov http://organiceletronics.nrel.gov http://htem.nrel.gov { { { Experimental Theoretical Both
  • 7. NREL | 7 Example: Experimental Materials Discovery Taylor et al. Adv. Funct. Mater. 18, 3169 (2008) % In Conductivity of Annealed InZnO Goal: Make a PhD Thesis Amount of Analysis a Routine Activity Composition Structure Property Process Slide credit: John Perkins
  • 8. NREL | 8 Application Driven (High Throughput) Materials Discovery at NREL Input: Theoretical calculations Combinatorial synthesis Spatially resolved characterization Output: Application driven optimization
  • 9. NREL | 9 Application Driven (High Throughput) Materials Discovery at NREL Input: Theoretical calculations Combinatorial synthesis Spatially resolved characterization Output: Application driven optimization AI/ML Opportunities Improve fidelity Guided search Faster screening Automation Visualization Property Prediction{
  • 10. NREL | 10 Initial motivating problem: Accelerate Slow Analysis Tasks 880 Unanalyzed XRD Patterns (Data) 1 Structure Phase Map (Knowledge) Slow
  • 11. NREL | 11 Clustering by structure and composition Samples Results Extracted Data Set ~ 1000 XRD patterns Spectral Clustering NNLS Decomposition Apply Machine Learning to Determine Clusters in XRD Patterns Fast: ~ 30 seconds on a laptop Calculated XRD Patterns
  • 12. NREL | 12 Automatic band gap calculation Goal: replace highly subjective manual Process with something scalable, automated, and (more) accurate. Combining experimental and theoretical data compare properties across a wide landscape of materials systems and synthesis conditions. Schwarting et al., Materials Discovery (2018)
  • 13. NREL | 13 Application Driven (High Throughput) Materials Discovery at NREL Input: Theoretical calculations Combinatorial synthesis Spatially resolved characterization Output: Application driven optimization AI/ML Opportunities Improve fidelity Guided search Faster screening Automation Visualization Property Prediction{
  • 14. NREL | 14 High throughput screening using computational results Constraints Molecule Generator Predictive (Machine Learning) Model Simulation on Supercomputer $$$ Results Database OR Best candidates All candidates (sequentially) Visualization & Analysis Materials Synthesis $$$$$ Measurement and Validation New Materials Theoretical Experimental Training on Past Results Phillips et al. CoDA (2016)
  • 15. NREL | 15 Predict opto-electric properties of molecules Support Vector Regression (SVR) performance when predicting calculated band gap. Residual error is linear and normally distributed. Median error is effectively zero, RMSE is 0.25 EV or less for most scenarios. First try: learn using molecular descriptors (traditional feature engineering) 2 million candidates
  • 16. NREL | 16 End-to-end Learning: Skip the feature extraction Image Recognition: Convolutional Neural Networks (CNNs) O Message Passing Blocks Node Recurrent Units Node Embedding Layer Graph Output Layer(s) Dense Regression Layers Predictions Input Graph (Molecule) Molecular Graphs: Message Passing Neural Networks (MPNNs)Gilmer et al., CoRR (2017) Key hypothesis: model can learn which features are important directly from structure.
  • 17. NREL | 17 End-to-end Learning: Skip the feature extraction Duplicate 1 (DFT) MachineLearningPrediction 3-5x improvement over manually engineered features. Accuracy approaching repeated-measures accuracy of DFT. Gap 0.90 HOMO 1.05 LUMO 0.89 Spectral overlap 1.28 Polymer HOMO 1.24 Polymer LUMO 1.03 Polymer gap 1.19 Polymer optical LUMO 1.02 !"#$("&'ℎ)*+ ,+&-*)*.) !"#$(012 3456)'&7+8) St. John et al. https://arxiv.org/abs/1807.10363. (2018)
  • 18. NREL | 18 Transfer learning and training set size St. John et al. https://arxiv.org/abs/1807.10363. (2018)
  • 19. NREL | 19 End-to-end learning for crystalline materials Represent crystal structure as a graph to allow end-to-end learning. Kamdar. 2018. NREL/US DOE CSGF.
  • 20. NREL | 20 Thanks to Many Collaborators (and many funding sources) Theory Stephan Lany Vladan Stevonvic Aaron Holder @ LBNL Gerd Ceder Kristin Persson Data Robert White Kristin Munch Peter Graf @ NIST Zachary Trautt Robert Hanisch Experiment Andriy Zakutayev John Perkins Philip Parilla David Ginley Bill Tumas Sebastian Siol Lauren Garten Elisabetta Arca Matthew Taylor @ NIST Martin Green Jae Hattrick-Simpers Nam Nguyen @ SLAC Apurva Mehta @ ANL Debbie Myers AI/ML Jacob Hinkle Marcus Schwarting Peter St. John @ Harvard Harshil Kamdar Slide credit: John Perkins
  • 21. NREL | 21 Selected Publications Peter C. St. John, Caleb Phillips, Travis W. Kemper, A. Nolan Wilson, Michael F. Crowley, Mark R. Nimlos, Ross E. Larsen. Message-passing neural networks for high-throughput polymer screening. In submission. ArXiv preprint: https://arxiv.org/abs/1807.10363 Marcus Schwarting, Sebastian Siol, Kevin Talley, Andriy Zakutayev, Caleb Phillips. Automated algorithms for band gap analysis from optical absorption spectra. Materials Discovery, April 18, 2018. https://doi.org/10.1016/j.md.2018.04.003 Andriy Zakutayev, Nick Wunder, Marcus Schwarting, John Perkins, Robert White, Kristin Munch, William Tumas, and Caleb Phillips. An open experimental database for exploring inorganic materials. Nature. Scientific Data. April 3, 2018. https://www.nature.com/articles/sdata201853 Caleb Phillips, Ross Larson, Kristin Munch, Nikos Kopidakis. Guided Search for Organic Photovoltaic Materials Using Predictive Data Modeling. Conference on Data Analysis (CoDA) 2016. March 2-4, 2016. Santa Fe, New Mexico.
  • 22. www.nrel.gov Thank you This work was authored by the National Renewable Energy Laboratory, operated by Alliance for Sustainable Energy, LLC, for the U.S. Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. Funding provided by U.S. Department of Energy Office of Energy Efficiency and Renewable Energy. The views expressed in the article do not necessarily represent the views of the DOE or the U.S. Government. The U.S. Government retains and the publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work, or allow others to do so, for U.S. Government purposes. caleb.phillips@nrel.gov
  • 23. NREL | 23 Save work: predict not-measured properties • Electrical conductivity prediction using random forest model • Training variables: chemical composition, XRD peak count, deposition conditions • Training process: 10-fold cross-validation by withdrawing 25% sample libraries • Training set: 16K data points varying by 9-10 orders of magnitude Predicted vs Measured Conductivity Prediction accuracy for Conductivity Prediction accuracy of 1-2 orders of magnitude, reasonable for semiconductors Zakutayev et al. Scientific Data 5 180053 (2018)
  • 24. NREL | 24 What’s in my database? tSne model can group 70K samples based on similarity of their chemical compositions t-distributed stochastic neighbor embedding (tSne) dimensionality reduction model Zakutayev et al. Scientific Data 5 180053 (2018)