SlideShare une entreprise Scribd logo
1  sur  19
Télécharger pour lire hors ligne
Evgeny Blokhin
Chelyabinsk SUSU’2013 summer workshop
Max-Planck Institute for Solid State Research
Stuttgart, Germany
Materials informatics
Outlook
1. Data-mining in materials science
2. Blue Obelisk
3. Python programming language
What is data-mining?
statistics
databases
information theory machine learning
artificial intelligence
optimization
Data
mining
Tasks of data-mining
1. Classification
2. Prognosing
3. Visualization
4. Reasoning
5. Analysis
6. Expert systems
Big data in materials science
EXAMPLE: nearly for the last 4 years
with my colleagues-theoreticians we produced:
over 9000 simulation output files
over 50 articles
1. Accelrys Pipeline Pilot and Materials Studio, http://accelrys.com/products
2. AFLOW framework and Aflowlib.org repository, http://www.aflowlib.org
3. AIDA, Bosch LLC
4. Blue Obelisk Data Repository (XSLT, XML), http://bodr.sourceforge.net
5. CCLib (Python), http://cclib.sf.net
6. CDF (Python), http://kitchingroup.cheme.cmu.edu/cdf
7. CMR (Python), https://wiki.fysik.dtu.dk/cmr
8. Comp. Chem. Comparison and Benchmark Database, http://cccbdb.nist.gov
9. cctbx: Computational Crystallography Toolbox, http://cctbx.sourceforge.net
10. ESTEST (Python, XQuery), http://estest.ucdavis.edu
11. J-ICE online viewer (based on Jmol, Java), http://j-ice.sourceforge.net
12. Materials Project (Python), http://www.materialsproject.org
13. PAULING FILE world largest database for inorganic compounds, http://paulingfile.com
14. Quixote, http://quixote.wikispot.org
15. Scipio (Java), https://scipio.iciq.es
16. WebMO: Web-based interface to computational chemistry packages (Java,
Perl), http://webmo.net
New type of modeling software
…and smart codes
ENCUT = 500
IBRION = 2
ISIF = 3
NSW = 20
IDIOT = 3
NELMIN = 5
EDIFF = 1.0e-08
EDIFFG = -1.0e-08
IALGO = 38
ISMEAR = 0
LREAL = .FALSE.
LWAVE = .FALSE.
*** VASP MASTER: I AM SURE YOU KNOW WHAT
YOU ARE DOING ***
d-metal oxides
band gap problem
standard DFT GGA
approach
Hartree-Fock
admixing
LCAO
approximation
Usage of Gaussian
basis sets
good atomization
energy
Example of inference over an ontology
Open data, open standards, open source in
chemistry
Open data, open standards, open source in
chemistry
1.Elsevier, Wiley, Springer publishers are “evil”
2.“The right to read is right to mine”
3.“Jailbreaking” the scientific data from PDFs:
access, reuse, integrity
4.Why the level of collaboration is so low?
Materials Project
Prof. G. Ceder,
MIT, Boston
Guido van Rossum,
Google, Dropbox
http://goo.gl/FtFS7h
Python programming language
Advantages of Python
Syntax: tabulation, syntactic sugar, speech-
like, flexibility, expression
VERY fast prototyping
Great popularity in scientific community
100% cross-platform and portable
Disadvantages of Python
Relatively slow speed comparing to compiled
languages like C++ or Fortran
Global Interpreter Lock (GIL)
Historically not popular in some narrow
scientific areas (“reigns” of Java)
Two examples
list = [x**2 for x in range(10)]
numbers = [10, 4, 2, -1, 6]
filter(lambda x: x < 5, numbers)
1. Multi-dimensional array manipulation (fast!)
2. Discrete fourier transform
3. Linear Algebra
4. Mathematical functions
5. Matrix library
6. Polynomials
7. Set routines
8. Sorting, searching and counting
9. Statistics
eigvals, eigvecs = numpy.linalg.eigh(dynmat)
Solving eigenvalue problem for a
dynamical matrix (phonopy code):

Contenu connexe

Tendances

Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...GigaScience, BGI Hong Kong
 
What is DataCite-screenshots
What is DataCite-screenshotsWhat is DataCite-screenshots
What is DataCite-screenshotsdatacite
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataHerbert Van de Sompel
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceRaul Palma
 
Proposal for Text Mining PubAg
Proposal for Text Mining PubAgProposal for Text Mining PubAg
Proposal for Text Mining PubAgJake Lever
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data RepositoriesHeinz Pampel
 
Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningRoss Mounce
 
BioHackathon 2010 Intro
BioHackathon 2010 IntroBioHackathon 2010 Intro
BioHackathon 2010 IntroBrad Chapman
 
2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurghJun Zhao
 
Using Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-SwitchboardUsing Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-Switchboardamiraryani
 
Science Commons Open Notebook Science Talk
Science Commons Open Notebook Science TalkScience Commons Open Notebook Science Talk
Science Commons Open Notebook Science TalkJean-Claude Bradley
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the partsCarole Goble
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Alejandra Gonzalez-Beltran
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014Dag Endresen
 

Tendances (20)

Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
Lisa Johnson at #ICG13: Re-assembly, quality evaluation, and annotation of 67...
 
Peer Review and Science2.0
Peer Review and Science2.0Peer Review and Science2.0
Peer Review and Science2.0
 
What is DataCite-screenshots
What is DataCite-screenshotsWhat is DataCite-screenshots
What is DataCite-screenshots
 
MESUR: Making sense and use of usage data
MESUR: Making sense and use of usage dataMESUR: Making sense and use of usage data
MESUR: Making sense and use of usage data
 
Aspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth ScienceAspects of Reproducibility in Earth Science
Aspects of Reproducibility in Earth Science
 
Proposal for Text Mining PubAg
Proposal for Text Mining PubAgProposal for Text Mining PubAg
Proposal for Text Mining PubAg
 
Columbia ONS Archiving May09
Columbia ONS Archiving May09Columbia ONS Archiving May09
Columbia ONS Archiving May09
 
re3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositoriesre3data.org – Registry of Research Data Repositories
re3data.org – Registry of Research Data Repositories
 
FAIRy Stories
FAIRy StoriesFAIRy Stories
FAIRy Stories
 
Workshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data miningWorkshop 5: Uptake of, and concepts in text and data mining
Workshop 5: Uptake of, and concepts in text and data mining
 
BioHackathon 2010 Intro
BioHackathon 2010 IntroBioHackathon 2010 Intro
BioHackathon 2010 Intro
 
2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh2011 03-provenance-workshop-edingurgh
2011 03-provenance-workshop-edingurgh
 
UPennONS
UPennONSUPennONS
UPennONS
 
The Chemtools LaBLog
The Chemtools LaBLogThe Chemtools LaBLog
The Chemtools LaBLog
 
Using Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-SwitchboardUsing Neo4j for exploring the research graph connections made by RD-Switchboard
Using Neo4j for exploring the research graph connections made by RD-Switchboard
 
Science Commons Open Notebook Science Talk
Science Commons Open Notebook Science TalkScience Commons Open Notebook Science Talk
Science Commons Open Notebook Science Talk
 
Research Objects: more than the sum of the parts
Research Objects: more than the sum of the partsResearch Objects: more than the sum of the parts
Research Objects: more than the sum of the parts
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
NISO May 8 Webinar: An astronomy library's quest for greater access to litera...
NISO May 8 Webinar: An astronomy library's quest for greater access to litera...NISO May 8 Webinar: An astronomy library's quest for greater access to litera...
NISO May 8 Webinar: An astronomy library's quest for greater access to litera...
 
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
GBIF-Norway status for the 6th European GBIF nodes meeting April 2014
 

En vedette

Ab initio temperature phonons group theory
Ab initio temperature phonons group theoryAb initio temperature phonons group theory
Ab initio temperature phonons group theorySergey Sozykin
 
Electrochemistry perovskites defects
Electrochemistry perovskites defectsElectrochemistry perovskites defects
Electrochemistry perovskites defectsSergey Sozykin
 
Application of Al alloys
Application of Al alloysApplication of Al alloys
Application of Al alloysSergey Sozykin
 
Misfit layered compounds PbTa2
Misfit layered compounds PbTa2Misfit layered compounds PbTa2
Misfit layered compounds PbTa2Sergey Sozykin
 

En vedette (6)

Vaulin pohang 2010
Vaulin pohang 2010Vaulin pohang 2010
Vaulin pohang 2010
 
Ab initio temperature phonons group theory
Ab initio temperature phonons group theoryAb initio temperature phonons group theory
Ab initio temperature phonons group theory
 
Binary sigma phases
Binary sigma phasesBinary sigma phases
Binary sigma phases
 
Electrochemistry perovskites defects
Electrochemistry perovskites defectsElectrochemistry perovskites defects
Electrochemistry perovskites defects
 
Application of Al alloys
Application of Al alloysApplication of Al alloys
Application of Al alloys
 
Misfit layered compounds PbTa2
Misfit layered compounds PbTa2Misfit layered compounds PbTa2
Misfit layered compounds PbTa2
 

Similaire à Materials informatics

UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG: connecting the knowledge community
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureRoss Mounce
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8Scott Edmunds
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynoteCarole Goble
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliersaimsnist
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open scienceSarah Jones
 
NITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkNITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkJean-Claude Bradley
 
'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versaNathan Shammah
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Ross Mounce
 
re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data RepositoriesHeinz Pampel
 
Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013Jane Bromley
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenHeinz Pampel
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Carole Goble
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals FederationManjulaPatel
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemHerbert Van de Sompel
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeologyguest756e05
 
Blogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsBlogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsJeremy Frey
 

Similaire à Materials informatics (20)

UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
UKSG Conference 2017 Breakout - Advancing the Research Paper of the Future: c...
 
Open Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | FutureOpen Research Data: Licensing | Standards | Future
Open Research Data: Licensing | Standards | Future
 
HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8HKU Data Curation MLIM7350 Class 8
HKU Data Curation MLIM7350 Class 8
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
When The New Science Is In The Outliers
When The New Science Is In The OutliersWhen The New Science Is In The Outliers
When The New Science Is In The Outliers
 
Benefits and practice of open science
Benefits and practice of open scienceBenefits and practice of open science
Benefits and practice of open science
 
NITLE Open Notebook Science Talk
NITLE Open Notebook Science TalkNITLE Open Notebook Science Talk
NITLE Open Notebook Science Talk
 
'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa'Scikit-project': How open source is empowering open science – and vice versa
'Scikit-project': How open source is empowering open science – and vice versa
 
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
Specimen-level mining: bringing knowledge back 'home' to the Natural History ...
 
re3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositoriesre3data.org – a Registry of Research Data Repositories
re3data.org – a Registry of Research Data Repositories
 
OpenSciNY Open Notebook Science
OpenSciNY Open Notebook ScienceOpenSciNY Open Notebook Science
OpenSciNY Open Notebook Science
 
Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013Presentation for agINFRA Hackathon in Athens 12th December 2013
Presentation for agINFRA Hackathon in Athens 12th December 2013
 
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und PerspektivenForschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
Forschungsdaten-Repositorien Typen, Herausforderungen und Perspektiven
 
Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017Being Reproducible: SSBSS Summer School 2017
Being Reproducible: SSBSS Summer School 2017
 
The eCrystals Federation
The eCrystals FederationThe eCrystals Federation
The eCrystals Federation
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
 
Reproducible Research and the Cloud
Reproducible Research and the CloudReproducible Research and the Cloud
Reproducible Research and the Cloud
 
An Open Context for Archaeology
An Open Context for ArchaeologyAn Open Context for Archaeology
An Open Context for Archaeology
 
Blogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart LabsBlogs Logs Pods: Smart Labs
Blogs Logs Pods: Smart Labs
 
Open science platforms
Open science platformsOpen science platforms
Open science platforms
 

Plus de Sergey Sozykin

Susu seminar summer_2012
Susu seminar summer_2012Susu seminar summer_2012
Susu seminar summer_2012Sergey Sozykin
 
лекция 5 graphen
лекция 5 graphenлекция 5 graphen
лекция 5 graphenSergey Sozykin
 
лекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsbлекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsbSergey Sozykin
 
лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах Sergey Sozykin
 
лекция 5 memristor
лекция 5 memristorлекция 5 memristor
лекция 5 memristorSergey Sozykin
 
лекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физикилекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физикиSergey Sozykin
 

Plus de Sergey Sozykin (6)

Susu seminar summer_2012
Susu seminar summer_2012Susu seminar summer_2012
Susu seminar summer_2012
 
лекция 5 graphen
лекция 5 graphenлекция 5 graphen
лекция 5 graphen
 
лекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsbлекция 3 дефекты в полупроводниках ga n alsb
лекция 3 дефекты в полупроводниках ga n alsb
 
лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах лекция 2 атомные смещения в бинарных сплавах
лекция 2 атомные смещения в бинарных сплавах
 
лекция 5 memristor
лекция 5 memristorлекция 5 memristor
лекция 5 memristor
 
лекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физикилекция 1 обзор методов вычислительной физики
лекция 1 обзор методов вычислительной физики
 

Dernier

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 

Dernier (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 

Materials informatics

  • 1. Evgeny Blokhin Chelyabinsk SUSU’2013 summer workshop Max-Planck Institute for Solid State Research Stuttgart, Germany Materials informatics
  • 2. Outlook 1. Data-mining in materials science 2. Blue Obelisk 3. Python programming language
  • 3. What is data-mining? statistics databases information theory machine learning artificial intelligence optimization Data mining
  • 4. Tasks of data-mining 1. Classification 2. Prognosing 3. Visualization 4. Reasoning 5. Analysis 6. Expert systems
  • 5. Big data in materials science EXAMPLE: nearly for the last 4 years with my colleagues-theoreticians we produced: over 9000 simulation output files over 50 articles
  • 6.
  • 7. 1. Accelrys Pipeline Pilot and Materials Studio, http://accelrys.com/products 2. AFLOW framework and Aflowlib.org repository, http://www.aflowlib.org 3. AIDA, Bosch LLC 4. Blue Obelisk Data Repository (XSLT, XML), http://bodr.sourceforge.net 5. CCLib (Python), http://cclib.sf.net 6. CDF (Python), http://kitchingroup.cheme.cmu.edu/cdf 7. CMR (Python), https://wiki.fysik.dtu.dk/cmr 8. Comp. Chem. Comparison and Benchmark Database, http://cccbdb.nist.gov 9. cctbx: Computational Crystallography Toolbox, http://cctbx.sourceforge.net 10. ESTEST (Python, XQuery), http://estest.ucdavis.edu 11. J-ICE online viewer (based on Jmol, Java), http://j-ice.sourceforge.net 12. Materials Project (Python), http://www.materialsproject.org 13. PAULING FILE world largest database for inorganic compounds, http://paulingfile.com 14. Quixote, http://quixote.wikispot.org 15. Scipio (Java), https://scipio.iciq.es 16. WebMO: Web-based interface to computational chemistry packages (Java, Perl), http://webmo.net New type of modeling software
  • 8. …and smart codes ENCUT = 500 IBRION = 2 ISIF = 3 NSW = 20 IDIOT = 3 NELMIN = 5 EDIFF = 1.0e-08 EDIFFG = -1.0e-08 IALGO = 38 ISMEAR = 0 LREAL = .FALSE. LWAVE = .FALSE. *** VASP MASTER: I AM SURE YOU KNOW WHAT YOU ARE DOING ***
  • 9. d-metal oxides band gap problem standard DFT GGA approach Hartree-Fock admixing LCAO approximation Usage of Gaussian basis sets good atomization energy Example of inference over an ontology
  • 10.
  • 11. Open data, open standards, open source in chemistry
  • 12. Open data, open standards, open source in chemistry 1.Elsevier, Wiley, Springer publishers are “evil” 2.“The right to read is right to mine” 3.“Jailbreaking” the scientific data from PDFs: access, reuse, integrity 4.Why the level of collaboration is so low?
  • 13. Materials Project Prof. G. Ceder, MIT, Boston
  • 14. Guido van Rossum, Google, Dropbox http://goo.gl/FtFS7h Python programming language
  • 15. Advantages of Python Syntax: tabulation, syntactic sugar, speech- like, flexibility, expression VERY fast prototyping Great popularity in scientific community 100% cross-platform and portable
  • 16. Disadvantages of Python Relatively slow speed comparing to compiled languages like C++ or Fortran Global Interpreter Lock (GIL) Historically not popular in some narrow scientific areas (“reigns” of Java)
  • 17. Two examples list = [x**2 for x in range(10)] numbers = [10, 4, 2, -1, 6] filter(lambda x: x < 5, numbers)
  • 18. 1. Multi-dimensional array manipulation (fast!) 2. Discrete fourier transform 3. Linear Algebra 4. Mathematical functions 5. Matrix library 6. Polynomials 7. Set routines 8. Sorting, searching and counting 9. Statistics
  • 19. eigvals, eigvecs = numpy.linalg.eigh(dynmat) Solving eigenvalue problem for a dynamical matrix (phonopy code):