SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
Joel Saltz MD, PhD
Emory University
February 2013
Exascale Challenges: Space,
Time, Experimental Science and
Self Driving Cars 	
  
	
  
CenterforComprehensiveInformatics
Integrate Information from Sensors, Images, Cameras
•  Multi-dimensional spatial-temporal datasets
–  Radiology and Microscopy Image Analyses
–  Oil Reservoir Simulation/Carbon Sequestration/
Groundwater Pollution Remediation
–  Biomass monitoring and disaster surveillance using multiple
types of satellite imagery
–  Weather prediction using satellite and ground sensor data
–  Analysis of Results from Large Scale Simulations
–  Square Kilometer Array
–  Google Self Driving Car
•  Correlative and cooperative analysis of data from multiple
sensor modalities and sources
•  Equivalent from standpoint of data access patterns –
we propose a integrative sensor data mini-App
CenterforComprehensiveInformatics
CenterforComprehensiveInformatics
Deep Learning from Medical
Imaging
Emory In Silico Center for Brain Tumor
Research (PI = Dan Brat, PD= Joel Saltz)
CenterforComprehensiveInformatics
Integrative Cancer Research with Digital Pathology
histology neuroimaging
clincalpathology
Integrated
Analysis
molecular
High-resolution whole-slide microscopy
Multiplex IHC
CenterforComprehensiveInformatics
Morphological Tissue Classification
Nuclei Segmentation
Cellular Features
Lee Cooper,
Jun Kong
Whole Slide Imaging
CenterforComprehensiveInformatics
Direct Study of Relationship Between
vs
Lee Cooper,
Carlos Moreno
CenterforComprehensiveInformatics
HPC Segmentation and Feature Extraction Pipeline
Tony Pan and George Teodoro
CenterforComprehensiveInformatics
CenterforComprehensiveInformatics
Laser Captured Microdissection -- Spatio-temporal
“omic” studies
Macroscopic 3-D Tissue at Micron Resolution: OSU
BISTI NBIB Center Big Data (2005)
Associate genotype with
phenotype
Big science experiments on
cancer, heart disease,
pathogen host response
Tissue specimen -- 1 cm3
0.3 µ resolution – roughly 1013
bytes
Molecular data (spatial location)
can add additional significant
factor; e.g. 102
Multispectral imaging, laser
captured microdissection,
Imaging Mass Spec, Multiplex
QD
Multiple tissue specimens; another
factor of 103
Total: 1018 bytes – exabyte per big
science experiment
CenterforComprehensiveInformatics
CenterforComprehensiveInformatics
Complex Structure/Function Interactions – Very,
Very Active Materials
Image from: Gritsenko et al,
J Pathol 2012; 226: 185–199
CenterforComprehensiveInformatics
Core Transformations
•  Data Cleaning and Low Level Transformations
•  Data Subsetting, Filtering, Subsampling
•  Spatio-temporal Mapping and Registration
•  Object Segmentation
•  Feature Extraction
•  Object/Region/Feature Classification
•  Spatio-temporal Aggregation
•  Change Detection, Comparison, and Quantification
A Data Intense Challenge:
The Instrumented Oil Field of the Future
The Tyranny of Scale
(Tinsley Oden - U Texas)
process scale
field scale
km
cm
simulation scale
µm
pore scale
Why Applications Get Big
•  Physical world or simulation results
•  Detailed description of two, three (or more)
dimensional space
•  High resolution in each dimension, lots of
timesteps
•  e.g. oil reservoir code -- simulate 100 km by
100 km region to 1 km depth at resolution of
100 cm:
– 10^6*10^6*10^4 mesh points, 10^2 bytes per
mesh point, 10^6 timesteps --- 10^24 bytes
(Yottabyte) of data!!!
Detect and track changes in data during production
Invert data for reservoir properties
Detect and track reservoir changes
Assimilate data & reservoir properties into
the evolving reservoir model
Use simulation and optimization to guide future production
Oil Field Management – Joint ITR with Mary Wheeler,
Paul Stoffa
Coupled Ground Water and Surface Water Simulations
Multiple codes -- e.g. fluid code, contaminant
transport code
Different space and time scales
Data from a given fluid code run is used in different
contaminant transport code scenarios
Bioremediation Simulation
Microbe colonies
(magenta)
Dissolved NAPL (blue)
Mineral oxidation
products (green)
abiotic reactions
compete with
microbes,
reduce extent of
biodegradation
CenterforComprehensiveInformatics
Core Transformations (Again)
•  Data Cleaning and Low Level Transformations
•  Data Subsetting, Filtering, Subsampling
•  Spatio-temporal Mapping and Registration
•  Object Segmentation
•  Feature Extraction
•  Object/Region/Feature Classification
•  Spatio-temporal Aggregation
•  Change Detection, Comparison, and Quantification
CenterforComprehensiveInformatics
Sensor Integration and Analysis -- “Mini-app”
and Co-Design Vehicle
•  Tremendous commonality between applications that
compose and analyze information from multiple sensors/
cameras/scientific simulations
•  Recommendation – define and publicize “abstract
application class” that captures essential aspects
•  Spatio-temporal data is often accompanied by chemical
species, ‘omics” or appropriate domain specific chemical
information
•  Computationally similar applications are currently
independently developed by many research communities
•  Biology/Medicine is not a special case!!!
•  Mini-app is quite doable and would be tremendously
useful if accompanied by domain area buy-in
CenterforComprehensiveInformatics
Runtime Support
•  Hierarchical dataflow/task management
•  Programming model and runtime support need to
work together: specify/extract tasks, dependencies
•  Concurrent Collections (CnC), ParalleX Execution
Model, Region Templates
CenterforComprehensiveInformatics
Runtime Support Objectives
•  Coordinated mapping of data and computation to
complex memory hierarchies
•  Hierarchical work assignment with flexibility capable
of dealing with data dependent computational
patterns, fluctuations in computational speed
associated with power management, faults
•  Linked to comprehensible programming model –
model targeted at abstract application class but not
to application domain (In the sensor, image,
camera case -- Region Templates)
•  Software stack including coordinated compiler/
runtime support/autotuning frameworks
CenterforComprehensiveInformatics
Coarse Grain Workflows: Interoperability
•  Pipelines are complex and written in multiple languages
and designed to run on multiple environments
•  Key components needed to tackle problems may be
purpose built to run on particular environments, may be
proprietary
•  Patient privacy and HIPPA issues can constrain portions
of computations to institutional or highly secure, HIPPA
certified environments
•  Last mile bandwidth issues, performance/storage
availability where data needs to be staged. Large
number of tactical pitfalls which erode researcher
productivity
•  Data generation is cheap and often local, still expensive
to move multiple TB/PB data to supercomputer centers
CenterforComprehensiveInformatics
Exascale Hardware/Software Architecture
•  Need to stage very large datasets for relatively
short periods of time -- large aggregate bandwidth
to non volatile scratch storage -- distributed flash
and disk
•  Globally addressed/indexed persistent data
collections -- e.g. DataSpaces, Region Templates
(GIS analogy), persistent PGAS
•  Intelligent I/O with in-transit processing, data
reduction (e.g. ADIOS)
•  Visualizations need to be carried out interactively
and in situ as data is produced and as computations
proceed – efficient streaming data
CenterforComprehensiveInformatics
Data Representation and Query
Ø  Complex data models capturing multi-faceted
information including markups, annotations,
algorithm provenance etc.
Ø  Much data modeling can be re-used in sensor image
camera application class
Ø  Efficient implementations of data models
Ø  ADIOS, in transit processing – data, format
transformations, reductions, summarizations close
to data
Ø  Key-value stores capable of supporting efficient
application data access in deep, complex storage
hierarchies

Contenu connexe

Tendances

The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...inside-BigData.com
 
CloudLightning at a Glance Infographic
CloudLightning at a Glance InfographicCloudLightning at a Glance Infographic
CloudLightning at a Glance InfographicCloudLightning
 
AusPlots field data collection with AusScribe
AusPlots field data collection with AusScribeAusPlots field data collection with AusScribe
AusPlots field data collection with AusScribeTERN Australia
 
How TERN Data Infrastructure works
How TERN Data Infrastructure worksHow TERN Data Infrastructure works
How TERN Data Infrastructure worksTERN Australia
 
AURIN - Overview
AURIN - OverviewAURIN - Overview
AURIN - OverviewARDC
 
Australian Ecosystems Science Cloud
Australian Ecosystems Science CloudAustralian Ecosystems Science Cloud
Australian Ecosystems Science CloudTERN Australia
 

Tendances (7)

The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
The Transformation of HPC: Simulation and Cognitive Methods in the Era of Big...
 
CloudLightning at a Glance Infographic
CloudLightning at a Glance InfographicCloudLightning at a Glance Infographic
CloudLightning at a Glance Infographic
 
AusPlots field data collection with AusScribe
AusPlots field data collection with AusScribeAusPlots field data collection with AusScribe
AusPlots field data collection with AusScribe
 
How TERN Data Infrastructure works
How TERN Data Infrastructure worksHow TERN Data Infrastructure works
How TERN Data Infrastructure works
 
RPI Retinal Testbed
RPI Retinal TestbedRPI Retinal Testbed
RPI Retinal Testbed
 
AURIN - Overview
AURIN - OverviewAURIN - Overview
AURIN - Overview
 
Australian Ecosystems Science Cloud
Australian Ecosystems Science CloudAustralian Ecosystems Science Cloud
Australian Ecosystems Science Cloud
 

En vedette

Experimental Computer Science - Approaches and Instruments
Experimental Computer Science - Approaches and InstrumentsExperimental Computer Science - Approaches and Instruments
Experimental Computer Science - Approaches and InstrumentsFrederic Desprez
 
Before you graduate. Things to learn for every computer science student
Before you graduate. Things to learn for every computer science studentBefore you graduate. Things to learn for every computer science student
Before you graduate. Things to learn for every computer science studentAshish Gaikwad
 
Experimental Economics - Steve Tucker
Experimental Economics - Steve TuckerExperimental Economics - Steve Tucker
Experimental Economics - Steve TuckerCharlies1000
 
Scientific methods in computer science
Scientific methods in computer scienceScientific methods in computer science
Scientific methods in computer scienceParinda Rajapaksha
 
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)Joel Saltz
 
Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing Joel Saltz
 
Day 2 meet shilpa - measuring software quality-are you up-to-date on what an...
Day 2 meet shilpa  - measuring software quality-are you up-to-date on what an...Day 2 meet shilpa  - measuring software quality-are you up-to-date on what an...
Day 2 meet shilpa - measuring software quality-are you up-to-date on what an...XBOSoft
 
Extreme Spatio-Temporal Data Analysis
Extreme Spatio-Temporal Data AnalysisExtreme Spatio-Temporal Data Analysis
Extreme Spatio-Temporal Data AnalysisJoel Saltz
 
Integrative Multi-Scale Analyses
Integrative Multi-Scale AnalysesIntegrative Multi-Scale Analyses
Integrative Multi-Scale AnalysesJoel Saltz
 
Autodeterminació i aprenentatge
Autodeterminació i aprenentatge Autodeterminació i aprenentatge
Autodeterminació i aprenentatge ebau
 
Photography Research
Photography ResearchPhotography Research
Photography Research2593Thomas
 
Inquiry labs for science
Inquiry labs for scienceInquiry labs for science
Inquiry labs for scienceSorayalu
 
Zo Williams Marketing Deck Final (1)
Zo Williams Marketing Deck Final (1)Zo Williams Marketing Deck Final (1)
Zo Williams Marketing Deck Final (1)ZOWILLIAMS
 
March 18 webinar Meet Shilpa! She will tell if you are driving software quali...
March 18 webinar Meet Shilpa! She will tell if you are driving software quali...March 18 webinar Meet Shilpa! She will tell if you are driving software quali...
March 18 webinar Meet Shilpa! She will tell if you are driving software quali...XBOSoft
 
discapacitat i logopèdia
discapacitat i logopèdiadiscapacitat i logopèdia
discapacitat i logopèdiaebau
 
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...Joel Saltz
 
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...MICCAI - Workshop on High Performance and Distributed Computing for Medical I...
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...Joel Saltz
 
High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-InformaticsJoel Saltz
 
Photography research2
Photography research2Photography research2
Photography research22593Thomas
 
Inquiry labs for science
Inquiry labs for scienceInquiry labs for science
Inquiry labs for scienceSorayalu
 

En vedette (20)

Experimental Computer Science - Approaches and Instruments
Experimental Computer Science - Approaches and InstrumentsExperimental Computer Science - Approaches and Instruments
Experimental Computer Science - Approaches and Instruments
 
Before you graduate. Things to learn for every computer science student
Before you graduate. Things to learn for every computer science studentBefore you graduate. Things to learn for every computer science student
Before you graduate. Things to learn for every computer science student
 
Experimental Economics - Steve Tucker
Experimental Economics - Steve TuckerExperimental Economics - Steve Tucker
Experimental Economics - Steve Tucker
 
Scientific methods in computer science
Scientific methods in computer scienceScientific methods in computer science
Scientific methods in computer science
 
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
Biomedical Informatics Program -- Atlanta CTSA (ACTSI)
 
Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing
 
Day 2 meet shilpa - measuring software quality-are you up-to-date on what an...
Day 2 meet shilpa  - measuring software quality-are you up-to-date on what an...Day 2 meet shilpa  - measuring software quality-are you up-to-date on what an...
Day 2 meet shilpa - measuring software quality-are you up-to-date on what an...
 
Extreme Spatio-Temporal Data Analysis
Extreme Spatio-Temporal Data AnalysisExtreme Spatio-Temporal Data Analysis
Extreme Spatio-Temporal Data Analysis
 
Integrative Multi-Scale Analyses
Integrative Multi-Scale AnalysesIntegrative Multi-Scale Analyses
Integrative Multi-Scale Analyses
 
Autodeterminació i aprenentatge
Autodeterminació i aprenentatge Autodeterminació i aprenentatge
Autodeterminació i aprenentatge
 
Photography Research
Photography ResearchPhotography Research
Photography Research
 
Inquiry labs for science
Inquiry labs for scienceInquiry labs for science
Inquiry labs for science
 
Zo Williams Marketing Deck Final (1)
Zo Williams Marketing Deck Final (1)Zo Williams Marketing Deck Final (1)
Zo Williams Marketing Deck Final (1)
 
March 18 webinar Meet Shilpa! She will tell if you are driving software quali...
March 18 webinar Meet Shilpa! She will tell if you are driving software quali...March 18 webinar Meet Shilpa! She will tell if you are driving software quali...
March 18 webinar Meet Shilpa! She will tell if you are driving software quali...
 
discapacitat i logopèdia
discapacitat i logopèdiadiscapacitat i logopèdia
discapacitat i logopèdia
 
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
 
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...MICCAI - Workshop on High Performance and Distributed Computing for Medical I...
MICCAI - Workshop on High Performance and Distributed Computing for Medical I...
 
High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-Informatics
 
Photography research2
Photography research2Photography research2
Photography research2
 
Inquiry labs for science
Inquiry labs for scienceInquiry labs for science
Inquiry labs for science
 

Similaire à Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars

High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Semantics in Sensor Networks
Semantics in Sensor NetworksSemantics in Sensor Networks
Semantics in Sensor NetworksOscar Corcho
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchYehia El-khatib
 
Data analytics for engineers- introduction
Data analytics for engineers-  introductionData analytics for engineers-  introduction
Data analytics for engineers- introductionRINUSATHYAN
 
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...israel edem
 
Starfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analyticsStarfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analyticssai Pramoda
 
ACES QuakeSim 2011
ACES QuakeSim 2011ACES QuakeSim 2011
ACES QuakeSim 2011marpierc
 
temporal and spatial database.pptx
temporal and spatial database.pptxtemporal and spatial database.pptx
temporal and spatial database.pptx64837JAYAASRIK
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeCarole Goble
 
Big Data Analytics for connected home
Big Data Analytics for connected homeBig Data Analytics for connected home
Big Data Analytics for connected homeHéloïse Nonne
 
Data Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather ForecastingData Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather ForecastingJames Arnold Faeldon
 
Bitkom Cray presentation - on HPC affecting big data analytics in FS
Bitkom Cray presentation - on HPC affecting big data analytics in FSBitkom Cray presentation - on HPC affecting big data analytics in FS
Bitkom Cray presentation - on HPC affecting big data analytics in FSPhilip Filleul
 
Graham Pryor
Graham PryorGraham Pryor
Graham PryorEduserv
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer OverlordsIan Foster
 
Uniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream ITUniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream ITgssg
 
Overview of DuraMat software tool development
Overview of DuraMat software tool developmentOverview of DuraMat software tool development
Overview of DuraMat software tool developmentAnubhav Jain
 
Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...
Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...
Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...lleonardSlideShare
 

Similaire à Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars (20)

High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Semantics in Sensor Networks
Semantics in Sensor NetworksSemantics in Sensor Networks
Semantics in Sensor Networks
 
Analytics&IoT
Analytics&IoTAnalytics&IoT
Analytics&IoT
 
Adoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific ResearchAdoption of Cloud Computing in Scientific Research
Adoption of Cloud Computing in Scientific Research
 
Data analytics for engineers- introduction
Data analytics for engineers-  introductionData analytics for engineers-  introduction
Data analytics for engineers- introduction
 
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
The Paradigm of Fog Computing with Bio-inspired Search Methods and the “5Vs” ...
 
Starfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analyticsStarfish-A self tuning system for bigdata analytics
Starfish-A self tuning system for bigdata analytics
 
ACES QuakeSim 2011
ACES QuakeSim 2011ACES QuakeSim 2011
ACES QuakeSim 2011
 
Big Data
Big Data Big Data
Big Data
 
temporal and spatial database.pptx
temporal and spatial database.pptxtemporal and spatial database.pptx
temporal and spatial database.pptx
 
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, RomeWorkflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
Workflows, provenance and reporting: a lifecycle perspective at BIH 2013, Rome
 
Big Data Analytics for connected home
Big Data Analytics for connected homeBig Data Analytics for connected home
Big Data Analytics for connected home
 
Data Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather ForecastingData Centric HPC for Numerical Weather Forecasting
Data Centric HPC for Numerical Weather Forecasting
 
Bitkom Cray presentation - on HPC affecting big data analytics in FS
Bitkom Cray presentation - on HPC affecting big data analytics in FSBitkom Cray presentation - on HPC affecting big data analytics in FS
Bitkom Cray presentation - on HPC affecting big data analytics in FS
 
Graham Pryor
Graham PryorGraham Pryor
Graham Pryor
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
 
Uniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream ITUniting traditional GIS and mainstream IT
Uniting traditional GIS and mainstream IT
 
Overview of DuraMat software tool development
Overview of DuraMat software tool developmentOverview of DuraMat software tool development
Overview of DuraMat software tool development
 
Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...
Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...
Data‐intensive hydrologic modeling: A Cloud strategy for integrating PIHM, GI...
 
SciDB
SciDBSciDB
SciDB
 

Plus de Joel Saltz

AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersJoel Saltz
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillanceJoel Saltz
 
Learning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingLearning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingJoel Saltz
 
Integrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataIntegrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataJoel Saltz
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Joel Saltz
 
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerExtreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerJoel Saltz
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeJoel Saltz
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeJoel Saltz
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineJoel Saltz
 
Pathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicinePathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicineJoel Saltz
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataJoel Saltz
 
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Joel Saltz
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeJoel Saltz
 
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Joel Saltz
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Joel Saltz
 
Data Science, Big Data and You
Data Science, Big Data and YouData Science, Big Data and You
Data Science, Big Data and YouJoel Saltz
 
Data and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical InformaticsData and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical InformaticsJoel Saltz
 
Role of Biomedical Informatics in Translational Cancer Research
Role of Biomedical Informatics in Translational Cancer ResearchRole of Biomedical Informatics in Translational Cancer Research
Role of Biomedical Informatics in Translational Cancer ResearchJoel Saltz
 
Presentation at UHC Annual Meeting
Presentation at UHC  Annual MeetingPresentation at UHC  Annual Meeting
Presentation at UHC Annual MeetingJoel Saltz
 
Indiana 4 2011 Final Final
Indiana 4 2011 Final FinalIndiana 4 2011 Final Final
Indiana 4 2011 Final FinalJoel Saltz
 

Plus de Joel Saltz (20)

AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
 
Learning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingLearning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale Computing
 
Integrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataIntegrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming Data
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
 
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerExtreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Pathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicinePathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision Medicine
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of Data
 
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology Phenotype
 
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014
 
Data Science, Big Data and You
Data Science, Big Data and YouData Science, Big Data and You
Data Science, Big Data and You
 
Data and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical InformaticsData and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical Informatics
 
Role of Biomedical Informatics in Translational Cancer Research
Role of Biomedical Informatics in Translational Cancer ResearchRole of Biomedical Informatics in Translational Cancer Research
Role of Biomedical Informatics in Translational Cancer Research
 
Presentation at UHC Annual Meeting
Presentation at UHC  Annual MeetingPresentation at UHC  Annual Meeting
Presentation at UHC Annual Meeting
 
Indiana 4 2011 Final Final
Indiana 4 2011 Final FinalIndiana 4 2011 Final Final
Indiana 4 2011 Final Final
 

Dernier

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Dernier (20)

Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars

  • 1. Joel Saltz MD, PhD Emory University February 2013 Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars    
  • 2. CenterforComprehensiveInformatics Integrate Information from Sensors, Images, Cameras •  Multi-dimensional spatial-temporal datasets –  Radiology and Microscopy Image Analyses –  Oil Reservoir Simulation/Carbon Sequestration/ Groundwater Pollution Remediation –  Biomass monitoring and disaster surveillance using multiple types of satellite imagery –  Weather prediction using satellite and ground sensor data –  Analysis of Results from Large Scale Simulations –  Square Kilometer Array –  Google Self Driving Car •  Correlative and cooperative analysis of data from multiple sensor modalities and sources •  Equivalent from standpoint of data access patterns – we propose a integrative sensor data mini-App
  • 5. Emory In Silico Center for Brain Tumor Research (PI = Dan Brat, PD= Joel Saltz)
  • 6. CenterforComprehensiveInformatics Integrative Cancer Research with Digital Pathology histology neuroimaging clincalpathology Integrated Analysis molecular High-resolution whole-slide microscopy Multiplex IHC
  • 7. CenterforComprehensiveInformatics Morphological Tissue Classification Nuclei Segmentation Cellular Features Lee Cooper, Jun Kong Whole Slide Imaging
  • 8. CenterforComprehensiveInformatics Direct Study of Relationship Between vs Lee Cooper, Carlos Moreno
  • 9. CenterforComprehensiveInformatics HPC Segmentation and Feature Extraction Pipeline Tony Pan and George Teodoro
  • 12. Macroscopic 3-D Tissue at Micron Resolution: OSU BISTI NBIB Center Big Data (2005) Associate genotype with phenotype Big science experiments on cancer, heart disease, pathogen host response Tissue specimen -- 1 cm3 0.3 µ resolution – roughly 1013 bytes Molecular data (spatial location) can add additional significant factor; e.g. 102 Multispectral imaging, laser captured microdissection, Imaging Mass Spec, Multiplex QD Multiple tissue specimens; another factor of 103 Total: 1018 bytes – exabyte per big science experiment
  • 14. CenterforComprehensiveInformatics Complex Structure/Function Interactions – Very, Very Active Materials Image from: Gritsenko et al, J Pathol 2012; 226: 185–199
  • 15. CenterforComprehensiveInformatics Core Transformations •  Data Cleaning and Low Level Transformations •  Data Subsetting, Filtering, Subsampling •  Spatio-temporal Mapping and Registration •  Object Segmentation •  Feature Extraction •  Object/Region/Feature Classification •  Spatio-temporal Aggregation •  Change Detection, Comparison, and Quantification
  • 16. A Data Intense Challenge: The Instrumented Oil Field of the Future
  • 17.
  • 18. The Tyranny of Scale (Tinsley Oden - U Texas) process scale field scale km cm simulation scale µm pore scale
  • 19. Why Applications Get Big •  Physical world or simulation results •  Detailed description of two, three (or more) dimensional space •  High resolution in each dimension, lots of timesteps •  e.g. oil reservoir code -- simulate 100 km by 100 km region to 1 km depth at resolution of 100 cm: – 10^6*10^6*10^4 mesh points, 10^2 bytes per mesh point, 10^6 timesteps --- 10^24 bytes (Yottabyte) of data!!!
  • 20. Detect and track changes in data during production Invert data for reservoir properties Detect and track reservoir changes Assimilate data & reservoir properties into the evolving reservoir model Use simulation and optimization to guide future production Oil Field Management – Joint ITR with Mary Wheeler, Paul Stoffa
  • 21. Coupled Ground Water and Surface Water Simulations Multiple codes -- e.g. fluid code, contaminant transport code Different space and time scales Data from a given fluid code run is used in different contaminant transport code scenarios
  • 22. Bioremediation Simulation Microbe colonies (magenta) Dissolved NAPL (blue) Mineral oxidation products (green) abiotic reactions compete with microbes, reduce extent of biodegradation
  • 23. CenterforComprehensiveInformatics Core Transformations (Again) •  Data Cleaning and Low Level Transformations •  Data Subsetting, Filtering, Subsampling •  Spatio-temporal Mapping and Registration •  Object Segmentation •  Feature Extraction •  Object/Region/Feature Classification •  Spatio-temporal Aggregation •  Change Detection, Comparison, and Quantification
  • 24. CenterforComprehensiveInformatics Sensor Integration and Analysis -- “Mini-app” and Co-Design Vehicle •  Tremendous commonality between applications that compose and analyze information from multiple sensors/ cameras/scientific simulations •  Recommendation – define and publicize “abstract application class” that captures essential aspects •  Spatio-temporal data is often accompanied by chemical species, ‘omics” or appropriate domain specific chemical information •  Computationally similar applications are currently independently developed by many research communities •  Biology/Medicine is not a special case!!! •  Mini-app is quite doable and would be tremendously useful if accompanied by domain area buy-in
  • 25. CenterforComprehensiveInformatics Runtime Support •  Hierarchical dataflow/task management •  Programming model and runtime support need to work together: specify/extract tasks, dependencies •  Concurrent Collections (CnC), ParalleX Execution Model, Region Templates
  • 26. CenterforComprehensiveInformatics Runtime Support Objectives •  Coordinated mapping of data and computation to complex memory hierarchies •  Hierarchical work assignment with flexibility capable of dealing with data dependent computational patterns, fluctuations in computational speed associated with power management, faults •  Linked to comprehensible programming model – model targeted at abstract application class but not to application domain (In the sensor, image, camera case -- Region Templates) •  Software stack including coordinated compiler/ runtime support/autotuning frameworks
  • 27. CenterforComprehensiveInformatics Coarse Grain Workflows: Interoperability •  Pipelines are complex and written in multiple languages and designed to run on multiple environments •  Key components needed to tackle problems may be purpose built to run on particular environments, may be proprietary •  Patient privacy and HIPPA issues can constrain portions of computations to institutional or highly secure, HIPPA certified environments •  Last mile bandwidth issues, performance/storage availability where data needs to be staged. Large number of tactical pitfalls which erode researcher productivity •  Data generation is cheap and often local, still expensive to move multiple TB/PB data to supercomputer centers
  • 28. CenterforComprehensiveInformatics Exascale Hardware/Software Architecture •  Need to stage very large datasets for relatively short periods of time -- large aggregate bandwidth to non volatile scratch storage -- distributed flash and disk •  Globally addressed/indexed persistent data collections -- e.g. DataSpaces, Region Templates (GIS analogy), persistent PGAS •  Intelligent I/O with in-transit processing, data reduction (e.g. ADIOS) •  Visualizations need to be carried out interactively and in situ as data is produced and as computations proceed – efficient streaming data
  • 29. CenterforComprehensiveInformatics Data Representation and Query Ø  Complex data models capturing multi-faceted information including markups, annotations, algorithm provenance etc. Ø  Much data modeling can be re-used in sensor image camera application class Ø  Efficient implementations of data models Ø  ADIOS, in transit processing – data, format transformations, reductions, summarizations close to data Ø  Key-value stores capable of supporting efficient application data access in deep, complex storage hierarchies