SlideShare a Scribd company logo
1 of 55
Integrative Multi-Scale Biomedical Informatics Joel Saltz MD, PhD Director Center for Comprehensive Informatics
Leverage exascale data and computer resources to squeeze the most out of image, sensor or simulation data Run lots of different algorithms to derive same features Run lots of algorithms to derive complementary features Data models and data management infrastructure to manage data products, feature sets and results from classification and machine learning algorithms Squeezing Information from Spatial Datasets
Outline Integrative biomedical informatics analysis –feature sets obtained from Pathology and Radiology studies Techniques, tools and methodologies for derivation, management and analysis of feature sets
INTEGrative Biomedical Informatics Analysis Reproducible anatomic/functional characterization at gross level (Radiology) and fine level (Pathology) Integration of anatomic/functional characterization with multiple types of “omic” information Create categories of jointly classified data to describe pathophysiology, predict prognosis, response to treatment
In Silico Center for Brain Tumor Research Specific Aims: Influence of necrosis/ hypoxia on gene expression and genetic classification. 2.     Molecular correlates of high resolution nuclear morphometry. Gene expression profiles  that predict glioma progression. 4. 	Molecular correlates of MRI enhancement patterns.
Integration of heterogeneous multiscale information Radiology Imaging Patient  Outcome “Omic” Data       ,[object Object]
Exploit synergies between all initiatives to improve ability to forecast survival & response.Pathologic Features
Example: Pathology and Gene Expression Joint Predictors of Recurrence/Survival Lee Cooper   Carlos Moreno
Feature Characterization in Pathology and Radiology Role – In silico Brain Tumor Research Algorithms Scaling Requirements
In Silico Center for Brain Tumor Research Key Data Sets REMBRANDT: Gene expression and genomics data set of all glioma subtypes The Cancer Genome Atlas (TCGA): Rich “omics” set of GBM, digitized Pathology and Radiology Pathology and Radiology Images from Henry Ford Hospital, Emory, Thomas Jefferson U, MD Anderson and others
TCGA Research Network Digital Pathology Neuroimaging
Progression to GBM Anaplastic Astrocytoma (WHO grade III) Glioblastoma (WHO grade IV)
TCGA Neuropathology Attributes 120 TCGA specimens; 3 Reviewers  Presence and Degree of: Microvascular hyperplasia ,[object Object]
Endothelial hyperplasiaNecrosis  ,[object Object]
Zonal necrosisInflammation ,[object Object]
Lymphocytes
Neutrophils  Differentiation: ,[object Object]
Gemistocytes
Oligodendroglial
Multi-nucleated/giant cells
Epithelial metaplasia          
Mesenchymal metaplasiaOther Features ,[object Object]
Entrapped gray or white matter
Micro-mineralization  ,[object Object]
TCGA Whole Slide Images Feature Extraction Jun Kong
AstrocytomavsOligodendroglimaOverlap in genetics, gene expression, histology AstrocytomavsOligodendroglima ,[object Object],[object Object]
Clustergram of selected features used in consensus clustering Nuclear Features Used to Classify GBMs
Nuclear Features Used to Classify GBMs Consensus clustering of morphological signatures Study includes 200 million nuclei taken from 480 slides corresponding to 167 distinct patients.
Survival of morphological clusters
Survival of patients by molecular tumor subtype
Articulate Physical Interpretations of Results
Images
Multiscale Systems Biology Employ multi-resolution methods to characterize necrosis, angiogenesis and correlate these with “omics” Rim-enhancement Vascular Changes Rapid progression No enhancement Normal Vessels Stable lesion ?
Correlation of Necrosis, Angiogenesis and “omics” GBMs display variable and regionally heterogeneous degrees of necrosis (asterisk) and angiogenesis  These factors may impact gene expression profiles
Genes Correlated with Necrosis include Transcription Factors Identified as Regulators of the Mesenchymal Transition ,[object Object]
Extent of both necrosis and angiogenesis calculated as a percentage of total tissue areaCarro MS, et al. Nature 263: 318-25, 2010
Feature Sets in Radiology(Adam Flanders, TJU;  Dan Rubin, Stanford, Lori Dodd, NCI) Require standardized validated feature sets to describe de novo disease. Fundamental obstacle to new imaging criteria as treatment biomarkers is lack of standard terminology: To define a comprehensive set of imaging features of cancer For reporting imaging results To provide a more quantitative, reproducible basis for assessing baseline disease and treatment response
Defining Rich Set of Qualitative and Quantitative Image Biomarkers Community-driven ontology development project; collaboration with ASNR Imaging features (5 categories) Locationof lesion Morphology of lesion margin(definition, thickness, enhancement, diffusion) Morphology of lesion substance (enhancement, PS characteristics, focality/multicentricity, necrosis, cysts, midline invasion, cortical involvement, T1/FLAIR ratio) Alterations in vicinity of lesion (edema, edema crossing midline, hemorrhage, pial invasion, ependymal invasion, satellites, deep WM invasion, calvarial remodeling) Resection features (extent of nCE tissue, CE tissue, resected components)
Imaging Predictors of survival and molecular profiles in the TCGA Glioblastoma Data set The TCGA glioma working group 1Emory University Hospital, Atlanta, GA 2National Cancer Institute, Bethesda, MD.  3Thomas Jefferson University Hospital, Philadelphia, PA. 4Henry Ford University Hospital, Detroit, Michigan. 5National Institute of Health, Bethesda, MD.  6Boston University School of Medicine, Boston, MA. 7SAIC-Frederick, Inc., Frederick, MD. 8University of Virginia, Charlottesville, VA. 9Northwestern University Chicago, IL
Assumed Dependence Between Features F1: Tumor Location F3: Eloq. Brain F5: Prop. Enhance F6: Prop. nCET F7: Prop. Necrosis F9: Distribution F10: T1/FLAIR F11: En. Marg.Thick. F13: Def. Non.Marg. F14: Prop. Edema F16: Hemorrhage F18: Pial Invasion F19: Ependymal F20: Cort. Involve. F21: Deep WM Inv. F22: nCET Cross. Mid. F24: Satellites F18 F20 F9 F3 F16 F7 F22 F21 F1 F6 F11 F5 F14 F19 F10 F13 F24 NOTE: each feature omitted from this graph is independent of every other feature. Slide thanks to Eric Huang, NIH
Estimation Problem Size Reduction Can ignore seven of the features  size of contingency table reduced from 2.64 × 1012 cells to 1.34 × 1010 cells. Collapsibility reduces size of contingency table even further:  Any binary feature Fj connected to only one feature on graph (i.e. given the feature Fj is connected to, Fj is independent of all other features) can also be ignored Eliminates need to deal with Hemorrhage (F16), Pial Invasion (F18), Cortical Involvement (F20), and Satellites (F24). Reduces size of contingency table to 1.68 × 109 cells. Additional analogous considerations can be used to reduce size of contingency table by more than two additional orders of magnitude Slide thanks to Eric Huang, NIH
Correlative Imaging Results Minimal enhancing tumor (≤5%) strongly associated with Proneural classification (p=0.0006).  >5% proportion of necrosis and the presence of microvascular hyperplasia in pathology slides (p=0.008). Greater maximum tumor dimension (T2 signal) associated with present/abundant microvascular hyperplasia (p=0.001). < 5% Enhancement
Correlative  Imaging Results TP53 mutant tumors had a smaller mean tumor sizes (p=0.002) on T2-weighted or FLAIR images.  EGFR mutant tumors were significantly larger than TP53 mutant tumors (p=0.0005).  High level EGFR amplification was associated with >5% enhancement and >5% proportion of necrosis (p < 0.01).  > 5% Necrosis
Squeezing Information from Spatial Datasets Leverage exascale data and computer resources to squeeze the most out of image, sensor or simulation data Run lots of different algorithms to derive same features Run lots of algorithms to derive complementary features Data models and data management infrastructure to manage data products, feature sets and results from classification and machine learning algorithms
Pipeline for Whole Slide Feature Characterization 1010 pixels for each whole slide image 10 whole slide images per patient 108  image features per whole slide image 10,000 brain tumor patients 1015  pixels 1013 features Hundreds of algorithms Annotations and markups from dozens of humans
PAIS Database ,[object Object]
Represented by a complex data model capturing multi-faceted information including markups, annotations, algorithm provenance, specimen, etc.
Support for complex relationships and spatial query: multi-level granularities, relationships between markups and annotations, spatial and nested relationshipsPAIS Database ,[object Object]
Represented by a complex data model capturing multi-faceted information including markups, annotations, algorithm provenance, specimen, etc.
Support for complex relationships and spatial query: multi-level granularities, relationships between markups and annotations, spatial and nested relationships
Support for high-level data statistical analysis,[object Object]
Pathology Imaging GIS Image analysis PAIS model PAIS data management  Modeling and management of markup and annotation for querying and sharing through parallel RDBMS + spatial DBMS Segmentation HDFS data staging MapReduce based queries On the fly data processing for algorithm validation/algorithm sensitivity studies, or discovery of preliminary results Feature extraction
Generation and Analysis of Imaging Features In-transit data processing using filter/stream systems Semantic  Workflows Hierarchical pipeline design with coarse and fine grained components Adaptivity and Quality of Service
Same basic story in multiple domains
Classification using DataCutter Filter Stream Workflow

More Related Content

What's hot

MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...
MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...
MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...European School of Oncology
 
Manuel Salto-Tellez on Personalised medicine and the future of tissue pathology
Manuel Salto-Tellez on Personalised medicine and the future of tissue pathologyManuel Salto-Tellez on Personalised medicine and the future of tissue pathology
Manuel Salto-Tellez on Personalised medicine and the future of tissue pathologyCirdan
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathologynehaSingh1543
 
Dr. Martin H. Hager IMPaCT Conference Orlando
Dr. Martin H. Hager IMPaCT Conference Orlando Dr. Martin H. Hager IMPaCT Conference Orlando
Dr. Martin H. Hager IMPaCT Conference Orlando Dr. Martin Hager, MBA
 
Application of immunotherapy therapeutics in prostate adenocarcinoma slides
Application of immunotherapy therapeutics in prostate adenocarcinoma slidesApplication of immunotherapy therapeutics in prostate adenocarcinoma slides
Application of immunotherapy therapeutics in prostate adenocarcinoma slidesJamie Cooper
 
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...Yoon Sup Choi
 
Dr. Hager Scholarship at Harvard (2012)
Dr. Hager Scholarship at Harvard (2012)Dr. Hager Scholarship at Harvard (2012)
Dr. Hager Scholarship at Harvard (2012)Dr. Martin Hager, MBA
 
Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...
Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...
Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...Cirdan
 
Circulating tumor cells in crc
Circulating tumor cells in crcCirculating tumor cells in crc
Circulating tumor cells in crcNilesh Kucha
 
Future direction in the management of high risk LOW GRADE GLIOMA
Future direction in the management of high risk LOW GRADE GLIOMAFuture direction in the management of high risk LOW GRADE GLIOMA
Future direction in the management of high risk LOW GRADE GLIOMAapollo seminar group
 
High grade gliomas 8 august 2016
High grade gliomas 8 august 2016High grade gliomas 8 august 2016
High grade gliomas 8 august 2016Gaurav Kumar
 
molecular imaging with PET & SPECT
molecular imaging with PET & SPECTmolecular imaging with PET & SPECT
molecular imaging with PET & SPECTShatha M
 
Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...
Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...
Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...Clinical Surgery Research Communications
 
2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...
2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...
2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...CFTCC
 
Molecular Imaging
Molecular ImagingMolecular Imaging
Molecular ImagingChaz874
 

What's hot (20)

MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...
MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...
MCO 2011 - Slide 33 - C. Svedman - Spotlight session - Criteria to evaluate g...
 
Manuel Salto-Tellez on Personalised medicine and the future of tissue pathology
Manuel Salto-Tellez on Personalised medicine and the future of tissue pathologyManuel Salto-Tellez on Personalised medicine and the future of tissue pathology
Manuel Salto-Tellez on Personalised medicine and the future of tissue pathology
 
2014 Northwest Melanoma Symposium Slide Deck
2014 Northwest Melanoma Symposium Slide Deck2014 Northwest Melanoma Symposium Slide Deck
2014 Northwest Melanoma Symposium Slide Deck
 
Artificial Intelligence in pathology
Artificial Intelligence in pathologyArtificial Intelligence in pathology
Artificial Intelligence in pathology
 
Dr. Martin H. Hager IMPaCT Conference Orlando
Dr. Martin H. Hager IMPaCT Conference Orlando Dr. Martin H. Hager IMPaCT Conference Orlando
Dr. Martin H. Hager IMPaCT Conference Orlando
 
Application of immunotherapy therapeutics in prostate adenocarcinoma slides
Application of immunotherapy therapeutics in prostate adenocarcinoma slidesApplication of immunotherapy therapeutics in prostate adenocarcinoma slides
Application of immunotherapy therapeutics in prostate adenocarcinoma slides
 
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
Clinical Genomics for Personalized Cancer Medicine: Recent Advances, Challeng...
 
HNSS Poster Draft v6
HNSS Poster Draft v6HNSS Poster Draft v6
HNSS Poster Draft v6
 
Beyond Eye Treatment
Beyond Eye TreatmentBeyond Eye Treatment
Beyond Eye Treatment
 
Dr. Hager Scholarship at Harvard (2012)
Dr. Hager Scholarship at Harvard (2012)Dr. Hager Scholarship at Harvard (2012)
Dr. Hager Scholarship at Harvard (2012)
 
Molecular imaging – a new field for a new world By Zaver M. Bhujwalla
Molecular imaging – a new field for a new world By Zaver M. BhujwallaMolecular imaging – a new field for a new world By Zaver M. Bhujwalla
Molecular imaging – a new field for a new world By Zaver M. Bhujwalla
 
Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...
Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...
Peter Hamilton on Next generation Imaging and Computer Vision in Pathology: p...
 
Circulating tumor cells in crc
Circulating tumor cells in crcCirculating tumor cells in crc
Circulating tumor cells in crc
 
Future direction in the management of high risk LOW GRADE GLIOMA
Future direction in the management of high risk LOW GRADE GLIOMAFuture direction in the management of high risk LOW GRADE GLIOMA
Future direction in the management of high risk LOW GRADE GLIOMA
 
High grade gliomas 8 august 2016
High grade gliomas 8 august 2016High grade gliomas 8 august 2016
High grade gliomas 8 august 2016
 
I1803056267
I1803056267I1803056267
I1803056267
 
molecular imaging with PET & SPECT
molecular imaging with PET & SPECTmolecular imaging with PET & SPECT
molecular imaging with PET & SPECT
 
Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...
Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...
Solitary cerebral metastasis from undiagnosed prostate cancer. potential role...
 
2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...
2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...
2014 CFTCC Annual Symposium: Circulating Tumor Cells: Linking Analysis to Sur...
 
Molecular Imaging
Molecular ImagingMolecular Imaging
Molecular Imaging
 

Similar to Indiana 4 2011 Final Final

Mining Gene Expression Data Focusing Cancer Therapeutics: A Digest
Mining Gene Expression Data Focusing Cancer Therapeutics: A DigestMining Gene Expression Data Focusing Cancer Therapeutics: A Digest
Mining Gene Expression Data Focusing Cancer Therapeutics: A DigestKaashivInfoTech Company
 
Gene expression profiling reveals molecularly and clinically distinct subtype...
Gene expression profiling reveals molecularly and clinically distinct subtype...Gene expression profiling reveals molecularly and clinically distinct subtype...
Gene expression profiling reveals molecularly and clinically distinct subtype...Yu Liang
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineJoel Saltz
 
Evolution of treatment strategies of brain tumors
Evolution of treatment strategies of brain tumorsEvolution of treatment strategies of brain tumors
Evolution of treatment strategies of brain tumorsAnil Gupta
 
Amelia glioblastoma
Amelia glioblastomaAmelia glioblastoma
Amelia glioblastomaAmelia Wan
 
Genomic oncology and personalized medicine
Genomic oncology and personalized medicine Genomic oncology and personalized medicine
Genomic oncology and personalized medicine C. Jeff Chang, MD, PhD
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Ronak Shah
 
Integrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataIntegrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataJoel Saltz
 
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...cscpconf
 
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...csandit
 
Geveart Lab SIMR Paper
Geveart Lab SIMR PaperGeveart Lab SIMR Paper
Geveart Lab SIMR PaperNathan Dalal
 
SNOMED CT concept model for molecular pathology_final.pptx
SNOMED CT concept model for molecular pathology_final.pptxSNOMED CT concept model for molecular pathology_final.pptx
SNOMED CT concept model for molecular pathology_final.pptxHariHaran685388
 
Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...Ronak Shah
 
E-book Thesis Sara Carvalho
E-book Thesis  Sara CarvalhoE-book Thesis  Sara Carvalho
E-book Thesis Sara CarvalhoSara Carvalho
 
CSF-Derived cell-free DNA for Diagnosis and Characterization of.pptx
CSF-Derived cell-free DNA for Diagnosis and Characterization of.pptxCSF-Derived cell-free DNA for Diagnosis and Characterization of.pptx
CSF-Derived cell-free DNA for Diagnosis and Characterization of.pptxAmit Ghosh
 
508 search for genomic and proteomic risk factors and protective factors asso...
508 search for genomic and proteomic risk factors and protective factors asso...508 search for genomic and proteomic risk factors and protective factors asso...
508 search for genomic and proteomic risk factors and protective factors asso...SHAPE Society
 

Similar to Indiana 4 2011 Final Final (20)

Mining Gene Expression Data Focusing Cancer Therapeutics: A Digest
Mining Gene Expression Data Focusing Cancer Therapeutics: A DigestMining Gene Expression Data Focusing Cancer Therapeutics: A Digest
Mining Gene Expression Data Focusing Cancer Therapeutics: A Digest
 
Gene expression profiling reveals molecularly and clinically distinct subtype...
Gene expression profiling reveals molecularly and clinically distinct subtype...Gene expression profiling reveals molecularly and clinically distinct subtype...
Gene expression profiling reveals molecularly and clinically distinct subtype...
 
ACC Cancer Cell May 2016
ACC Cancer Cell May 2016ACC Cancer Cell May 2016
ACC Cancer Cell May 2016
 
Digital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision MedicineDigital Pathology, FDA Approval and Precision Medicine
Digital Pathology, FDA Approval and Precision Medicine
 
Evolution of treatment strategies of brain tumors
Evolution of treatment strategies of brain tumorsEvolution of treatment strategies of brain tumors
Evolution of treatment strategies of brain tumors
 
Amelia glioblastoma
Amelia glioblastomaAmelia glioblastoma
Amelia glioblastoma
 
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
 
Genomic oncology and personalized medicine
Genomic oncology and personalized medicine Genomic oncology and personalized medicine
Genomic oncology and personalized medicine
 
Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...Developing a framework for for detection of low frequency somatic genetic alt...
Developing a framework for for detection of low frequency somatic genetic alt...
 
Integrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming DataIntegrative Everything, Deep Learning and Streaming Data
Integrative Everything, Deep Learning and Streaming Data
 
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
 
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
MEDICAL IMAGING MUTIFRACTAL ANALYSIS IN PREDICTION OF EFFICIENCY OF CANCER TH...
 
Geveart Lab SIMR Paper
Geveart Lab SIMR PaperGeveart Lab SIMR Paper
Geveart Lab SIMR Paper
 
SNOMED CT concept model for molecular pathology_final.pptx
SNOMED CT concept model for molecular pathology_final.pptxSNOMED CT concept model for molecular pathology_final.pptx
SNOMED CT concept model for molecular pathology_final.pptx
 
Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...Detecting clinically actionable somatic structural aberrations from targeted ...
Detecting clinically actionable somatic structural aberrations from targeted ...
 
E-book Thesis Sara Carvalho
E-book Thesis  Sara CarvalhoE-book Thesis  Sara Carvalho
E-book Thesis Sara Carvalho
 
UNMSymposium2014
UNMSymposium2014UNMSymposium2014
UNMSymposium2014
 
CSF-Derived cell-free DNA for Diagnosis and Characterization of.pptx
CSF-Derived cell-free DNA for Diagnosis and Characterization of.pptxCSF-Derived cell-free DNA for Diagnosis and Characterization of.pptx
CSF-Derived cell-free DNA for Diagnosis and Characterization of.pptx
 
508 search for genomic and proteomic risk factors and protective factors asso...
508 search for genomic and proteomic risk factors and protective factors asso...508 search for genomic and proteomic risk factors and protective factors asso...
508 search for genomic and proteomic risk factors and protective factors asso...
 
508 search for genomic and proteomic risk factors and protective factors asso...
508 search for genomic and proteomic risk factors and protective factors asso...508 search for genomic and proteomic risk factors and protective factors asso...
508 search for genomic and proteomic risk factors and protective factors asso...
 

More from Joel Saltz

AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersJoel Saltz
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillanceJoel Saltz
 
Learning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingLearning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingJoel Saltz
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Joel Saltz
 
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerExtreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerJoel Saltz
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeJoel Saltz
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeJoel Saltz
 
Pathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicinePathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicineJoel Saltz
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataJoel Saltz
 
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Joel Saltz
 
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...Joel Saltz
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeJoel Saltz
 
Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing Joel Saltz
 
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Joel Saltz
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Joel Saltz
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataJoel Saltz
 
High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-InformaticsJoel Saltz
 
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Joel Saltz
 
Data Science, Big Data and You
Data Science, Big Data and YouData Science, Big Data and You
Data Science, Big Data and YouJoel Saltz
 
Data and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical InformaticsData and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical InformaticsJoel Saltz
 

More from Joel Saltz (20)

AI and whole slide imaging biomarkers
AI and whole slide imaging biomarkersAI and whole slide imaging biomarkers
AI and whole slide imaging biomarkers
 
Pathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer SurveillancePathomics, Clinical Studies, and Cancer Surveillance
Pathomics, Clinical Studies, and Cancer Surveillance
 
Learning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale ComputingLearning, Training,  Classification,  Common Sense and Exascale Computing
Learning, Training,  Classification,  Common Sense and Exascale Computing
 
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
Digital Pathology: Precision Medicine, Deep Learning and Computer Aided Inter...
 
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure CancerExtreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
Extreme Computing, Clinical Medicine and GPUs or Can GPUs Cure Cancer
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
 
Twenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase ChangeTwenty Years of Whole Slide Imaging - the Coming Phase Change
Twenty Years of Whole Slide Imaging - the Coming Phase Change
 
Pathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision MedicinePathomics Based Biomarkers and Precision Medicine
Pathomics Based Biomarkers and Precision Medicine
 
Machine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of DataMachine Learning and Deep Contemplation of Data
Machine Learning and Deep Contemplation of Data
 
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
Integrative Multi-Scale Analysis in Biomedical Data Science: Tools, Methods a...
 
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...Tools to Analyze Morphology and Spatially Mapped Molecular Data -  Informatio...
Tools to Analyze Morphology and Spatially Mapped Molecular Data - Informatio...
 
Generation and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology PhenotypeGeneration and Use of Quantitative Pathology Phenotype
Generation and Use of Quantitative Pathology Phenotype
 
Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing Big Data and Extreme Scale Computing
Big Data and Extreme Scale Computing
 
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
Spatio-­‐temporal Sensor Integration, Analysis, Classification or Can Exascal...
 
Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014Computational Pathology Workshop July 8 2014
Computational Pathology Workshop July 8 2014
 
Exascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor DataExascale Computing and Experimental Sensor Data
Exascale Computing and Experimental Sensor Data
 
High Dimensional Fused-Informatics
High Dimensional Fused-InformaticsHigh Dimensional Fused-Informatics
High Dimensional Fused-Informatics
 
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
Exascale Challenges: Space, Time, Experimental Science and Self Driving Cars
 
Data Science, Big Data and You
Data Science, Big Data and YouData Science, Big Data and You
Data Science, Big Data and You
 
Data and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical InformaticsData and Computational Challenges in Integrative Biomedical Informatics
Data and Computational Challenges in Integrative Biomedical Informatics
 

Indiana 4 2011 Final Final

  • 1. Integrative Multi-Scale Biomedical Informatics Joel Saltz MD, PhD Director Center for Comprehensive Informatics
  • 2. Leverage exascale data and computer resources to squeeze the most out of image, sensor or simulation data Run lots of different algorithms to derive same features Run lots of algorithms to derive complementary features Data models and data management infrastructure to manage data products, feature sets and results from classification and machine learning algorithms Squeezing Information from Spatial Datasets
  • 3. Outline Integrative biomedical informatics analysis –feature sets obtained from Pathology and Radiology studies Techniques, tools and methodologies for derivation, management and analysis of feature sets
  • 4. INTEGrative Biomedical Informatics Analysis Reproducible anatomic/functional characterization at gross level (Radiology) and fine level (Pathology) Integration of anatomic/functional characterization with multiple types of “omic” information Create categories of jointly classified data to describe pathophysiology, predict prognosis, response to treatment
  • 5. In Silico Center for Brain Tumor Research Specific Aims: Influence of necrosis/ hypoxia on gene expression and genetic classification. 2. Molecular correlates of high resolution nuclear morphometry. Gene expression profiles that predict glioma progression. 4. Molecular correlates of MRI enhancement patterns.
  • 6.
  • 7. Exploit synergies between all initiatives to improve ability to forecast survival & response.Pathologic Features
  • 8. Example: Pathology and Gene Expression Joint Predictors of Recurrence/Survival Lee Cooper Carlos Moreno
  • 9. Feature Characterization in Pathology and Radiology Role – In silico Brain Tumor Research Algorithms Scaling Requirements
  • 10. In Silico Center for Brain Tumor Research Key Data Sets REMBRANDT: Gene expression and genomics data set of all glioma subtypes The Cancer Genome Atlas (TCGA): Rich “omics” set of GBM, digitized Pathology and Radiology Pathology and Radiology Images from Henry Ford Hospital, Emory, Thomas Jefferson U, MD Anderson and others
  • 11. TCGA Research Network Digital Pathology Neuroimaging
  • 12. Progression to GBM Anaplastic Astrocytoma (WHO grade III) Glioblastoma (WHO grade IV)
  • 13.
  • 14.
  • 15.
  • 17.
  • 22.
  • 23. Entrapped gray or white matter
  • 24.
  • 25. TCGA Whole Slide Images Feature Extraction Jun Kong
  • 26.
  • 27. Clustergram of selected features used in consensus clustering Nuclear Features Used to Classify GBMs
  • 28. Nuclear Features Used to Classify GBMs Consensus clustering of morphological signatures Study includes 200 million nuclei taken from 480 slides corresponding to 167 distinct patients.
  • 30. Survival of patients by molecular tumor subtype
  • 33. Multiscale Systems Biology Employ multi-resolution methods to characterize necrosis, angiogenesis and correlate these with “omics” Rim-enhancement Vascular Changes Rapid progression No enhancement Normal Vessels Stable lesion ?
  • 34. Correlation of Necrosis, Angiogenesis and “omics” GBMs display variable and regionally heterogeneous degrees of necrosis (asterisk) and angiogenesis These factors may impact gene expression profiles
  • 35.
  • 36. Extent of both necrosis and angiogenesis calculated as a percentage of total tissue areaCarro MS, et al. Nature 263: 318-25, 2010
  • 37. Feature Sets in Radiology(Adam Flanders, TJU; Dan Rubin, Stanford, Lori Dodd, NCI) Require standardized validated feature sets to describe de novo disease. Fundamental obstacle to new imaging criteria as treatment biomarkers is lack of standard terminology: To define a comprehensive set of imaging features of cancer For reporting imaging results To provide a more quantitative, reproducible basis for assessing baseline disease and treatment response
  • 38. Defining Rich Set of Qualitative and Quantitative Image Biomarkers Community-driven ontology development project; collaboration with ASNR Imaging features (5 categories) Locationof lesion Morphology of lesion margin(definition, thickness, enhancement, diffusion) Morphology of lesion substance (enhancement, PS characteristics, focality/multicentricity, necrosis, cysts, midline invasion, cortical involvement, T1/FLAIR ratio) Alterations in vicinity of lesion (edema, edema crossing midline, hemorrhage, pial invasion, ependymal invasion, satellites, deep WM invasion, calvarial remodeling) Resection features (extent of nCE tissue, CE tissue, resected components)
  • 39. Imaging Predictors of survival and molecular profiles in the TCGA Glioblastoma Data set The TCGA glioma working group 1Emory University Hospital, Atlanta, GA 2National Cancer Institute, Bethesda, MD.  3Thomas Jefferson University Hospital, Philadelphia, PA. 4Henry Ford University Hospital, Detroit, Michigan. 5National Institute of Health, Bethesda, MD.  6Boston University School of Medicine, Boston, MA. 7SAIC-Frederick, Inc., Frederick, MD. 8University of Virginia, Charlottesville, VA. 9Northwestern University Chicago, IL
  • 40. Assumed Dependence Between Features F1: Tumor Location F3: Eloq. Brain F5: Prop. Enhance F6: Prop. nCET F7: Prop. Necrosis F9: Distribution F10: T1/FLAIR F11: En. Marg.Thick. F13: Def. Non.Marg. F14: Prop. Edema F16: Hemorrhage F18: Pial Invasion F19: Ependymal F20: Cort. Involve. F21: Deep WM Inv. F22: nCET Cross. Mid. F24: Satellites F18 F20 F9 F3 F16 F7 F22 F21 F1 F6 F11 F5 F14 F19 F10 F13 F24 NOTE: each feature omitted from this graph is independent of every other feature. Slide thanks to Eric Huang, NIH
  • 41. Estimation Problem Size Reduction Can ignore seven of the features  size of contingency table reduced from 2.64 × 1012 cells to 1.34 × 1010 cells. Collapsibility reduces size of contingency table even further: Any binary feature Fj connected to only one feature on graph (i.e. given the feature Fj is connected to, Fj is independent of all other features) can also be ignored Eliminates need to deal with Hemorrhage (F16), Pial Invasion (F18), Cortical Involvement (F20), and Satellites (F24). Reduces size of contingency table to 1.68 × 109 cells. Additional analogous considerations can be used to reduce size of contingency table by more than two additional orders of magnitude Slide thanks to Eric Huang, NIH
  • 42. Correlative Imaging Results Minimal enhancing tumor (≤5%) strongly associated with Proneural classification (p=0.0006). >5% proportion of necrosis and the presence of microvascular hyperplasia in pathology slides (p=0.008). Greater maximum tumor dimension (T2 signal) associated with present/abundant microvascular hyperplasia (p=0.001). < 5% Enhancement
  • 43. Correlative Imaging Results TP53 mutant tumors had a smaller mean tumor sizes (p=0.002) on T2-weighted or FLAIR images. EGFR mutant tumors were significantly larger than TP53 mutant tumors (p=0.0005). High level EGFR amplification was associated with >5% enhancement and >5% proportion of necrosis (p < 0.01). > 5% Necrosis
  • 44. Squeezing Information from Spatial Datasets Leverage exascale data and computer resources to squeeze the most out of image, sensor or simulation data Run lots of different algorithms to derive same features Run lots of algorithms to derive complementary features Data models and data management infrastructure to manage data products, feature sets and results from classification and machine learning algorithms
  • 45. Pipeline for Whole Slide Feature Characterization 1010 pixels for each whole slide image 10 whole slide images per patient 108 image features per whole slide image 10,000 brain tumor patients 1015 pixels 1013 features Hundreds of algorithms Annotations and markups from dozens of humans
  • 46.
  • 47. Represented by a complex data model capturing multi-faceted information including markups, annotations, algorithm provenance, specimen, etc.
  • 48.
  • 49. Represented by a complex data model capturing multi-faceted information including markups, annotations, algorithm provenance, specimen, etc.
  • 50. Support for complex relationships and spatial query: multi-level granularities, relationships between markups and annotations, spatial and nested relationships
  • 51.
  • 52. Pathology Imaging GIS Image analysis PAIS model PAIS data management Modeling and management of markup and annotation for querying and sharing through parallel RDBMS + spatial DBMS Segmentation HDFS data staging MapReduce based queries On the fly data processing for algorithm validation/algorithm sensitivity studies, or discovery of preliminary results Feature extraction
  • 53. Generation and Analysis of Imaging Features In-transit data processing using filter/stream systems Semantic Workflows Hierarchical pipeline design with coarse and fine grained components Adaptivity and Quality of Service
  • 54. Same basic story in multiple domains
  • 55. Classification using DataCutter Filter Stream Workflow
  • 56. Slides’ Preparation 8x 40x 64990 x 59412 pixels in full resolution Original Size: 10.8 Gb; Compressed Sized: ≈ 833Mb
  • 57. Computerized Classification System for Grading Neuroblastoma Background? Yes Image Tile Label Initialization I = L No Create Image I(L) Training Tiles Segmentation I = I -1 Down-sampling Feature Construction Segmentation Yes No I > 1? Feature Extraction Feature Construction Feature Extraction Classification Classifier Training Within Confidence Region ? No Yes TRAINING TESTING Background Identification Image Decomposition (Multi-resolution levels) Image Segmentation (EMLDA) Feature Construction (2nd order statistics, Tonal Features) Feature Extraction (LDA) + Classification (Bayesian) Multi-resolution Layer Controller (Confidence Region)
  • 58. Segmentation A typical segmentation result of an image from undifferentiated class with components segmented by this method is shown. (a) Original image; (b) Partitioned image shown in color; (c)Nuclei; (d)Cytoplasm; (e)Neuropil; (f)Background component.
  • 59.
  • 60. Search for the most appropriate implementation of both components and workflows
  • 62. Select among implementation variants of the same computation
  • 63. Derive integer values of optimization parameters
  • 64. Only search promising code variants and a restricted parameter space
  • 66.
  • 67. Framework Description Module (Wings): Describe application workflow using semantics of workflow components Execution module (Pegasus, DataCutter, Condor): Maps to resources, generates and places fine grained filter/stream pipelines Tradeoff Module: Schedules execution based on application level QOS
  • 68.
  • 70.
  • 71.
  • 72.
  • 73. Minority Health Genomics and Translational Research Bio-Repository Database (MH-GRID)
  • 74. ACTSI Cardiovascular, Diabetes, Brain Tumor Registry
  • 76. CFAR (Center for AIDS Research) HIV/Cancer Project
  • 77. Radiation Therapy and Quantitative Imaging
  • 78. Integrative Analysis of Text and Discrete Data Related to Smoking Cessation and Asthma
  • 79.
  • 81. Thanks to: In silico center team: Dan Brat (Science PI), Tahsin Kurc, Ashish Sharma, Tony Pan, David Gutman, Jun Kong, Sharath Cholleti, Carlos Moreno, Chad Holder, Erwin Van Meir, Daniel Rubin, Tom Mikkelsen, Adam Flanders, Joel Saltz (Director) caGrid Knowledge Center: Joel Saltz, Mike Caliguiri, Steve Langella co-Directors; Tahsin Kurc, Himanshu Rathod Emory leads caBIG In vivo imaging team: Eliot Siegel, Paul Mulhern, Adam Flanders, David Channon, Daniel Rubin, Fred Prior, Larry Tarbox and many others In vivo imaging Emory team: Tony Pan, Ashish Sharma, Joel Saltz Emory ATC Supplement team: Tim Fox, Ashish Sharma, Tony Pan, Edi Schreibmann, Paul Pantalone Digital Pathology R01: Foran and Saltz; Jun Kong, Sharath Cholleti, Fusheng Wang, Tony Pan, Tahsin Kurc, Ashish Sharma, David Gutman (Emory), Wenjin Chen, Vicky Chu, Jun Hu, Lin Yang, David J. Foran (Rutgers) NIH/in silico TCGA Imaging Group: Scott Hwang, Bob Clifford, Erich Huang, DimaHammoud, ManalJilwan, PrashantRaghavan, Max Wintermark, David Gutman, Carlos Moreno, Lee Cooper, John Freymann, Justin Kirby, Arun Krishnan, Seena Dehkharghani, Carl Jaffe ACTSI Biomedical Informatics Program: Marc Overcash, Tim Morris, Tahsin Kurc, Alexander Quarshie, Circe Tsui, Adam Davis, Sharon Mason, Andrew Post, Alfredo Tirado-Ramos NSF Scientific Workflow Collaboration: Vijay Kumar, Yolanda Gil, Mary Hall, EwaDeelman, Tahsin Kurc, P. Sadayappan, Gaurang Mehta, Karan Vahi