#globusworld
@globus
New to Globus?
Join us for Globus 101++
tomorrow @ 11am–2:30pm CDT
globusworld.org/program
GlobusWorld 2020 Sponsors
A Decade of
Enabling Science
Ian Foster
Rachana Ananthakrishnan
Vas Vasiliadis
April 29, 2020
Our Mission
Increase the efficiency and
effectiveness of researchers
engaged in data-driven
science and scholarship
through sustainable software
Our mission today
[Figure: nCoV screening pipeline. Labels: Chemical Databases (cureFFI, GDB, MOSES, ZINC15, and more…), Canonicalization, Compute Features, Fingerprinting, Generate Images, ML-based filtering, Similarity Search, DNN filtering, Computing Resources]
Supporting the nCoV collaboration
2019-ncovgroup.github.io
First release: 21 sources, 3.9B molecules, 80 TB computed features
ENAMINE REAL: 1.2 billion molecules
that comply with the “rule of 5” and Veber
criteria: MW ≤ 500, SlogP ≤ 5, HBA ≤ 10,
HBD ≤ 5, rotatable bonds ≤ 10, TPSA ≤ 140.
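The thresholds above can be read as a simple pass/fail filter. The sketch below is only an illustration of that reading: it assumes the descriptors have already been computed elsewhere, and the `Descriptors` class, its field names, and the sample values are hypothetical rather than part of the nCoV pipeline.

```python
from dataclasses import dataclass


@dataclass
class Descriptors:
    """Precomputed molecular descriptors (field names are illustrative)."""
    mw: float              # molecular weight
    slogp: float           # SlogP
    hba: int               # hydrogen-bond acceptors
    hbd: int               # hydrogen-bond donors
    rotatable_bonds: int   # number of rotatable bonds
    tpsa: float            # topological polar surface area


def passes_filter(d: Descriptors) -> bool:
    """True when every descriptor is within the limits quoted on the slide."""
    return (
        d.mw <= 500
        and d.slogp <= 5
        and d.hba <= 10
        and d.hbd <= 5
        and d.rotatable_bonds <= 10
        and d.tpsa <= 140
    )


# Made-up descriptor values, for illustration only.
small = Descriptors(mw=320.4, slogp=2.1, hba=4, hbd=2, rotatable_bonds=5, tpsa=78.0)
heavy = Descriptors(mw=612.7, slogp=2.1, hba=4, hbd=2, rotatable_bonds=5, tpsa=78.0)
print(passes_filter(small), passes_filter(heavy))
```

The second molecule fails only on molecular weight, which shows that a single out-of-range descriptor is enough to exclude a candidate.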
Katrin Heitmann
Google Cloud upload 5GB
Google Cloud download 5GB
Charting future missions
Globus Labs Mission
To make research data
reliably, rapidly, and securely
accessible, discoverable, and usable
By…
Developing an automated and scalable
platform for reproducible research that
can exploit heterogeneous resources
that span the computing continuum
[Figure: Globus Labs platform diagram. Labels: Write programs, Function fabric, Data/Trust fabric, Flows, Model registry, Cost map, Metadata Extraction; ƒuncX, Globus, Automate, DLHub, SCRIMP, Xtract]
funcX: distributed function as a service
• Portable code: Python; Docker, Shifter, Singularity
• Any access: Cloud API, cluster or HPC scheduler
• Any computer: Clusters, clouds, HPC, accelerators
funcX: Transform clouds, clusters, and supercomputers into
high-performance function serving systems
Simply deploy a funcX endpoint to transform
a computer into a function serving system
[Figure: functions f(x), g(x), h(x), k(x), … and their dependencies are registered (Register, repo2docker) and served by endpoints EP(x)]
[Figure: Registration: f(x), g(x), … + dependencies (Register, repo2docker), EP(x) registry. Execution: f(x), … over inputs [1, 2, 3 … n] on endpoints EP(x)]
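The register-then-execute pattern in the figures above can be sketched locally. This is a toy stand-in built on the Python standard library, not the funcX SDK: `ToyEndpoint`, `register`, and `run` are hypothetical names, and a thread pool plays the role of the remote endpoint.

```python
import uuid
from concurrent.futures import Future, ThreadPoolExecutor
from typing import Any, Callable, Dict


class ToyEndpoint:
    """Local stand-in for an endpoint: stores registered functions and runs them."""

    def __init__(self, workers: int = 4) -> None:
        self._functions: Dict[str, Callable[..., Any]] = {}
        self._pool = ThreadPoolExecutor(max_workers=workers)

    def register(self, func: Callable[..., Any]) -> str:
        """Registration: store the function and hand back an opaque identifier."""
        function_id = str(uuid.uuid4())
        self._functions[function_id] = func
        return function_id

    def run(self, function_id: str, *args: Any) -> Future:
        """Execution: look the function up by identifier and run it asynchronously."""
        return self._pool.submit(self._functions[function_id], *args)


def square(x: int) -> int:
    return x * x


endpoint = ToyEndpoint()
square_id = endpoint.register(square)

# Apply the registered function to a batch of inputs, as in "[1, 2, 3 … n]".
futures = [endpoint.run(square_id, n) for n in [1, 2, 3, 4]]
results = [f.result() for f in futures]
print(results)
```

Callers hold only the function identifier and a future, so where the function actually runs stays an endpoint concern.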
Parsl: parallel programming in Python
arxiv.org/pdf/1905.02158
parsl-project.org
Cost-aware computing with
heterogeneous platforms
Incremental construction of a personalized cost map
• Build black-box performance models from observed
execution times for different codes on different
platforms
• Transfer learning across codes, problem sizes, and
hardware platforms
• Experiment design to choose experiments that maximize
reduction in uncertainty
• Evolve models over time as codes and platforms change
• Use models for instance selection and scheduling
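The first and last bullets above (build a black-box model from observed times, then use it for instance selection) can be illustrated with a deliberately small sketch. It assumes runtime grows linearly with problem size, which is a simplification chosen for the example; the platform names, prices, and timings are made up, and transfer learning and experiment design are not covered.

```python
from typing import Dict, List, Tuple

Observation = Tuple[float, float]  # (problem size, observed runtime in seconds)


def fit_line(points: List[Observation]) -> Tuple[float, float]:
    """Ordinary least squares for runtime = intercept + slope * size."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def cheapest_platform(
    observations: Dict[str, List[Observation]],
    price_per_second: Dict[str, float],
    size: float,
) -> str:
    """Pick the platform with the lowest predicted cost for a given problem size."""
    costs = {}
    for platform, points in observations.items():
        intercept, slope = fit_line(points)
        predicted_runtime = intercept + slope * size
        costs[platform] = predicted_runtime * price_per_second[platform]
    return min(costs, key=costs.get)


# Hypothetical observations and prices, for illustration only.
observed = {
    "general": [(1, 12.0), (2, 22.0), (4, 42.0)],
    "compute": [(1, 7.0), (2, 12.0), (4, 22.0)],
}
prices = {"general": 0.010, "compute": 0.015}
choice = cheapest_platform(observed, prices, size=8)
print(choice)
```

Each new observation can be appended to `observed` and the fit repeated, which is one simple way to let such a model evolve as codes and platforms change.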
Example: A cost map for bioinformatics applications on
different AWS instance types
[Figure: cost map; axes: Virtual CPUs, RAM (GB)]
IndexBam performs better on compute-optimized instances. Poorly chosen
experiments mislead the model.
On average, within 30% of final error after 4 experiments and within 2.3% after 6
Metadata extraction at the edge
• Dynamic extraction pipelines composed of many
independent extractors
– Metadata and content (images, text, tables, maps, …)
• Centralized vs edge extractor execution to weigh
tradeoffs between compute and transfer costs
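The centralized-versus-edge tradeoff above can be framed as a comparison of two time estimates. The sketch below is a back-of-the-envelope model under stated assumptions, not how Xtract decides: the function name, parameters, and numbers are hypothetical, and only transfer and compute time are considered.

```python
def extraction_site(
    file_bytes: float,
    metadata_bytes: float,
    bandwidth_bytes_per_s: float,
    edge_seconds: float,
    central_seconds: float,
) -> str:
    """Return 'edge' or 'central', whichever has the lower estimated total time."""
    # Edge: extract where the data lives, then ship only the metadata.
    edge_total = edge_seconds + metadata_bytes / bandwidth_bytes_per_s
    # Central: ship the whole file first, then extract on the central resource.
    central_total = file_bytes / bandwidth_bytes_per_s + central_seconds
    return "edge" if edge_total <= central_total else "central"


# Hypothetical numbers: a 10 GB file and a 10 MB file over a 100 MB/s link.
large = extraction_site(10e9, 1e6, 100e6, edge_seconds=60, central_seconds=10)
small = extraction_site(10e6, 1e6, 100e6, edge_seconds=60, central_seconds=10)
print(large, small)
```

With these numbers the large file is cheaper to process at the edge, while the small file is cheaper to move to the faster central resource.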
DLHub: model publication and serving
dlhub.org
arxiv.org/abs/1811.11213
Assets:
RNAseq, variants,
patient phenotypes, expression
profiles to small molecules
At multiple sites:
Managed/hosted
by specialists
Goals:
Increase discoverability
Combine, reuse, share assets
Increase analysis, enabling
clinical research
NIH Common Fund Data Ecosystem
Data
automation
Data Ingest
Index
Search
Analyze
Product updates
Simplifying the Globus Connect Personal Experience
• Option to log in from the application during installation
• Setup key method available for automation use cases
• Available next week
The new Globus Connect v5
architecture provides numerous new
features for users and administrators,
and serves as a platform for richer
data management capabilities.
For users and developers
• Web addressable storage system in addition to bulk data
access
• Credential management for cloud storage systems
• No re-authentication needed for duration of tasks
• Eliminate user certificates and move to OAuth tokens
• …
For administrators
• Single DTN pool connects multiple storage systems
• Eliminate need for shared file system across DTNs
• Complete backup and recovery solution
• Configuration management API
• …
Next point release GCSv5.4
• Targeted for May 2020
• Deployments with multiple DTNs
• Support both standard data access and high assurance access
• Custom mapping from user identity (user@domain.edu) to local
account
• Role based management for GCS
• Guest collection root selection via browse
• Connectors supported:
– POSIX, Google Drive, Google Cloud, Box, Ceph, AWS S3, SpectraLogic
Black Pearl
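The custom identity mapping bullet above describes turning an identity such as user@domain.edu into a local account name. The sketch below shows the idea in plain Python; it is not the Globus Connect Server configuration format, and `map_identity`, the override table, and the sample identities are hypothetical.

```python
import re
from typing import Dict, Optional

# Very small identity pattern: "<user>@<domain>".
_IDENTITY = re.compile(r"^(?P<user>[A-Za-z0-9._-]+)@(?P<domain>[A-Za-z0-9.-]+)$")


def map_identity(
    identity: str,
    allowed_domain: str,
    overrides: Optional[Dict[str, str]] = None,
) -> Optional[str]:
    """Map an identity to a local account name, or None when no mapping applies."""
    # Explicit per-user overrides win over the default rule.
    if overrides and identity in overrides:
        return overrides[identity]
    match = _IDENTITY.match(identity)
    if match is None or match.group("domain").lower() != allowed_domain.lower():
        return None
    # Default rule: the part before "@" becomes the local account name.
    return match.group("user").lower()


print(map_identity("user@domain.edu", "domain.edu"))
print(map_identity("jane.doe@domain.edu", "domain.edu", {"jane.doe@domain.edu": "jdoe"}))
print(map_identity("user@other.org", "domain.edu"))
```

Returning `None` for identities outside the allowed domain keeps the default rule from silently granting access to unexpected accounts.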
GCSv5 - Multiple DTNs architecture
Globus Connect Server v5
• Continue to add
features as point
releases
• Migration tools
from v4 to v5
Access Google Cloud Storage
and other on-prem/cloud storage
via the same familiar interface
Data-appropriate storage
Google Drive for project admin files
Google Cloud Storage for core research files
Fire-and-forget transfers to
Google storage resources
e.g. automatic retry on errors
Maximize value of your
Google cloud investment
Including sharing data with collaborators
Continue to grow S3 compatible systems
Globus Connectors
ActiveScale
Object
Storage
Growing the connector ecosystem
Other product updates
• For users: Several new features in web app
– Consolidated view options, HTTPS upload/download via browser,
custom message on access, accessibility improvements…
• For admins: Transfer updates for checksum handling
– Support for additional algorithms (SHA1, SHA256, SHA512), custom
checksum value to verify file integrity
• For developers: Globus Groups platform service
– First release with minimal features to get group membership
information
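The checksum bullet above names SHA1, SHA256, and SHA512 and a custom checksum value for verifying file integrity. A minimal local sketch of that check with Python's `hashlib` follows; it illustrates the verification idea only and does not use the Globus Transfer API. The helper names and the sample payload are assumptions.

```python
import hashlib
import os
import tempfile

SUPPORTED = {"sha1", "sha256", "sha512"}


def file_checksum(path: str, algorithm: str = "sha256", chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files do not need to fit in memory."""
    if algorithm not in SUPPORTED:
        raise ValueError(f"unsupported algorithm: {algorithm}")
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected_hex: str, algorithm: str = "sha256") -> bool:
    """Compare the file's checksum with a caller-supplied expected value."""
    return file_checksum(path, algorithm) == expected_hex.lower()


# Demonstration on a temporary file with made-up contents.
payload = b"example research data"
with tempfile.NamedTemporaryFile(delete=False) as handle:
    handle.write(payload)
    path = handle.name
try:
    ok = verify(path, hashlib.sha256(payload).hexdigest())
    tampered = verify(path, hashlib.sha256(b"other bytes").hexdigest())
finally:
    os.remove(path)
print(ok, tampered)
```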
Some of the research
we’re enabling…
NIH data access at scale for cancer researchers
DataCite switches to Globus Auth for authentication
• Globus Auth to secure
their Profiles services
• Brings federated login
to DataCite users
• Ongoing collaboration
to use Globus Auth for
securing other APIs
• Globus to use DataCite
for persistent identifiers
blog.datacite.org/globus-authentication
Cancer Registry Records for Research (CR3)
• Vision: enable broad, controlled, access to cancer
patient data
• Solution: Build a network of federated cancer registries
– Self service data exploration across registries
– Secure, auditable, access controls for data sharing
• Federation via Globus: network scale, local control
– Owners input/export data, apply QC, set access
policies
– Registry data remain at generating institution
– Identities provided/authenticated by the institution
CR3 Architecture
[Figure: CR3 Discovery Portal architecture]
• Researcher logs in with UPMC/Pitt credentials
• Auth: authentication initiated to Globus Auth (UPMC/Pitt identity providers)
• Search: cohort search initiated to Globus Search; cohort aggregate counts returned
• Cancer Registry De-identified Data Index (minimal criteria data: e.g., staging)
• Request Service: registry staff manage authorization
• Transfer: data transfer from registrar to researcher mediated by Globus
Programmatic adoption of Globus
“…over 60 research groups …moving over 2PB of
data off aging near-line storage…”
“Globus sharing and group functionality have also
eased the thorny issue of sharing access with
remote collaborators in a more controlled manner.”
www.technology.pitt.edu/blog/globus
Instrument data delivery at scale
Use Globus to deliver
100s of TB of genomic
data to researchers
Credits: Joe George, University of Michigan
Simplified data sharing for ALCF users
Argonne Leadership Computing
Facility (ALCF) “Eagle” provides
a 50 PB community file system to
make data-sharing easier than
ever among ALCF users, their
collaborators and with third
parties.
Eagle Community
File System
Globus sharing
Looking ahead…
Current service enhancements
• MFA policy for data access
• IPv6 support
• Conditional fault handling
• Enhancements for storage
with staging requirements
• Enhancements to
application registration
and management
• Groups service
– Membership API
– Management API
Platform Challenge
Transform how research applications and services are…
created, used and
delivered
orchestrated to
achieve automation
sustained
Enable an interoperable ecosystem of research
applications and services
Globus platform services
• Identity and Access Management (IAM)
– Auth
– Groups
• Data Services
– Connect
– Transfer
– Manifest
• Search
• Identifiers (collaboration with DataCite)
• Flows
Globus Platform: Automation
Automation Action Providers
[Figure: Globus action providers and custom action providers. Labels: Transfer, Delete, ACLs, Search, Ingest, Identifier, Describe, Web Form, User Form, Notification, Expression Evaluation, DLHub, Xtract, funcX]
Enabling serial crystallography at scale
• Serially image chips with
thousands of embedded crystals
• Quality control first 1,000 to report
failures
• Analyze batches of images as they
are collected
• Report statistics and images during
experiment
• Return crystal structure to scientist
Darren Sherrell, Gyorgy Babnigg, Andrzej Joachimiak
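The quality-control bullet above (check the first 1,000 images and report failures) can be sketched as a threshold test. The notion of a per-image "hit", the 5% threshold, and the function name are assumptions made for this illustration and are not taken from the SSX pipeline.

```python
from typing import Optional, Sequence


def quality_check(
    hits: Sequence[bool],
    sample_size: int = 1000,
    min_hit_rate: float = 0.05,
) -> Optional[bool]:
    """Judge a run from its first `sample_size` images.

    Returns None while too few images have arrived, True when the hit rate
    meets the threshold, and False when the run should be reported as failing.
    """
    sample = hits[:sample_size]
    if len(sample) < sample_size:
        return None
    return sum(sample) / len(sample) >= min_hit_rate


# Made-up per-image results: one hit in every ten images, then no hits at all.
good_run = [i % 10 == 0 for i in range(1500)]
bad_run = [False] * 1500
print(quality_check(good_run), quality_check(bad_run), quality_check(good_run[:200]))
```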
SSX Automation
[Figure: SSX flow steps (action provider: step). Auth: Get credentials; Transfer: Transfer data; funcX: Preprocess; Stop?: Threshold; funcX: Analyze; Publish: Publish results; Transfer: Return results]
PaaS: develop custom action providers
• Directly use the platform to build and run
extensible flows
• Develop action providers
– Fit for purpose
– Developed and deployed by the project
– Plugged into their flows
• Action Provider Development toolkit
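The idea of project-specific action providers plugged into a flow can be sketched with a small registry. This is a toy model, not the Globus Automate API or the Action Provider Development toolkit: `action_provider`, `run_flow`, and both sample actions are hypothetical names introduced for the example.

```python
from typing import Any, Callable, Dict, List

State = Dict[str, Any]
Action = Callable[[State], State]

# Registry mapping action names to callables.
PROVIDERS: Dict[str, Action] = {}


def action_provider(name: str) -> Callable[[Action], Action]:
    """Decorator that registers a callable under an action name."""
    def decorator(func: Action) -> Action:
        PROVIDERS[name] = func
        return func
    return decorator


@action_provider("transfer")
def transfer(state: State) -> State:
    # Stand-in for a platform-provided action: record the new data location.
    return {**state, "location": state["destination"]}


@action_provider("count_words")
def count_words(state: State) -> State:
    # Stand-in for a custom, project-specific action.
    return {**state, "word_count": len(state["text"].split())}


def run_flow(steps: List[str], state: State) -> State:
    """Run the named actions in order, threading the state through each one."""
    for name in steps:
        state = PROVIDERS[name](state)
    return state


final = run_flow(
    ["transfer", "count_words"],
    {"destination": "analysis-cluster", "text": "three word sample"},
)
print(final["location"], final["word_count"])
```

Because a flow is only a list of action names, a project can swap or add its own providers without changing the runner.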
XPCS: X-ray Photon Correlation Spectroscopy
[Figure: XPCS workflow across APS, Argonne JLSE, Argonne Leadership Computing Facility, and the ALCF Data Portal. Labeled steps: 1 Imaging, 2 Acquisition (Lab Server 1), 3 XPCS-Eigen, 4 Plot results, 5 Publication, 6 Science!]
• Automate flows stage data to ALCF for on-demand analysis and publication
• Metadata and plots dynamically extracted, and published into a search catalog
• Scientists can select datasets and initiate flows to perform batch analysis tasks
Suresh Narayanan, Nicholas Schwarz
Automating XPCS
[Figure: XPCS flow steps (action provider: step), built with Automate. Auth: Get credentials; Transfer: Transfer HDF5; Transfer: Transfer IMM; funcX: Run Corr; funcX: Plot Results; Transfer: Return Results; Share: Set ACL; Search: Ingest]
SaaS: instrument data management
• Templated solution
• Configurable…
– Set transfer triggers
– Select destination(s)
– Define metadata
• Extensible…
– Add/remove actions
– Change action providers
• No development required
[Figure: instrument data flow. Instruments: Cryo EM, Lightsheet, Sequencer, …; automated egress from device (--/cohort045, --/cohort096, --/cohort127); Transfer; funcX: image reconstruction, analysis, visualization; Xtract: indexing for search]
Materials Data Facility
> 40 TB of data
> 320 published
authors
> 400 datasets
• Accept data from many
locations with flexible
interfaces
• Index dataset contents in
science-aware ways
• Dispatch data to the
community
• Using Automate to
simplify building
composable flows of
services
MDF Data Publication Automation
[Figure: MDF publication flow steps (action provider: step), built with Automate. Ingest: Bulk Ingest; Auth: Get Credentials; Transfer: Transfer Dataset; XTract: Extract Metadata; Share: Set permissions; Transfer: Move metadata; Transfer: Transfer Dataset; Transfer: Transfer Dataset; Identifier: Mint DOI; Web form: Metadata; Notify: Notify Curator; Web form: Curation; Notify: Notify user]
SaaS: Data Management Plans
• “Turnkey” DMP enablement
• Select dataset (collection)…
• …add metadata for indexing
• …generate persistent ID
(DOI, ARK, etc.)
Transfer
Identifier
Ingest
“Point & Click” to
findable and
accessible data
Data portals currently leveraging the platform
Sustainability Update
Why subscribe?
To go (way) beyond file transfer…
• Remove friction for external collaborators
• Automate/scale research data flows
• Diversify research storage options—with a unified interface
• Gain visibility into research storage utilization
• Integrate robust data management into research apps
• Optimize data transfer performance
• Access expert support resources
To help our community share the load…
[Figure: Active Endpoints by Month, 2015/04 to 2019/12; series: Subscribed, Free; vertical axis 0 to 6000]
Thank you, funders...
U.S. Department of Energy
Thank you, GlobusWorld sponsors

Contenu connexe

Tendances

Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)Globus
 
Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)Globus
 
GlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobus
 
Globus Portal Framework (APS Workshop)
Globus Portal Framework (APS Workshop)Globus Portal Framework (APS Workshop)
Globus Portal Framework (APS Workshop)Globus
 
Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Globus
 
Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...
Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...
Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...Globus
 
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobus
 
GlobusWorld 2021 Tutorial: Building with the Globus Platform
GlobusWorld 2021 Tutorial: Building with the Globus PlatformGlobusWorld 2021 Tutorial: Building with the Globus Platform
GlobusWorld 2021 Tutorial: Building with the Globus PlatformGlobus
 
"What's New With Globus" Webinar: Spring 2018
"What's New With Globus" Webinar: Spring 2018"What's New With Globus" Webinar: Spring 2018
"What's New With Globus" Webinar: Spring 2018Globus
 
Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)Globus
 
Introduction to Globus for New Users (GlobusWorld Tour - UCSD)
Introduction to Globus for New Users (GlobusWorld Tour - UCSD)Introduction to Globus for New Users (GlobusWorld Tour - UCSD)
Introduction to Globus for New Users (GlobusWorld Tour - UCSD)Globus
 
Data Orchestration at Scale (GlobusWorld Tour West)
Data Orchestration at Scale (GlobusWorld Tour West)Data Orchestration at Scale (GlobusWorld Tour West)
Data Orchestration at Scale (GlobusWorld Tour West)Globus
 
Tutorial: Leveraging Globus in your Research Applications
Tutorial: Leveraging Globus in your Research ApplicationsTutorial: Leveraging Globus in your Research Applications
Tutorial: Leveraging Globus in your Research ApplicationsGlobus
 
Gateways 2020 Tutorial - Large Scale Data Transfer with Globus
Gateways 2020 Tutorial - Large Scale Data Transfer with GlobusGateways 2020 Tutorial - Large Scale Data Transfer with Globus
Gateways 2020 Tutorial - Large Scale Data Transfer with GlobusGlobus
 
Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)Globus
 
Gateways 2020 Tutorial - Automated Data Ingest and Search with Globus
Gateways 2020 Tutorial - Automated Data Ingest and Search with GlobusGateways 2020 Tutorial - Automated Data Ingest and Search with Globus
Gateways 2020 Tutorial - Automated Data Ingest and Search with GlobusGlobus
 
Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17Mary Bass
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisYaoyu Wang
 
20090701 Climate Data Staging
20090701 Climate Data Staging20090701 Climate Data Staging
20090701 Climate Data StagingHenning Bergmeyer
 
Recent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using GlobusRecent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using GlobusGlobus
 

Tendances (20)

Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)Connecting Your System to Globus (APS Workshop)
Connecting Your System to Globus (APS Workshop)
 
Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)Globus Command Line Interface (APS Workshop)
Globus Command Line Interface (APS Workshop)
 
GlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System AdministratorsGlobusWorld 2021 Tutorial: Globus for System Administrators
GlobusWorld 2021 Tutorial: Globus for System Administrators
 
Globus Portal Framework (APS Workshop)
Globus Portal Framework (APS Workshop)Globus Portal Framework (APS Workshop)
Globus Portal Framework (APS Workshop)
 
Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)Introduction to Globus (APS Workshop)
Introduction to Globus (APS Workshop)
 
Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...
Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...
Gladier: The Globus Architecture for Data Intensive Experimental Research (AP...
 
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDKGlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
GlobusWorld 2021 Tutorial: The Globus CLI, Platform and SDK
 
GlobusWorld 2021 Tutorial: Building with the Globus Platform
GlobusWorld 2021 Tutorial: Building with the Globus PlatformGlobusWorld 2021 Tutorial: Building with the Globus Platform
GlobusWorld 2021 Tutorial: Building with the Globus Platform
 
"What's New With Globus" Webinar: Spring 2018
"What's New With Globus" Webinar: Spring 2018"What's New With Globus" Webinar: Spring 2018
"What's New With Globus" Webinar: Spring 2018
 
Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)Introduction to Globus (GlobusWorld Tour West)
Introduction to Globus (GlobusWorld Tour West)
 
Introduction to Globus for New Users (GlobusWorld Tour - UCSD)
Introduction to Globus for New Users (GlobusWorld Tour - UCSD)Introduction to Globus for New Users (GlobusWorld Tour - UCSD)
Introduction to Globus for New Users (GlobusWorld Tour - UCSD)
 
Data Orchestration at Scale (GlobusWorld Tour West)
Data Orchestration at Scale (GlobusWorld Tour West)Data Orchestration at Scale (GlobusWorld Tour West)
Data Orchestration at Scale (GlobusWorld Tour West)
 
Tutorial: Leveraging Globus in your Research Applications
Tutorial: Leveraging Globus in your Research ApplicationsTutorial: Leveraging Globus in your Research Applications
Tutorial: Leveraging Globus in your Research Applications
 
Gateways 2020 Tutorial - Large Scale Data Transfer with Globus
Gateways 2020 Tutorial - Large Scale Data Transfer with GlobusGateways 2020 Tutorial - Large Scale Data Transfer with Globus
Gateways 2020 Tutorial - Large Scale Data Transfer with Globus
 
Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)Enabling Secure Data Discoverability (SC21 Tutorial)
Enabling Secure Data Discoverability (SC21 Tutorial)
 
Gateways 2020 Tutorial - Automated Data Ingest and Search with Globus
Gateways 2020 Tutorial - Automated Data Ingest and Search with GlobusGateways 2020 Tutorial - Automated Data Ingest and Search with Globus
Gateways 2020 Tutorial - Automated Data Ingest and Search with Globus
 
Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17Globus: Research Data Management as Service and Platform - pearc17
Globus: Research Data Management as Service and Platform - pearc17
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysis
 
20090701 Climate Data Staging
20090701 Climate Data Staging20090701 Climate Data Staging
20090701 Climate Data Staging
 
Recent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using GlobusRecent Upgrades to ARM Data Transfer and Delivery Using Globus
Recent Upgrades to ARM Data Transfer and Delivery Using Globus
 

Similaire à GlobusWorld 2020 Keynote

Simplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformSimplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformGlobus
 
Science cloud foster june 2013
Science cloud foster june 2013Science cloud foster june 2013
Science cloud foster june 2013Kirill Osipov
 
Science as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryScience as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryIan Foster
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSEd Dodds
 
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryA Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryIan Foster
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisYaoyu Wang
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataRobert Grossman
 
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Ian Foster
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer OverlordsIan Foster
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Vivien Bonazzi
 
(SEC313) Security & Compliance at the Petabyte Scale
(SEC313) Security & Compliance at the Petabyte Scale(SEC313) Security & Compliance at the Petabyte Scale
(SEC313) Security & Compliance at the Petabyte ScaleAmazon Web Services
 
Introduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 TutorialIntroduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 TutorialGlobus
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformYaoyu Wang
 
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuantUniversity
 
re:Invent 2013-foster-madduri
re:Invent 2013-foster-maddurire:Invent 2013-foster-madduri
re:Invent 2013-foster-madduriRavi Madduri
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011Ian Foster
 
Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...
Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...
Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...Globus
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataPhilip Bourne
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationIan Foster
 

Similaire à GlobusWorld 2020 Keynote (20)

Simplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus PlatformSimplified Research Data Management with the Globus Platform
Simplified Research Data Management with the Globus Platform
 
Science cloud foster june 2013
Science cloud foster june 2013Science cloud foster june 2013
Science cloud foster june 2013
 
Science as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate DiscoveryScience as a Service: How On-Demand Computing can Accelerate Discovery
Science as a Service: How On-Demand Computing can Accelerate Discovery
 
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWSExperiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
Experiences In Building Globus Genomics Using Galaxy, Globus Online and AWS
 
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific DiscoveryA Global Research Data Platform: How Globus Services Enable Scientific Discovery
A Global Research Data Platform: How Globus Services Enable Scientific Discovery
 
Cloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysisCloud Native Analysis Platform for NGS analysis
Cloud Native Analysis Platform for NGS analysis
 
A Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate DataA Gen3 Perspective of Disparate Data
A Gen3 Perspective of Disparate Data
 
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
Science Services and Science Platforms: Using the Cloud to Accelerate and Dem...
 
So Long Computer Overlords
So Long Computer OverlordsSo Long Computer Overlords
So Long Computer Overlords
 
Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2Bonazzi commons bd2 k ahm 2016 v2
Bonazzi commons bd2 k ahm 2016 v2
 
(SEC313) Security & Compliance at the Petabyte Scale
(SEC313) Security & Compliance at the Petabyte Scale(SEC313) Security & Compliance at the Petabyte Scale
(SEC313) Security & Compliance at the Petabyte Scale
 
Introduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 TutorialIntroduction to Globus - XSEDE14 Tutorial
Introduction to Globus - XSEDE14 Tutorial
 
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
 
CCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud PlatformCCCB Germline Variant Analysis on Cloud Platform
CCCB Germline Variant Analysis on Cloud Platform
 
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
QuTrack: Model Life Cycle Management for AI and ML models using a Blockchain ...
 
re:Invent 2013-foster-madduri
re:Invent 2013-foster-maddurire:Invent 2013-foster-madduri
re:Invent 2013-foster-madduri
 
Rpi talk foster september 2011
Rpi talk foster september 2011Rpi talk foster september 2011
Rpi talk foster september 2011
 
Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...
Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...
Globus: A Data Management Platform for Collaborative Research (CHPC 2019 - So...
 
The Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big DataThe Commons: Leveraging the Power of the Cloud for Big Data
The Commons: Leveraging the Power of the Cloud for Big Data
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
 

GlobusWorld 2020 Keynote

  • 2. New to Globus? Join us for Globus 101++ tomorrow @ 11am–2:30pm CDT globusworld.org/program
  • 4. A Decade of Enabling Science Ian Foster Rachana Ananthakrishnan Vas Vasiliadis April 29, 2020
  • 6. Our Mission Increase the efficiency and effectiveness of researchers engaged in data-driven science and scholarship through sustainable software
  • 9. Supporting the nCoV collaboration. Chemical databases: cureFFI, GDB, MOSES, ZINC15, and more… Pipeline stages: Canonicalization, Compute Features, Fingerprinting, Generate Images, ML based filtering, Similarity Search, DNN filtering. Computing Resources. 2019-ncovgroup.github.io
  • 10. First release: 21 sources, 3.9B molecules, 80 TB computed features 2019-ncovgroup.github.io
  • 11. ENAMINE REAL: 1.2 billion molecules which comply with “rule of 5” and Veber criteria: MW≤500, SlogP≤5, HBA≤10, HBD≤5, rotatable bonds≤10, TPSA≤140. 21 sources, 3.9B molecules, 80 TB computed features 2019-ncovgroup.github.io
  • 14. Google Cloud upload 5GB; Google Cloud download 5GB
  • 16. Globus Labs Mission: To make research data reliably, rapidly, and securely accessible, discoverable, and usable, by developing an automated and scalable platform for reproducible research that can exploit heterogeneous resources that span the computing continuum. [Diagram labels: ƒuncX, Model registry, Flows, Cost map, Write programs, Function fabric, Data/Trust fabric, Automate, DLHub, Globus, SCRIMP, Metadata Extraction, Xtract]
  • 17. funcX: distributed function as a service. Portable code, any access, any computer: Python; Docker, Shifter, Singularity; clusters, clouds, HPC, accelerators; cloud API, cluster or HPC scheduler.
  • 18. funcX: Transform clouds, clusters, and supercomputers into high-performance function serving systems. Simply deploy a funcX endpoint to transform a computer into a function serving system. [Diagram labels: repo2docker, Register, EP(x), f(x), g(x), h(x), k(x), "f(x), … + dependencies"]
  • 19. funcX: Transform clouds, clusters, and supercomputers into high-performance function serving systems. Simply deploy a funcX endpoint to transform a computer into a function serving system. Registration: f(x), g(x), … + dependencies; EP(x) registry. Execution: f(x), … [1,2,3 … n]. [Diagram labels: repo2docker, Register, EP(x), f(x), g(x), h(x), k(x)]
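The registration and execution steps on slide 19 can be pictured with a small in-memory model. This is only a sketch: `ToyRegistry`, `ToyEndpoint`, and `square` are hypothetical names invented for illustration, and nothing here is the funcX SDK or service API.

```python
import uuid
from typing import Any, Callable, Dict, List


class ToyRegistry:
    """In-memory stand-in for a function registry (not the funcX service)."""

    def __init__(self) -> None:
        self._functions: Dict[str, Callable[..., Any]] = {}

    def register(self, func: Callable[..., Any]) -> str:
        # Registration hands back an opaque ID that callers use later.
        func_id = str(uuid.uuid4())
        self._functions[func_id] = func
        return func_id

    def lookup(self, func_id: str) -> Callable[..., Any]:
        return self._functions[func_id]


class ToyEndpoint:
    """Stand-in for an endpoint: it runs registered functions locally."""

    def __init__(self, name: str, registry: ToyRegistry) -> None:
        self.name = name
        self.registry = registry

    def run(self, func_id: str, inputs: List[Any]) -> List[Any]:
        func = self.registry.lookup(func_id)
        # Apply the function to each input, as in "f(x), ... [1,2,3 ... n]".
        return [func(x) for x in inputs]


def square(x: int) -> int:
    return x * x


registry = ToyRegistry()
square_id = registry.register(square)
endpoint = ToyEndpoint("laptop", registry)
results = endpoint.run(square_id, [1, 2, 3])
print(results)
```

The point of the split is that the caller only holds an ID; which machine executes the function is decided by the endpoint it is sent to.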
  • 20. Parsl: parallel programming in Python. arxiv.org/pdf/1905.02158 parsl-project.org
  • 21. Cost-aware computing with heterogeneous platforms: Incremental construction of a personalized cost map • Build black-box performance models from observed execution times for different codes on different platforms • Transfer learning across codes, problem sizes, and hardware platforms • Experiment design to choose experiments that maximize reduction in uncertainty • Evolve models over time as codes and platforms change • Use models for instance selection and scheduling
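A minimal sketch of the first and last bullets on slide 21 (black-box models from observed execution times, then instance selection) might look like the following. It assumes a simple linear runtime model, which the slides do not specify, and the instance names, prices, and timings are made-up illustrative values rather than measured data.

```python
from typing import Dict, List, Tuple

Observation = Tuple[float, float]  # (problem size, observed runtime in seconds)


def fit_line(points: List[Observation]) -> Tuple[float, float]:
    """Least-squares fit of runtime = a + b * size.

    Needs at least two observations with different sizes.
    """
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b


def cheapest_instance(
    observed: Dict[str, List[Observation]],
    price_per_hour: Dict[str, float],
    size: float,
) -> Tuple[str, Dict[str, float]]:
    """Predict the cost of one run on each instance type and pick the minimum."""
    costs: Dict[str, float] = {}
    for name, points in observed.items():
        a, b = fit_line(points)
        runtime_s = a + b * size
        costs[name] = runtime_s / 3600.0 * price_per_hour[name]
    best = min(costs, key=lambda key: costs[key])
    return best, costs


# Hypothetical observations and prices, for illustration only.
observed = {
    "small": [(1.0, 100.0), (2.0, 200.0), (3.0, 300.0)],
    "large": [(1.0, 30.0), (2.0, 60.0), (3.0, 90.0)],
}
price_per_hour = {"small": 0.10, "large": 0.40}

best, costs = cheapest_instance(observed, price_per_hour, size=10.0)
print(best, costs)
```

With these made-up numbers the faster instance is not the cheaper one, which is the kind of tradeoff a cost map is meant to expose.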
  • 22. Example: A cost map for bioinformatics applications on different AWS instance types (axes: Virtual CPUs, RAM (GB)). IndexBam performs better on compute-optimized instances. Poorly chosen experiments mislead the model. On average, within 30% of final error after 4 experiments and within 2.3% after 6.
  • 23. Metadata extraction at the edge • Dynamic extraction pipelines composed of many independent extractors – Metadata and content (images, text, tables, maps, …) • Centralized vs edge extractor execution to weigh tradeoffs between compute and transfer costs
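The centralized-versus-edge tradeoff on slide 23 can be sketched as a simple time comparison. This is an assumption-laden toy, not how Xtract decides: `choose_site` and its parameters are hypothetical, and it compares elapsed time only, ignoring queueing and monetary cost.

```python
def choose_site(
    file_bytes: float,
    bandwidth_bytes_per_s: float,
    edge_seconds: float,
    central_seconds: float,
) -> str:
    """Return "edge" or "central", whichever finishes extraction sooner.

    Running centrally means paying the transfer time first; running at the
    edge avoids the transfer but may use a slower machine.
    """
    central_total = file_bytes / bandwidth_bytes_per_s + central_seconds
    return "edge" if edge_seconds <= central_total else "central"


# A 10 GB file over a 100 MB/s link takes 100 s to move, so a 60 s edge
# extractor beats a 10 s central extractor.
big_file = choose_site(10e9, 100e6, edge_seconds=60.0, central_seconds=10.0)

# A 1 MB file moves almost instantly, so the faster central extractor wins.
small_file = choose_site(1e6, 100e6, edge_seconds=60.0, central_seconds=10.0)

print(big_file, small_file)
```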
  • 24. DLHub: model publication and serving. dlhub.org arxiv.org/abs/1811.11213
  • 25. NIH Common Fund Data Ecosystem. Assets: RNAseq, variants, patient phenotypes, expression profiles to small molecules. At multiple sites: managed/hosted by specialists. Goals: increase discoverability; combine, reuse, share assets; increase analysis, enabling clinical research. Data automation: Data Ingest, Index, Search, Analyze
  • 27. Simplifying the Globus Connect Personal Experience • Option to log in from the application during installation • Setup key method available for automation use cases • Available next week
  • 28. Simplifying the Globus Connect Personal Experience
  • 29. Simplifying the Globus Connect Personal Experience
  • 30. The new Globus Connect v5 architecture provides numerous new features for users and administrators, and serves as a platform for richer data management capabilities.
  • 31. For users and developers • Web addressable storage system in addition to bulk data access • Credential management for cloud storage systems • No re-authentication needed for duration of tasks • Eliminate user certificates and move to OAuth tokens • …
  • 32. For administrators • Single DTN pool connects multiple storage systems • Eliminate need for shared file system across DTNs • Complete backup and recovery solution • Configuration management API • …
  • 33. Next point release GCSv5.4 • Targeted for May 2020 • Deployments with multiple DTNs • Support both standard data access and high assurance access • Custom mapping from user identity (user@domain.edu) to local account • Role based management for GCS • Guest collection root selection via browse • Connectors supported: – POSIX, Google Drive, Google Cloud, Box, Ceph, AWS S3, SpectraLogic Black Pearl
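The idea of mapping a user identity such as user@domain.edu to a local account (slide 33) can be illustrated with a toy function. This is not the Globus Connect Server configuration format or API: `map_identity`, the allowed-domain rule, and the overrides table are assumptions made up for this sketch.

```python
from typing import Dict, Optional


def map_identity(
    identity: str,
    allowed_domain: str,
    overrides: Dict[str, str],
) -> Optional[str]:
    """Map an identity like "user@domain.edu" to a local account name.

    Explicit overrides win; otherwise the username part is used when the
    domain matches. Returns None when no mapping applies.
    """
    if identity in overrides:
        return overrides[identity]
    username, sep, domain = identity.partition("@")
    if sep and username and domain.lower() == allowed_domain.lower():
        return username
    return None


overrides = {"pi@domain.edu": "labadmin"}  # hypothetical custom mapping

default_case = map_identity("user@domain.edu", "domain.edu", overrides)
custom_case = map_identity("pi@domain.edu", "domain.edu", overrides)
foreign_case = map_identity("user@elsewhere.org", "domain.edu", overrides)

print(default_case, custom_case, foreign_case)
```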
  • 34. GCSv5 - Multiple DTNs architecture
  • 35. Globus Connect Server v5 • Continue to add features as point releases • Migration tools from v4 to v5
  • 36. Access Google Cloud Storage and other on-prem/cloud storage via the same, familiar interface
  • 37. Data-appropriate storage Google Drive for project admin files Google Cloud Storage for core research files
  • 38. Fire-and-forget transfers to Google storage resources, e.g. automatic retry on errors
  • 39. Maximize value of your Google cloud investment, including sharing data with collaborators
  • 40. Continue to grow S3 compatible systems
  • 43. Other product updates • For users: Several new features in web app – Consolidated view options, HTTPS upload/download via browser, custom message on access, accessibility improvements… • For admins: Transfer updates for checksum handling – Support for additional algorithms (SHA1, SHA256, SHA512), custom checksum value to verify file integrity • For developers: Globus Groups platform service – First release with minimal features to get group membership information
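Verifying file integrity against a supplied checksum value, with the algorithms named on slide 43, can be sketched with Python's standard `hashlib`. This shows the general technique only, not how the Globus transfer service implements it; `file_checksum`, `verify`, and the sample file contents are illustrative assumptions.

```python
import hashlib
import os
import tempfile

SUPPORTED = ("sha1", "sha256", "sha512")


def file_checksum(path: str, algorithm: str = "sha256",
                  chunk_size: int = 1024 * 1024) -> str:
    """Hash a file in chunks so large files need not fit in memory."""
    if algorithm not in SUPPORTED:
        raise ValueError(f"unsupported algorithm: {algorithm}")
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: str, expected_hex: str, algorithm: str = "sha256") -> bool:
    """Compare the computed digest with a caller-supplied checksum value."""
    return file_checksum(path, algorithm) == expected_hex.lower()


# Demo with a throwaway file and made-up contents.
payload = b"example research data"
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(payload)
    path = tmp.name

expected = hashlib.sha256(payload).hexdigest()
matches = verify(path, expected)
tampered = verify(path, "0" * 64)
sha512_length = len(file_checksum(path, "sha512"))
os.remove(path)

print(matches, tampered, sha512_length)
```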
  • 44. Some of the research we’re enabling…
  • 45. NIH data access at scale for cancer researchers
  • 46. DataCite switches to Globus Auth for authentication • Globus Auth to secure their Profiles services • Brings federated login to DataCite users • Ongoing collaboration to use Globus Auth for securing other APIs • Globus to use DataCite for persistent identifiers. blog.datacite.org/globus-authentication
  • 47. Cancer Registry Records for Research (CR3) • Vision: enable broad, controlled access to cancer patient data • Solution: Build a network of federated cancer registries – Self-service data exploration across registries – Secure, auditable access controls for data sharing • Federation via Globus: network scale, local control – Owners input/export data, apply QC, set access policies – Registry data remain at generating institution – Identities provided/authenticated by the institution
  • 48. CR3 Architecture. [Diagram labels: Researcher; CR3 Discovery Portal; Login with UPMC/Pitt credentials; UPMC/Pitt Identity Providers; Authentication; Auth; Auth initiated to Globus Auth; Search; Cohort search initiated to Globus Search; Cohort aggregate counts returned; Transfer; Data transfer from registrar to researcher mediated by Globus; Registry Staff; Manage authorization; Request Service; Cancer Registry; De-identified Data Index (minimal criteria data: e.g., staging)]
  • 49. Programmatic adoption of Globus: “…over 60 research groups …moving over 2PB of data off aging near-line storage…” “Globus sharing and group functionality have also eased the thorny issue of sharing access with remote collaborators in a more controlled manner.” www.technology.pitt.edu/blog/globus
  • 50. Instrument data delivery at scale Use Globus to deliver 100s of TB of genomic data to researchers Credits: Joe George, University of Michigan
  • 51. Simplified data sharing for ALCF users: Argonne Leadership Computing Facility (ALCF) “Eagle” provides a 50 PB community file system to make data-sharing easier than ever among ALCF users, their collaborators, and third parties. Eagle Community File System; Globus sharing
  • 53. Current service enhancements • MFA policy for data access • IPv6 support • Conditional fault handling • Enhancements for storage with staging requirements • Enhancements to application registration and management • Groups service – Membership API – Management API
  • 54. Platform Challenge: Transform how research applications and services are… created, used and delivered; orchestrated to achieve automation; sustained. Enable an interoperable ecosystem of research applications and services
  • 55. Globus platform services • Identity and Access Management (IAM) – Auth – Groups • Data Services – Connect – Transfer – Manifest • Search • Identifiers (collaboration with DataCite) • Flows
  • 57. Automation Action Providers: Delete ACLs Search DLHub User Form Notification Expression Evaluation Describe Web Form Identifier Transfer Ingest Xtract funcX. Globus action providers; Custom action providers
  • 58. Enabling serial crystallography at scale • Serially image chips with thousands of embedded crystals • Quality control first 1,000 to report failures • Analyze batches of images as they are collected • Report statistics and images during experiment • Return crystal structure to scientist Darren Sherrell, Gyorgy Babnigg, Andrzej Joachimiak
  • 60. PaaS: develop custom action providers • Directly use the platform to build and run extensible flows • Develop action providers – Fit for purpose – Developed and deployed by the project – Plugged into their flows • Action Provider Development toolkit
  • 61. XPCS: X-ray Photon Correlation Spectroscopy. Workflow steps: 1 Imaging, 2 Acquisition, 3 XPCS-Eigen, 4 Plot results, 5 Publication, 6 Science! [Diagram labels: ALCF Data Portal, Argonne JLSE, Argonne Leadership Computing Facility, APS, Lab Server 1] ● Automate flows stage data to ALCF for on-demand analysis and publication ● Metadata and plots dynamically extracted and published into a search catalog ● Scientists can select datasets and initiate flows to perform batch analysis tasks. Suresh Narayanan, Nicholas Schwarz
  • 63. SaaS: instrument data management • Templated solution • Configurable… – Set transfer triggers – Select destination(s) – Define metadata • Extensible… – Add/remove actions – Change action providers • No development required. Instruments: Cryo EM, Lightsheet, Sequencer, … Automated egress from device (--/cohort045, --/cohort096, --/cohort127); Indexing for search; Image reconstruction, analysis, visualization. Action providers: Transfer, funcX, Xtract
  • 64. Materials Data Facility > 40 TB of data > 320 published authors > 400 datasets • Accept data from many locations with flexible interfaces • Index dataset contents in science-aware ways • Dispatch data to the community • Using Automate to simplify building composable flows of services
  • 65. MDF Data Publication Automation (Automate). Steps as labeled in the diagram, provider then action: Ingest: Bulk Ingest; Auth: Get Credentials; Transfer: Transfer Dataset; XTract: Extract Metadata; Share: Set permissions; Transfer: Move metadata; Transfer: Transfer Dataset; Transfers: Transfer Dataset; Identifier: Mint DOI; Web form: Metadata; Notify: Notify Curator; Web form: Curation; Notify: Notify user
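The publication flow on slide 65 is, in essence, an ordered chain of actions that each take the current state and return an updated one. The toy runner below illustrates that pattern with four of the steps named on the slide. It does not model the Globus Automate or Flows service; every function name, the state keys, and the placeholder identifier string are assumptions for illustration.

```python
from typing import Callable, Dict, List, Tuple

State = Dict[str, object]
Action = Callable[[State], State]


def transfer_dataset(state: State) -> State:
    # Pretend the dataset was copied to its destination.
    return {**state, "transferred": True}


def extract_metadata(state: State) -> State:
    # Pretend an extractor produced a minimal metadata record.
    return {**state, "metadata": {"title": state["dataset"]}}


def mint_identifier(state: State) -> State:
    # Placeholder string only; a real flow would call an identifier service.
    return {**state, "identifier": f"placeholder-id-for-{state['dataset']}"}


def notify_user(state: State) -> State:
    return {**state, "notified": True}


def run_flow(steps: List[Tuple[str, Action]], state: State) -> State:
    """Run each action in order, threading the state through the chain."""
    history: List[str] = []
    for name, action in steps:
        state = action(state)
        history.append(name)
    return {**state, "history": history}


steps = [
    ("Transfer Dataset", transfer_dataset),
    ("Extract Metadata", extract_metadata),
    ("Mint DOI", mint_identifier),
    ("Notify user", notify_user),
]

final = run_flow(steps, {"dataset": "example-dataset"})
print(final["history"])
```

Because each step only depends on the state it receives, steps can be added, removed, or swapped without touching the runner, which mirrors the "add/remove actions, change action providers" idea elsewhere in the deck.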
  • 66. SaaS: Data Management Plans • “Turnkey” DMP enablement • Select dataset (collection)… • …add metadata for indexing • …generate persistent ID (DOI, ARK, etc.). Action providers: Transfer, Identifier, Ingest. “Point & Click” to findable and accessible data
  • 67. Data portals currently leveraging the platform
  • 70. To go (way) beyond file transfer… • Remove friction for external collaborators • Automate/scale research data flows • Diversify research storage options, with a unified interface • Gain visibility into research storage utilization • Integrate robust data management into research apps • Optimize data transfer performance • Access expert support resources
  • 71. To help our community share the load… [Chart: Active Endpoints by Month, 2015/04 through 2019/12, vertical axis 0 to 6000; series: Subscribed, Free]
  • 73. Thank you, funders... U.S. Department of Energy