SlideShare une entreprise Scribd logo
1  sur  27
“The Pacific Research Platform:
Building a Distributed Big-Data Machine-Learning
Cyberinfrastructure”
Briefing
Jacobs School of Engineering
University of California San Diego
July 18, 2019
Dr. Larry Smarr
Director, California Institute for Telecommunications and Information Technology
Harry E. Gruber Professor,
Dept. of Computer Science and Engineering
Jacobs School of Engineering, UCSD
http://lsmarr.calit2.net
UC San Diego’s Calit2 & SDSC Have Pioneered Big-Data Cyberinfrastructure for 17 Years
with NSF Grants: OptIPuter, Quartzite, Prism, CHERuB, PRP, CHASE-CI, TNRP
OptIPuter
PI Smarr,
Co-PI DeFanti
Co-PI Papadopoulos, Ellisman
2002-2009
Quartzite
PI Papadopoulos,
Co-PI Smarr, Ford,
Fainman
2013-2015: Creating a “Big Data” Backplane on Campus:
NSF CC-NIE Funded Prism@UCSD and CHERuB
Prism@UCSD, Phil Papadopoulos, SDSC, Calit2, PI; Smarr co-PI
CHERuB, Mike Norman, SDSC PI
CHERuB
(GDC)
2015-2020: The Pacific Research Platform Connects Campus “Big Data Freeways”
to Create a Regional End-to-End Science-Driven “Big Data Superhighway” System
NSF CC*DNI Grant
$6M 10/2015-10/2020
PI: Larry Smarr, UC San Diego Calit2
Co-PIs:
• Camille Crittenden, UC Berkeley CITRIS,
• Tom DeFanti, UC San Diego Calit2/QI,
• Philip Papadopoulos, UCSD SDSC,
• Frank Wuerthwein, UCSD Physics and SDSC
Letters of Commitment from:
• 50 Researchers from 15 Campuses
• 32 IT/Network Organization Leaders
Source: John Hess, CENIC
UCOP CIO Tom Andriola
Provided Funds and ITLC Support
for Using Ten UC Campuses
For Advanced Technology Testing
2017-2020: CHASE-CI Adds
Machine-Learning to the Data-Science Community Cyberinfrastructure
Caltech
UCB
UCI UCR
UCSD
UCSC
Stanford
MSU
UCM
SDSU
NSF Grant for 256 High Speed “Cloud” GPUs
For 32 ML Faculty & Their Students at 10 Campuses
To Train AI Algorithms on Big Data
PRP Engineers Designed and Built Several Generations
of Optical-Fiber Big-Data Flash I/O Network Appliances (FIONAs)
UCSD-Designed FIONAs Solved the Disk-to-Disk Data Transfer Problem
at Near Full Speed on Best-Effort 10G, 40G and 100G Networks
FIONAs Designed by UCSD’s Phil Papadopoulos, John Graham,
Joe Keefe, and Tom DeFanti
FIONette—
1G, $250
Used for
Training 50
Engineers in
2018-2019
Two FIONA DTNs at UC Santa Cruz: 40G & 100G
Up to 200 TeraByte Rotating Storage
Add Up to 8 Nvidia GPUs Per FIONA
To Add Machine Learning Capability
Over 100 Now Deployed on PRP
48 GPUs for
OSG Applications
UCSD Has Added >350 Game GPUs to Data Sciences Cyberinfrastructure -
Devoted to Data Analytics and Machine Learning
SunCAVE 70 GPUs
WAVE + Vroom 48 GPUs
FIONA with
8-Game GPUs
104 GPUs
for Students
CHASE-CI Grant Provides
96 GPUs at UCSD
for Training AI Algorithms on Big Data
Plus 288 64-bit GPUs
On SDSC’s Comet
UCSD’s ITS Adapted PRP FIONA8s
To Support Data Science Courses
Instructional Data Science
Machine Learning Platform:
Instead of Spending
~$20,000/Quarter/Course on
Commercial Clouds:
97 Courses over 6 Quarters 
$4M vs. $240K over 12 Quarters
At least 20,000 Students
Adam Tilghman, ITS
Source: UCSD ITS
The Student GPUs
Have Supported a Broad Set of Courses Across Campus
Source: UCSD ITS
The ITS GPUs
Have Supported Thousands of Students
Source: UCSD ITS
Student GPU Demand Is Variable
Allowing for Other Student Uses
Available to Support:
Independent Study,
For-Credit Research,
External Barter
Source: UCSD ITS
2018-2019: PRP Game Changer!
Using Kubernetes to Orchestrate Containers Across the PRP
“Kubernetes is a way of stitching together
a collection of machines into,
basically, a big computer,”
--Craig Mcluckie, Google
and now CEO and Founder of Heptio
"Everything at Google runs in a container."
--Joe Beda,Google
1 FIONA8
1 FIONA8
100G NVMe 6.4TB
100G NVMe 6.4TB
Caltech
40G 160TB
UCAR
40G 192TB
UCSF
40G 160TB HPWREN
40G 160TB
4 FIONA8s
Calit2/UCI
35 FIONA2s
12 FIONA8s
2x40G 160TB HPWREN
UCSD
100G Epyc NVMe
100G Gold NVMe
8 FIONA8s + 5 FIONA8s
SDSC @ UCSD
40G 160TB
UCR
40G 160TB
USC
2x40G 160TB
UCLA
40G 160TB
Stanford U
2 FIONA8s
40G 192TB
UCSB
4.5 FIONA8s
100G NVMe 6.4TB
40G 160TB
UCSC
40G 160TB
U Hawaii
Nautilus Kubernetes Cluster
Connected by CENIC in California
10 FIONA2s
1 FIONA8
40G 160TB
UCM
100Gb/s HPR
17 Campus Nautilus Cluster:
3300 CPU Cores 82 Hosts
~4 PB Storage
>350 GPUs: >30M core/hrs/day
40G 160TB HPWREN
100G NVMe 6.4TB
1 FIONA8 2 FIONA4s
FPGAs + 2PB BeeGFS
SDSU
40G FIONA1
UIC
CHASE-CI PRP Disks
10G 3TB
CSUSB
40G 192TB
U Washington
Minority Serving Institution
Major CHASE-CI Usage by UCI
Over PRP to UCSD CPUs/GPUs
Cognitive Anteater
Robotics Laboratory
(CARL) supervised
by
Prof. Jeff Krichmar
UCICompVis Group
supervised by
Prof. Charless Fowlkes
#ofCores
Demo
Last Night
From
Data Think Tank Lab
2 Months
Very Cost-Effective for Academic Machine Learning and Data Sharing
• Data science researchers need DTNs with lots of storage, encryption and lots of GPUS
• One UC spends $40,000 in cloud GPU per published grad student paper
• Another spends $20,000 for undergrad ML AWS access in just one course
• Instead, add to our Nautilus hypercluster (or clone it & federate):
– UCSD ECE Department bought 4 FIONA8s, buying 4 more
– UCSD Physics Department. bought 3 FIONA8s, buying 3 more
– UCSD CSE researchers bought/are buying FIONA8s to add to Nautilus
– UCSD Instructional IT has 13 FIONA8s for Machine Learning/AI class labs
• Working Storage on Nautilus FIONAs is
– very inexpensive (12TB drives are ~$430 each—16 per FIONA. FISA encrypted drives @ same cost)
– and very high speed (most FIONAs are 40/100G and are located in ScienceDMZs)
Clemson’s Alex Feltus: “I cannot wait to add a node to the
Nautilus compute fabric!” 5/22/2019
Nautilus Usage
April 17, 2019 to July 17, 2019
Biggest Nautilus GPU Users
December – April, 2019
CSE ECE Struc. Eng
Extra slides
Original PRP
CENIC/PW Link
2018-2019: National-Scale Pilot -
Using CENIC & Internet2 to Connect Quilt Regional R&E Networks
Announced May 8, 2018
Internet2 Global Summit
“Towards
The NRP”
3-Year Grant
Funded
by NSF
$2.5M
OAC-1826967
PI Smarr
Co-PIs Altintas
Papadopoulos
Wuerthwein
Rosing
Mgr: DeFanti
NRP Pilot
NSF CENIC Link
CENIC/PW Link
40G 3TB
U Hawaii
40G 160TB
NCAR-WY
40G 192TB
UWashington
100G FIONA
I2 Chicago
100G FIONA
I2 Kansas City
10G FIONA1
40G FIONA
UIC
100G FIONA
I2 NYC
40G 3TB
StarLight
United States PRP Nautilus Hypercluster FIONAs
We Now Connect 3 More Regionals and 3 Internet2 sites
Global PRP Nautilus Hypercluster Is Rapidly Increasing
Partners Beyond Our Original Partner in Amsterdam—May 2019
PRP
PRPv2
Nautilus
Transoceanic
Nodes
Guam
Asian Pacific RP
Transoceanic
Nodes
Australia
Korea
Singapore
Netherlands
10G 35TB
UvA
40G FIONA6
40G 28TB
KISTI
10G (coming)
U of Guam
100G 35TB
U of Queensland
Transoceanic Nodes Show Distance is Not the Barrier
to Above 5Gb/s Disk-to-Disk Performance
PRP is Science-Driven:
Connecting Multi-Campus Application Teams and Devices
Earth
Sciences
UC San Diego UCBerkeley
Director: F. Martin Ralph
Big Data Collaboration with:
Source: Scott Sellers, PhD CHRS; Postdoc CW3E
Collaboration on Atmospheric Water in the West
Between UC San Diego and UC Irvine
Director, Soroosh Sorooshian, UCSD
Calit2’s FIONA
SDSC’s COMET
Calit2’s FIONA
Pacific Research Platform (10-100 Gb/s)
GPUsGPUs
Complete Workflow Time: 19.2 Days52 Minutes!
UC, Irvine UC, San Diego
PRP Shortened Scott Sellar’s Workflow From 19.2 Days to 52 Minutes -
532 Times Faster!
Source: Scott Sellers, US State Dept.
OSG IceCube Usage on PRP (Purple Segment) 3/9/19:
Using 126 GPUs + 142 CPUs + 49 GB RAM
GPU Simulations Needed to Improve Ice Model.
=> Results in Significant Improvement in Pointing Resolution
for Multi-Messenger Astrophysics
IceCube
PRP Actively Develops Diversity
• Grants
– 3 Female co-PIs
– 1 Hispanic co-PI
• Campuses
– 8 Minority-Serving Institutions in PRP/Nautilus
• Workshops
– NRP’18 Workshop Program Committee 80% Female
– Multiple MSI, EPSCoR Focused Workshops Jackson State University
PRP MSI Workshop
Presenting
FIONettes
Installing FIONAs Across California in Late 2018 and Early 2019
To Enhance User’s CPU and GPU Computing, Data Posting, and Data Transfers
UC Merced
Stanford UC Santa Barbara
UC Riverside
UC Santa Cruz
UC Irvine

Contenu connexe

Tendances

Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformLarry Smarr
 
PRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path ForwardPRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path ForwardLarry Smarr
 
Toward Greener Cyberinfrastructure
Toward Greener CyberinfrastructureToward Greener Cyberinfrastructure
Toward Greener CyberinfrastructureLarry Smarr
 
The Pacific Research Platform: Leading Up to the National Research Platform
The Pacific Research Platform:  Leading Up to the National Research PlatformThe Pacific Research Platform:  Leading Up to the National Research Platform
The Pacific Research Platform: Leading Up to the National Research PlatformLarry Smarr
 
An Integrated Science Cyberinfrastructure for Data-Intensive Research
An Integrated Science Cyberinfrastructure for Data-Intensive ResearchAn Integrated Science Cyberinfrastructure for Data-Intensive Research
An Integrated Science Cyberinfrastructure for Data-Intensive ResearchLarry Smarr
 
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean SciencesThe Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean SciencesLarry Smarr
 
Toward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing CyberinfrastructureToward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing CyberinfrastructureLarry Smarr
 
High Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive ResearchHigh Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive ResearchLarry Smarr
 
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...Larry Smarr
 
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...Larry Smarr
 
High Performance Collaboration
High Performance CollaborationHigh Performance Collaboration
High Performance CollaborationLarry Smarr
 
Calit2 as a Model for Collaborative Innovation
Calit2 as a Model for Collaborative InnovationCalit2 as a Model for Collaborative Innovation
Calit2 as a Model for Collaborative InnovationLarry Smarr
 
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-LearningThe Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-LearningLarry Smarr
 
The Future of the Internet and its Impact on Digitally Enabled Genomic Medicine
The Future of the Internet and its Impact on Digitally Enabled Genomic MedicineThe Future of the Internet and its Impact on Digitally Enabled Genomic Medicine
The Future of the Internet and its Impact on Digitally Enabled Genomic MedicineLarry Smarr
 
The OptIPuter Project: From the Grid to the LambdaGrid
The OptIPuter Project: From the Grid to the LambdaGridThe OptIPuter Project: From the Grid to the LambdaGrid
The OptIPuter Project: From the Grid to the LambdaGridLarry Smarr
 
What I’ve Learned About “Green”
What I’ve Learned About “Green”What I’ve Learned About “Green”
What I’ve Learned About “Green”Larry Smarr
 
Peering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains NetworkPeering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains NetworkLarry Smarr
 
Why Researchers are Using Advanced Networks
Why Researchers are Using Advanced NetworksWhy Researchers are Using Advanced Networks
Why Researchers are Using Advanced NetworksLarry Smarr
 
The Optiputer - Toward a Terabit LAN
The Optiputer - Toward a Terabit LANThe Optiputer - Toward a Terabit LAN
The Optiputer - Toward a Terabit LANLarry Smarr
 
The Creation of Calit2
The Creation of Calit2The Creation of Calit2
The Creation of Calit2Larry Smarr
 

Tendances (20)

Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research Platform
 
PRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path ForwardPRP, NRP, GRP & the Path Forward
PRP, NRP, GRP & the Path Forward
 
Toward Greener Cyberinfrastructure
Toward Greener CyberinfrastructureToward Greener Cyberinfrastructure
Toward Greener Cyberinfrastructure
 
The Pacific Research Platform: Leading Up to the National Research Platform
The Pacific Research Platform:  Leading Up to the National Research PlatformThe Pacific Research Platform:  Leading Up to the National Research Platform
The Pacific Research Platform: Leading Up to the National Research Platform
 
An Integrated Science Cyberinfrastructure for Data-Intensive Research
An Integrated Science Cyberinfrastructure for Data-Intensive ResearchAn Integrated Science Cyberinfrastructure for Data-Intensive Research
An Integrated Science Cyberinfrastructure for Data-Intensive Research
 
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean SciencesThe Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
 
Toward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing CyberinfrastructureToward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing Cyberinfrastructure
 
High Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive ResearchHigh Performance Cyberinfrastructure for Data-Intensive Research
High Performance Cyberinfrastructure for Data-Intensive Research
 
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
Advanced Global-Scale Networking Supporting Data-Intensive Artificial Intelli...
 
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...
The Jump to Light Speed - Data Intensive Earth Sciences are Leading the Way t...
 
High Performance Collaboration
High Performance CollaborationHigh Performance Collaboration
High Performance Collaboration
 
Calit2 as a Model for Collaborative Innovation
Calit2 as a Model for Collaborative InnovationCalit2 as a Model for Collaborative Innovation
Calit2 as a Model for Collaborative Innovation
 
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-LearningThe Pacific Research Platform Enables Distributed Big-Data Machine-Learning
The Pacific Research Platform Enables Distributed Big-Data Machine-Learning
 
The Future of the Internet and its Impact on Digitally Enabled Genomic Medicine
The Future of the Internet and its Impact on Digitally Enabled Genomic MedicineThe Future of the Internet and its Impact on Digitally Enabled Genomic Medicine
The Future of the Internet and its Impact on Digitally Enabled Genomic Medicine
 
The OptIPuter Project: From the Grid to the LambdaGrid
The OptIPuter Project: From the Grid to the LambdaGridThe OptIPuter Project: From the Grid to the LambdaGrid
The OptIPuter Project: From the Grid to the LambdaGrid
 
What I’ve Learned About “Green”
What I’ve Learned About “Green”What I’ve Learned About “Green”
What I’ve Learned About “Green”
 
Peering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains NetworkPeering The Pacific Research Platform With The Great Plains Network
Peering The Pacific Research Platform With The Great Plains Network
 
Why Researchers are Using Advanced Networks
Why Researchers are Using Advanced NetworksWhy Researchers are Using Advanced Networks
Why Researchers are Using Advanced Networks
 
The Optiputer - Toward a Terabit LAN
The Optiputer - Toward a Terabit LANThe Optiputer - Toward a Terabit LAN
The Optiputer - Toward a Terabit LAN
 
The Creation of Calit2
The Creation of Calit2The Creation of Calit2
The Creation of Calit2
 

Similaire à The Pacific Research Platform: Building a Distributed Big-Data Machine-Learning Cyberinfrastructure

From the Pacific Research Platform to a National Research Platform
From the Pacific Research Platform to a National Research PlatformFrom the Pacific Research Platform to a National Research Platform
From the Pacific Research Platform to a National Research PlatformLarry Smarr
 
Toward a Global Research Platform for Big Data Analysis
Toward a Global Research Platform for Big Data AnalysisToward a Global Research Platform for Big Data Analysis
Toward a Global Research Platform for Big Data AnalysisLarry Smarr
 
The PRP and Its Applications
The PRP and Its ApplicationsThe PRP and Its Applications
The PRP and Its ApplicationsLarry Smarr
 
Creating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data SuperhighwayCreating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data SuperhighwayLarry Smarr
 
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...Larry Smarr
 
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...Larry Smarr
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
Toward a National Research Platform
Toward a National Research PlatformToward a National Research Platform
Toward a National Research PlatformLarry Smarr
 
Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...
Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...
Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...Larry Smarr
 
Distributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningDistributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningLarry Smarr
 
Distributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningDistributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningLarry Smarr
 
Toward a National Research Platform
Toward a National Research PlatformToward a National Research Platform
Toward a National Research PlatformLarry Smarr
 
Towards a High-Performance National Research Platform Enabling Digital Research
Towards a High-Performance National Research Platform Enabling Digital ResearchTowards a High-Performance National Research Platform Enabling Digital Research
Towards a High-Performance National Research Platform Enabling Digital ResearchLarry Smarr
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
The Pacific Research Platform Two Years In
The Pacific Research Platform Two Years InThe Pacific Research Platform Two Years In
The Pacific Research Platform Two Years InLarry Smarr
 
UC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive ResearchUC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive ResearchLarry Smarr
 
Toward A National Big Data Superhighway
Toward A National Big Data SuperhighwayToward A National Big Data Superhighway
Toward A National Big Data SuperhighwayLarry Smarr
 
Using the Pacific Research Platform for Earth Sciences Big Data
Using the Pacific Research Platform for Earth Sciences Big DataUsing the Pacific Research Platform for Earth Sciences Big Data
Using the Pacific Research Platform for Earth Sciences Big DataLarry Smarr
 
The Synergy Between CHASE-CI and CineGrid
The Synergy Between CHASE-CI and CineGridThe Synergy Between CHASE-CI and CineGrid
The Synergy Between CHASE-CI and CineGridLarry Smarr
 
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
The Pacific Research Platform: A Science-Driven Big-Data Freeway SystemThe Pacific Research Platform: A Science-Driven Big-Data Freeway System
The Pacific Research Platform: A Science-Driven Big-Data Freeway SystemLarry Smarr
 

Similaire à The Pacific Research Platform: Building a Distributed Big-Data Machine-Learning Cyberinfrastructure (20)

From the Pacific Research Platform to a National Research Platform
From the Pacific Research Platform to a National Research PlatformFrom the Pacific Research Platform to a National Research Platform
From the Pacific Research Platform to a National Research Platform
 
Toward a Global Research Platform for Big Data Analysis
Toward a Global Research Platform for Big Data AnalysisToward a Global Research Platform for Big Data Analysis
Toward a Global Research Platform for Big Data Analysis
 
The PRP and Its Applications
The PRP and Its ApplicationsThe PRP and Its Applications
The PRP and Its Applications
 
Creating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data SuperhighwayCreating a Science-Driven Big Data Superhighway
Creating a Science-Driven Big Data Superhighway
 
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
The Pacific Research Platform: A Regional-Scale Big Data Analytics Cyberinfra...
 
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
The Pacific Research Platform: Building a Distributed Big Data Machine Learni...
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Toward a National Research Platform
Toward a National Research PlatformToward a National Research Platform
Toward a National Research Platform
 
Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...
Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...
Panel Presentation - Tom DeFanti with Larry Smarr and Frank Wuerthwein - Naut...
 
Distributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningDistributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine Learning
 
Distributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine LearningDistributed Cyberinfrastructure to Support Big Data Machine Learning
Distributed Cyberinfrastructure to Support Big Data Machine Learning
 
Toward a National Research Platform
Toward a National Research PlatformToward a National Research Platform
Toward a National Research Platform
 
Towards a High-Performance National Research Platform Enabling Digital Research
Towards a High-Performance National Research Platform Enabling Digital ResearchTowards a High-Performance National Research Platform Enabling Digital Research
Towards a High-Performance National Research Platform Enabling Digital Research
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
The Pacific Research Platform Two Years In
The Pacific Research Platform Two Years InThe Pacific Research Platform Two Years In
The Pacific Research Platform Two Years In
 
UC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive ResearchUC-Wide Cyberinfrastructure for Data-Intensive Research
UC-Wide Cyberinfrastructure for Data-Intensive Research
 
Toward A National Big Data Superhighway
Toward A National Big Data SuperhighwayToward A National Big Data Superhighway
Toward A National Big Data Superhighway
 
Using the Pacific Research Platform for Earth Sciences Big Data
Using the Pacific Research Platform for Earth Sciences Big DataUsing the Pacific Research Platform for Earth Sciences Big Data
Using the Pacific Research Platform for Earth Sciences Big Data
 
The Synergy Between CHASE-CI and CineGrid
The Synergy Between CHASE-CI and CineGridThe Synergy Between CHASE-CI and CineGrid
The Synergy Between CHASE-CI and CineGrid
 
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
The Pacific Research Platform: A Science-Driven Big-Data Freeway SystemThe Pacific Research Platform: A Science-Driven Big-Data Freeway System
The Pacific Research Platform: A Science-Driven Big-Data Freeway System
 

Plus de Larry Smarr

My Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsMy Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsLarry Smarr
 
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Larry Smarr
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsLarry Smarr
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsLarry Smarr
 
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...Larry Smarr
 
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Larry Smarr
 
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonLarry Smarr
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsLarry Smarr
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewLarry Smarr
 
Panel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksPanel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksLarry Smarr
 
Global Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownGlobal Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownLarry Smarr
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questionsLarry Smarr
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Larry Smarr
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharLarry Smarr
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Larry Smarr
 
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Larry Smarr
 
Frank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardFrank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardLarry Smarr
 

Plus de Larry Smarr (20)

My Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 YearsMy Remembrances of Mike Norman Over The Last 45 Years
My Remembrances of Mike Norman Over The Last 45 Years
 
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
Metagenics How Do I Quantify My Body and Try to Improve its Health? June 18 2019
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Global Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated SystemsGlobal Network Advancement Group - Next Generation Network-Integrated Systems
Global Network Advancement Group - Next Generation Network-Integrated Systems
 
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 Wireless FasterData and Distributed Open Compute Opportunities and (some) Us... Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
Wireless FasterData and Distributed Open Compute Opportunities and (some) Us...
 
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...Panel Discussion: Engaging underrepresented technologists, researchers, and e...
Panel Discussion: Engaging underrepresented technologists, researchers, and e...
 
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon MoonThe Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
The Asia Pacific and Korea Research Platforms: An Overview Jeonghoon Moon
 
Panel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving InstitutionsPanel: Reaching More Minority Serving Institutions
Panel: Reaching More Minority Serving Institutions
 
Panel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An OverviewPanel: The Global Research Platform: An Overview
Panel: The Global Research Platform: An Overview
 
Panel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical NetworksPanel: Future Wireless Extensions of Regional Optical Networks
Panel: Future Wireless Extensions of Regional Optical Networks
 
Global Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine BrownGlobal Research Platform Workshops - Maxine Brown
Global Research Platform Workshops - Maxine Brown
 
Built around answering questions
Built around answering questionsBuilt around answering questions
Built around answering questions
 
Panel: NRP Science Impacts​
Panel: NRP Science Impacts​Panel: NRP Science Impacts​
Panel: NRP Science Impacts​
 
Democratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish ParasharDemocratizing Science through Cyberinfrastructure - Manish Parashar
Democratizing Science through Cyberinfrastructure - Manish Parashar
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
 
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
Open Force Field: Scavenging pre-emptible CPU hours* in the age of COVID - Je...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
Panel: Open Infrastructure for an Open Society: OSG, Commercial Clouds, and B...
 
Frank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forwardFrank Würthwein - NRP and the Path forward
Frank Würthwein - NRP and the Path forward
 

Dernier

Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Celine George
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...KokoStevan
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104misteraugie
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 

Dernier (20)

Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
SECOND SEMESTER TOPIC COVERAGE SY 2023-2024 Trends, Networks, and Critical Th...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 
Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104Nutritional Needs Presentation - HLTH 104
Nutritional Needs Presentation - HLTH 104
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 

The Pacific Research Platform: Building a Distributed Big-Data Machine-Learning Cyberinfrastructure

  • 1. “The Pacific Research Platform: Building a Distributed Big-Data Machine-Learning Cyberinfrastructure” Briefing Jacobs School of Engineering University of California San Diego July 18, 2019 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD http://lsmarr.calit2.net
  • 2. UC San Diego’s Calit2 & SDSC Have Pioneered Big-Data Cyberinfrastructure for 17 Years with NSF Grants: OptIPuter, Quartzite, Prism, CHERuB, PRP, CHASE-CI, TNRP OptIPuter PI Smarr, Co-PI DeFanti Co-PI Papadopoulos, Ellisman 2002-2009 Quartzite PI Papadopoulos, Co-PI Smarr, Ford, Fainman
  • 3. 2013-2015: Creating a “Big Data” Backplane on Campus: NSF CC-NIE Funded Prism@UCSD and CHERuB Prism@UCSD, Phil Papadopoulos, SDSC, Calit2, PI; Smarr co-PI CHERuB, Mike Norman, SDSC PI CHERuB
  • 4. (GDC) 2015-2020: The Pacific Research Platform Connects Campus “Big Data Freeways” to Create a Regional End-to-End Science-Driven “Big Data Superhighway” System NSF CC*DNI Grant $6M 10/2015-10/2020 PI: Larry Smarr, UC San Diego Calit2 Co-PIs: • Camille Crittenden, UC Berkeley CITRIS, • Tom DeFanti, UC San Diego Calit2/QI, • Philip Papadopoulos, UCSD SDSC, • Frank Wuerthwein, UCSD Physics and SDSC Letters of Commitment from: • 50 Researchers from 15 Campuses • 32 IT/Network Organization Leaders Source: John Hess, CENIC UCOP CIO Tom Andriola Provided Funds and ITLC Support for Using Ten UC Campuses For Advanced Technology Testing
  • 5. 2017-2020: CHASE-CI Adds Machine-Learning to the Data-Science Community Cyberinfrastructure Caltech UCB UCI UCR UCSD UCSC Stanford MSU UCM SDSU NSF Grant for 256 High Speed “Cloud” GPUs For 32 ML Faculty & Their Students at 10 Campuses To Train AI Algorithms on Big Data
  • 6. PRP Engineers Designed and Built Several Generations of Optical-Fiber Big-Data Flash I/O Network Appliances (FIONAs) UCSD-Designed FIONAs Solved the Disk-to-Disk Data Transfer Problem at Near Full Speed on Best-Effort 10G, 40G and 100G Networks FIONAs Designed by UCSD’s Phil Papadopoulos, John Graham, Joe Keefe, and Tom DeFanti FIONette— 1G, $250 Used for Training 50 Engineers in 2018-2019 Two FIONA DTNs at UC Santa Cruz: 40G & 100G Up to 200 TeraByte Rotating Storage Add Up to 8 Nvidia GPUs Per FIONA To Add Machine Learning Capability Over 100 Now Deployed on PRP
  • 7. 48 GPUs for OSG Applications UCSD Has Added >350 Game GPUs to Data Sciences Cyberinfrastructure - Devoted to Data Analytics and Machine Learning SunCAVE 70 GPUs WAVE + Vroom 48 GPUs FIONA with 8-Game GPUs 104 GPUs for Students CHASE-CI Grant Provides 96 GPUs at UCSD for Training AI Algorithms on Big Data Plus 288 64-bit GPUs On SDSC’s Comet
  • 8. UCSD’s ITS Adapted PRP FIONA8s To Support Data Science Courses Instructional Data Science Machine Learning Platform: Instead of Spending ~$20,000/Quarter/Course on Commercial Clouds: 97 Courses over 6 Quarters  $4M vs. $240K over 12 Quarters At least 20,000 Students Adam Tilghman, ITS Source: UCSD ITS
  • 9. The Student GPUs Have Supported a Broad Set of Courses Across Campus Source: UCSD ITS
  • 10. The ITS GPUs Have Supported Thousands of Students Source: UCSD ITS
  • 11. Student GPU Demand Is Variable Allowing for Other Student Uses Available to Support: Independent Study, For-Credit Research, External Barter Source: UCSD ITS
  • 12. 2018-2019: PRP Game Changer! Using Kubernetes to Orchestrate Containers Across the PRP “Kubernetes is a way of stitching together a collection of machines into, basically, a big computer,” --Craig Mcluckie, Google and now CEO and Founder of Heptio "Everything at Google runs in a container." --Joe Beda,Google
  • 13. 1 FIONA8 1 FIONA8 100G NVMe 6.4TB 100G NVMe 6.4TB Caltech 40G 160TB UCAR 40G 192TB UCSF 40G 160TB HPWREN 40G 160TB 4 FIONA8s Calit2/UCI 35 FIONA2s 12 FIONA8s 2x40G 160TB HPWREN UCSD 100G Epyc NVMe 100G Gold NVMe 8 FIONA8s + 5 FIONA8s SDSC @ UCSD 40G 160TB UCR 40G 160TB USC 2x40G 160TB UCLA 40G 160TB Stanford U 2 FIONA8s 40G 192TB UCSB 4.5 FIONA8s 100G NVMe 6.4TB 40G 160TB UCSC 40G 160TB U Hawaii Nautilus Kubernetes Cluster Connected by CENIC in California 10 FIONA2s 1 FIONA8 40G 160TB UCM 100Gb/s HPR 17 Campus Nautilus Cluster: 3300 CPU Cores 82 Hosts ~4 PB Storage >350 GPUs: >30M core/hrs/day 40G 160TB HPWREN 100G NVMe 6.4TB 1 FIONA8 2 FIONA4s FPGAs + 2PB BeeGFS SDSU 40G FIONA1 UIC CHASE-CI PRP Disks 10G 3TB CSUSB 40G 192TB U Washington Minority Serving Institution
  • 14. Major CHASE-CI Usage by UCI Over PRP to UCSD CPUs/GPUs Cognitive Anteater Robotics Laboratory (CARL) supervised by Prof. Jeff Krichmar UCICompVis Group supervised by Prof. Charless Fowlkes #ofCores Demo Last Night From Data Think Tank Lab 2 Months
  • 15. Very Cost-Effective for Academic Machine Learning and Data Sharing • Data science researchers need DTNs with lots of storage, encryption and lots of GPUS • One UC spends $40,000 in cloud GPU per published grad student paper • Another spends $20,000 for undergrad ML AWS access in just one course • Instead, add to our Nautilus hypercluster (or clone it & federate): – UCSD ECE Department bought 4 FIONA8s, buying 4 more – UCSD Physics Department. bought 3 FIONA8s, buying 3 more – UCSD CSE researchers bought/are buying FIONA8s to add to Nautilus – UCSD Instructional IT has 13 FIONA8s for Machine Learning/AI class labs • Working Storage on Nautilus FIONAs is – very inexpensive (12TB drives are ~$430 each—16 per FIONA. FISA encrypted drives @ same cost) – and very high speed (most FIONAs are 40/100G and are located in ScienceDMZs) Clemson’s Alex Feltus: “I cannot wait to add a node to the Nautilus compute fabric!” 5/22/2019
  • 16. Nautilus Usage April 17, 2019 to July 17, 2019
  • 17. Biggest Nautilus GPU Users December – April, 2019 CSE ECE Struc. Eng
  • 19. Original PRP CENIC/PW Link 2018-2019: National-Scale Pilot - Using CENIC & Internet2 to Connect Quilt Regional R&E Networks Announced May 8, 2018 Internet2 Global Summit “Towards The NRP” 3-Year Grant Funded by NSF $2.5M OAC-1826967 PI Smarr Co-PIs Altintas Papadopoulos Wuerthwein Rosing Mgr: DeFanti NRP Pilot NSF CENIC Link
  • 20. CENIC/PW Link 40G 3TB U Hawaii 40G 160TB NCAR-WY 40G 192TB UWashington 100G FIONA I2 Chicago 100G FIONA I2 Kansas City 10G FIONA1 40G FIONA UIC 100G FIONA I2 NYC 40G 3TB StarLight United States PRP Nautilus Hypercluster FIONAs We Now Connect 3 More Regionals and 3 Internet2 sites
  • 21. Global PRP Nautilus Hypercluster Is Rapidly Increasing Partners Beyond Our Original Partner in Amsterdam—May 2019 PRP PRPv2 Nautilus Transoceanic Nodes Guam Asian Pacific RP Transoceanic Nodes Australia Korea Singapore Netherlands 10G 35TB UvA 40G FIONA6 40G 28TB KISTI 10G (coming) U of Guam 100G 35TB U of Queensland Transoceanic Nodes Show Distance is Not the Barrier to Above 5Gb/s Disk-to-Disk Performance
  • 22. PRP is Science-Driven: Connecting Multi-Campus Application Teams and Devices Earth Sciences UC San Diego UCBerkeley
  • 23. Director: F. Martin Ralph Big Data Collaboration with: Source: Scott Sellers, PhD CHRS; Postdoc CW3E Collaboration on Atmospheric Water in the West Between UC San Diego and UC Irvine Director, Soroosh Sorooshian, UCSD
  • 24. Calit2’s FIONA SDSC’s COMET Calit2’s FIONA Pacific Research Platform (10-100 Gb/s) GPUsGPUs Complete Workflow Time: 19.2 Days52 Minutes! UC, Irvine UC, San Diego PRP Shortened Scott Sellar’s Workflow From 19.2 Days to 52 Minutes - 532 Times Faster! Source: Scott Sellers, US State Dept.
  • 25. OSG IceCube Usage on PRP (Purple Segment) 3/9/19: Using 126 GPUs + 142 CPUs + 49 GB RAM GPU Simulations Needed to Improve Ice Model. => Results in Significant Improvement in Pointing Resolution for Multi-Messenger Astrophysics IceCube
  • 26. PRP Actively Develops Diversity • Grants – 3 Female co-PIs – 1 Hispanic co-PI • Campuses – 8 Minority-Serving Institutions in PRP/Nautilus • Workshops – NRP’18 Workshop Program Committee 80% Female – Multiple MSI, EPSCoR Focused Workshops Jackson State University PRP MSI Workshop Presenting FIONettes
  • 27. Installing FIONAs Across California in Late 2018 and Early 2019 To Enhance User’s CPU and GPU Computing, Data Posting, and Data Transfers UC Merced Stanford UC Santa Barbara UC Riverside UC Santa Cruz UC Irvine