SlideShare une entreprise Scribd logo
1  sur  12
Télécharger pour lire hors ligne
Arecibo
Observatory
Data Movement:
so much more than data
2021.05.12
Julio Alvarado Negron
Big Data Program Manager @ Arecibo Observatory
George B. Robb III,
EPOC - Performance Chaser
ESnet - Infrastructure Team
Globus World 2021
What is Big Data?
A collection of data that is huge in volume and yet
growing exponentially with time. In short such data is so
large and complex that limited traditional data
management tools are able to store it or process it
efficiently.
Examples
The New York Stock Exchange generates about
1TB of new trade data per day. Facebook generates over
500TB of data daily. A jet engine generates over 10TB of
data in 30 minutes of flight.
AO has the capability to generate over 80TB per
day, with a total of over 3PB of data stored.
Big Data @ AO
- Data Management and Governance practices
implementation
- Facilitate access to community to Arecibo’s data
- Enables access to High-Performing Computing
- Implement best practices and lessons learned from
partner observatories and research community
Big Data Overview
A Full Spectrum Pioneer of Sciences Since 1963
Is the study of radio waves
produced by a astronomical objects
such as Sun, planets, pulsars,
stars, etc. Arecibo radio telescope
sensitivity allows astronomers to
detect faint radio signals from
far-off regions of the universe.
Fast Radio Bursts, Pulsars,
Spectral line, Exoplanets, VLBI.
More Info Here
Radio Astronomy
Is the investigation of the earth's
gaseous envelope. The Arecibo
Radio Telescope can measure the
growth and decay of disturbances
in ionosphere (altitudes above 30
miles). The "big dish" is also used
to study plasma physics processes
in the electrically charged regions
where radio waves are influenced
most.
More Info Here
Atmospheric Sciences
The Arecibo Observatory was the
world's most powerful planetary
radar system. The 305 meter
Arecibo telescope equipped with a
1 MW transmitter at S-band (12.6
cm, 2380 MHz) was used for
studies of small bodies in the solar
system, terrestrial planets, and
planetary satellites including the
Moon.
Near Earth Asteroids
characterization, Surface Structure
(spacecrafts landing)
More Info Here
Planetary Radar
ALFA
The Arecibo L-band Feed Array (ALFA) is a seven feed system that allows large-scale surveys of the sky to be
conducted with unprecedented sensitivity using the 305-m Arecibo telescope in Puerto Rico. ALFA, operating near 1.4 GHz,
consists of a cluster of seven cooled dual-polarization feeds, a fiber-optical transmission system, and digital back-end signal
processors.
Most of this projects are considered “surveys” due to their nature. The radar is left static in a position while the Earth
rotates, allowing to “drift scan” the sky above Arecibo.
It could generate an aggregate of 875MB/s, 76TB per day.
Knowing the Sources and Discoveries
Using ALFA for ALFALFA
Knowing the Sources and Discoveries
Venus Characterization
Venus is covered in a thick layer of clouds, but Arecibo’s radar beams were able to cut through that haze and
bounce off of the rocky planet’s surface, allowing researchers to map the terrain.
In the figures, we can compare the first large scale view of Venus (1971) and the 2015 image with improved
equipments.
- Arecibo Discoveries
Knowing the Sources and Discoveries
Fast Radio Bursts
Fast radio bursts, or FRBs, are brief, brilliant blasts of radio waves with unknown origins. The first FRB known to
give off multiple bursts was FRB 121102, which Arecibo first spotted in 2012 and again in 2015.
Arecibo’s discovery backed up the theory from the Charles Parkes telescope in Australia that FRB’s are events
that come farther than the Milky Way.
Radio bursts are observed during 90 days followed by a silent period of 67 days. The same behaviour then repeats
every 157 days.
- Arecibo Discoveries
50+ Years of Contributions
50+ Years of Contributions
First Cable Snaps
On August 10th a first cable snaps
causing damage to the dish.
Second Cable Snaps
On November 6th a second cable
snaps causing major damage to the
dish.
December’s Check Mate
A main support cable broke from
Tower 4, causing the platform to fall
over the dish.
The team got together and
realized that the data safety and
integrity was a priority.
A Sequence of Snaps
The Big Picture
Arecibo Observatory holds over
3PB of data onsite. This amount is
spread between active hard drives,
offline disks and the tape library.
Arecibo also has copies of data
stored on various institutions across the
globe, to which we refer to as offsite
data.
Not enough fiber
Arecibo’s Internet connection is
limited to 1Gbps due to the condition of
the infrastructure to the site.
With the existing connection,
transferring 3PB would need over 24
months.
The Data in Numbers and Infra Limitations
The Call for Help
Right after the collapse, the team
at Arecibo understood the urgency of
adding redundancy and safekeep the
data. Immediately, we reached the
Office of Research at UCF. From there,
the logistics were driven funneled
through the Research community.
Getting the Teams Together
In a matter of days, Arecibo got
connected to working teams, BIG THANKS:
- EPOC/ESnet - transfer optimization
and hardware
- CICoE - data management practices
- TACC - high performance
computing and storage
- Univ of Puerto Rico HPCf - 10Gbps
connectivity (I2 - AMPATH)
- Engine-4 - 10Gbps connectivity
- Globus - data transfer optimization
The SOS Call THANK
YOU!
Data Migration
Once the working groups worked intensely
to establish the processes and the mechanisms, the
team at Arecibo proceeded to load the data to the
NAS boxes.
Those boxes are being taken to our partners,
University of Puerto Rico at Mayaguez (UPRM) and
Engine-4 (E4) in Bayamon. From there, the data is
uploaded to the TACC via 10Gbps links.
The UPRM has a 10Gbps via the AMPATH (I2)
and E4 has a 10Gbps via commercial route.
Benchmarks
Before utilizing Globus, the team relied in
rsync to move the data from Arecibo and the
partners. That resulted in an avg transfer speed of
47MBps via 10Gbps wire.
Once Globus Connect Personal was installed
and configure in the NAS, the Effective Speed
reported has been sustained at over 200MBps.
The Data Transfer
Arecibo Data Transfer
Project
Data uploaded to Computing Center
2
S
t
o
r
a
g
e
t
r
a
n
s
p
o
r
t
e
d
b
a
c
k
t
o
A
O
3
1
D
a
t
a
t
r
a
n
s
p
o
r
t
e
d
t
o
P
a
r
t
n
e
r

Contenu connexe

Tendances

Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025Larry Smarr
 
Solving Network Throughput Problems at the Diamond Light Source
Solving Network Throughput Problems at the Diamond Light SourceSolving Network Throughput Problems at the Diamond Light Source
Solving Network Throughput Problems at the Diamond Light SourceJisc
 
Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformLarry Smarr
 
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained WorldInternet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained WorldLarry Smarr
 
Cyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesCyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesLarry Smarr
 
AAPG GTW 2017: Deep Water and Shelf Reservoirs
AAPG GTW 2017: Deep Water and Shelf ReservoirsAAPG GTW 2017: Deep Water and Shelf Reservoirs
AAPG GTW 2017: Deep Water and Shelf ReservoirsDustin Dewett
 
Improving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domainImproving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domainClaudia Vitolo
 
Long Term Ecological Research Network
Long Term Ecological Research NetworkLong Term Ecological Research Network
Long Term Ecological Research NetworkTERN Australia
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
Applying the Systems Engineering Process to a Conceptual Merucry CubeSat Mission
Applying the Systems Engineering Process to a Conceptual Merucry CubeSat MissionApplying the Systems Engineering Process to a Conceptual Merucry CubeSat Mission
Applying the Systems Engineering Process to a Conceptual Merucry CubeSat MissionKaren Grothe
 
Ceoa Nov 2005 Final Small
Ceoa Nov 2005 Final SmallCeoa Nov 2005 Final Small
Ceoa Nov 2005 Final SmallLarry Smarr
 
Pacific Research Platform Science Drivers
Pacific Research Platform Science DriversPacific Research Platform Science Drivers
Pacific Research Platform Science DriversLarry Smarr
 
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...Larry Smarr
 
AusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data CubesAusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data CubesTERN Australia
 
Provisioning Janet
Provisioning JanetProvisioning Janet
Provisioning JanetJisc
 
big_data_casestudies_2.ppt
big_data_casestudies_2.pptbig_data_casestudies_2.ppt
big_data_casestudies_2.pptvishal choudhary
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Laurent Lefort
 
ApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRLucaCinquini
 
Linked Sensor Data cube
Linked Sensor Data cubeLinked Sensor Data cube
Linked Sensor Data cubeLaurent Lefort
 

Tendances (20)

Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025Looking Back, Looking Forward NSF CI Funding 1985-2025
Looking Back, Looking Forward NSF CI Funding 1985-2025
 
Solving Network Throughput Problems at the Diamond Light Source
Solving Network Throughput Problems at the Diamond Light SourceSolving Network Throughput Problems at the Diamond Light Source
Solving Network Throughput Problems at the Diamond Light Source
 
Security Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research PlatformSecurity Challenges and the Pacific Research Platform
Security Challenges and the Pacific Research Platform
 
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained WorldInternet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
Internet & Climate Change: Cyberinfrastructure for a Carbon-Constrained World
 
Cyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean ObservatoriesCyberinfrastructure to Support Ocean Observatories
Cyberinfrastructure to Support Ocean Observatories
 
AAPG GTW 2017: Deep Water and Shelf Reservoirs
AAPG GTW 2017: Deep Water and Shelf ReservoirsAAPG GTW 2017: Deep Water and Shelf Reservoirs
AAPG GTW 2017: Deep Water and Shelf Reservoirs
 
Improving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domainImproving access to geospatial Big Data in the hydrology domain
Improving access to geospatial Big Data in the hydrology domain
 
Long Term Ecological Research Network
Long Term Ecological Research NetworkLong Term Ecological Research Network
Long Term Ecological Research Network
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Applying the Systems Engineering Process to a Conceptual Merucry CubeSat Mission
Applying the Systems Engineering Process to a Conceptual Merucry CubeSat MissionApplying the Systems Engineering Process to a Conceptual Merucry CubeSat Mission
Applying the Systems Engineering Process to a Conceptual Merucry CubeSat Mission
 
Ceoa Nov 2005 Final Small
Ceoa Nov 2005 Final SmallCeoa Nov 2005 Final Small
Ceoa Nov 2005 Final Small
 
Pacific Research Platform Science Drivers
Pacific Research Platform Science DriversPacific Research Platform Science Drivers
Pacific Research Platform Science Drivers
 
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...
LambdaGrids--Earth and Planetary Sciences Driving High Performance Networks a...
 
AusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data CubesAusCover Earth Observation Services and Data Cubes
AusCover Earth Observation Services and Data Cubes
 
Provisioning Janet
Provisioning JanetProvisioning Janet
Provisioning Janet
 
big_data_casestudies_2.ppt
big_data_casestudies_2.pptbig_data_casestudies_2.ppt
big_data_casestudies_2.ppt
 
EOSDIS Status
EOSDIS StatusEOSDIS Status
EOSDIS Status
 
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
Using the Data Cube vocabulary for Publishing Environmental Linked Data on la...
 
ApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTRApacheCon NA 2013 VFASTR
ApacheCon NA 2013 VFASTR
 
Linked Sensor Data cube
Linked Sensor Data cubeLinked Sensor Data cube
Linked Sensor Data cube
 

Similaire à GlobusWorld 2021: Saving Arecibo Observatory Data

Toward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing CyberinfrastructureToward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing CyberinfrastructureLarry Smarr
 
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean SciencesThe Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean SciencesLarry Smarr
 
LSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing SystemLSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing SystemMario Juric
 
GaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data ProductGaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data ProductMario Juric
 
PERICLES Preserving space data
PERICLES Preserving space dataPERICLES Preserving space data
PERICLES Preserving space dataPERICLES_FP7
 
Information Technology Infrastructure Committee (ITIC)
Information Technology Infrastructure Committee (ITIC)Information Technology Infrastructure Committee (ITIC)
Information Technology Infrastructure Committee (ITIC)Larry Smarr
 
Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...
Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...
Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...Cybera Inc.
 
SOI Annual Report 2
SOI Annual Report 2SOI Annual Report 2
SOI Annual Report 2Kelly Young
 
Monitoring Oceans - Chris Atherton - SRD23
Monitoring Oceans - Chris Atherton - SRD23Monitoring Oceans - Chris Atherton - SRD23
Monitoring Oceans - Chris Atherton - SRD23SURFevents
 
ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...
ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...
ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...ESCAPE EU
 
CENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big DataCENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big DataLarry Smarr
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research PlatformLarry Smarr
 
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...Joint ALMA Observatory
 
Round Table Introduction: Analytics on 100 TB+ catalogs
Round Table Introduction: Analytics on 100 TB+ catalogsRound Table Introduction: Analytics on 100 TB+ catalogs
Round Table Introduction: Analytics on 100 TB+ catalogsMario Juric
 
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...ESCAPE EU
 
TERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart PhinnTERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart PhinnTERN Australia
 
The Coming Revolution in Environmental Awareness
The Coming Revolution in Environmental AwarenessThe Coming Revolution in Environmental Awareness
The Coming Revolution in Environmental AwarenessLarry Smarr
 

Similaire à GlobusWorld 2021: Saving Arecibo Observatory Data (20)

Toward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing CyberinfrastructureToward a Global Interactive Earth Observing Cyberinfrastructure
Toward a Global Interactive Earth Observing Cyberinfrastructure
 
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean SciencesThe Emerging Cyberinfrastructure for Earth and Ocean Sciences
The Emerging Cyberinfrastructure for Earth and Ocean Sciences
 
LSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing SystemLSST/DM: Building a Next Generation Survey Data Processing System
LSST/DM: Building a Next Generation Survey Data Processing System
 
GaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data ProductGaiaCal2014: Creating and Calibrating LSST Data Product
GaiaCal2014: Creating and Calibrating LSST Data Product
 
PERICLES Preserving space data
PERICLES Preserving space dataPERICLES Preserving space data
PERICLES Preserving space data
 
6%2E2017-2021
6%2E2017-20216%2E2017-2021
6%2E2017-2021
 
Information Technology Infrastructure Committee (ITIC)
Information Technology Infrastructure Committee (ITIC)Information Technology Infrastructure Committee (ITIC)
Information Technology Infrastructure Committee (ITIC)
 
Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...
Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...
Virtual Observatories as the Drivers of Space Science - Robert Rankin, Univer...
 
SOI Annual Report 2
SOI Annual Report 2SOI Annual Report 2
SOI Annual Report 2
 
SVO Activities - SEA 2008
SVO Activities - SEA 2008SVO Activities - SEA 2008
SVO Activities - SEA 2008
 
Monitoring Oceans - Chris Atherton - SRD23
Monitoring Oceans - Chris Atherton - SRD23Monitoring Oceans - Chris Atherton - SRD23
Monitoring Oceans - Chris Atherton - SRD23
 
ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...
ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...
ESCAPE Kick-off meeting - FAIR, Facility for Antiproton and Ion Research (Feb...
 
CENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big DataCENIC: Pacific Wave and PRP Update Big News for Big Data
CENIC: Pacific Wave and PRP Update Big News for Big Data
 
The Pacific Research Platform
The Pacific Research PlatformThe Pacific Research Platform
The Pacific Research Platform
 
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
Building a National Virtual Observatory: The Case of the Spanish Virtual Obse...
 
Round Table Introduction: Analytics on 100 TB+ catalogs
Round Table Introduction: Analytics on 100 TB+ catalogsRound Table Introduction: Analytics on 100 TB+ catalogs
Round Table Introduction: Analytics on 100 TB+ catalogs
 
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...
ESCAPE Kick-off meeting - KM3Net, Opening a new window on our universe (Feb 2...
 
The Next Decade of ISS and Beyond
The Next Decade of ISS and BeyondThe Next Decade of ISS and Beyond
The Next Decade of ISS and Beyond
 
TERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart PhinnTERN Facility Portals - Stuart Phinn
TERN Facility Portals - Stuart Phinn
 
The Coming Revolution in Environmental Awareness
The Coming Revolution in Environmental AwarenessThe Coming Revolution in Environmental Awareness
The Coming Revolution in Environmental Awareness
 

Plus de Globus

Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration TopicsGlobus
 
Instrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowInstrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowGlobus
 
Building Research Applications with Globus PaaS
Building Research Applications with Globus PaaSBuilding Research Applications with Globus PaaS
Building Research Applications with Globus PaaSGlobus
 
Reliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesReliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesGlobus
 
Best Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusBest Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusGlobus
 
An Introduction to Globus for Researchers
An Introduction to Globus for ResearchersAn Introduction to Globus for Researchers
An Introduction to Globus for ResearchersGlobus
 
Introduction to Research Automation with Globus
Introduction to Research Automation with GlobusIntroduction to Research Automation with Globus
Introduction to Research Automation with GlobusGlobus
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System AdministratorsGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Introduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersGlobus
 
Introduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersGlobus
 
Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Globus
 
Automating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeGlobus
 
Automating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformGlobus
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System AdministrationGlobus
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System AdministratorsGlobus
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New UsersGlobus
 
Working with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsWorking with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsGlobus
 
Globus Automation
Globus AutomationGlobus Automation
Globus AutomationGlobus
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System AdministrationGlobus
 

Plus de Globus (20)

Advanced Globus System Administration Topics
Advanced Globus System Administration TopicsAdvanced Globus System Administration Topics
Advanced Globus System Administration Topics
 
Instrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a FlowInstrument Data Automation: The Life of a Flow
Instrument Data Automation: The Life of a Flow
 
Building Research Applications with Globus PaaS
Building Research Applications with Globus PaaSBuilding Research Applications with Globus PaaS
Building Research Applications with Globus PaaS
 
Reliable, Remote Computation at All Scales
Reliable, Remote Computation at All ScalesReliable, Remote Computation at All Scales
Reliable, Remote Computation at All Scales
 
Best Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using GlobusBest Practices for Data Sharing Using Globus
Best Practices for Data Sharing Using Globus
 
An Introduction to Globus for Researchers
An Introduction to Globus for ResearchersAn Introduction to Globus for Researchers
An Introduction to Globus for Researchers
 
Introduction to Research Automation with Globus
Introduction to Research Automation with GlobusIntroduction to Research Automation with Globus
Introduction to Research Automation with Globus
 
Globus for System Administrators
Globus for System AdministratorsGlobus for System Administrators
Globus for System Administrators
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Introduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for ResearchersIntroduction to Data Transfer and Sharing for Researchers
Introduction to Data Transfer and Sharing for Researchers
 
Introduction to the Globus Platform for Developers
Introduction to the Globus Platform for DevelopersIntroduction to the Globus Platform for Developers
Introduction to the Globus Platform for Developers
 
Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)Introduction to the Command Line Interface (CLI)
Introduction to the Command Line Interface (CLI)
 
Automating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and ComputeAutomating Research Data with Globus Flows and Compute
Automating Research Data with Globus Flows and Compute
 
Automating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus PlatformAutomating Research Data Flows and Introduction to the Globus Platform
Automating Research Data Flows and Introduction to the Globus Platform
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
 
Introduction to Globus for System Administrators
Introduction to Globus for System AdministratorsIntroduction to Globus for System Administrators
Introduction to Globus for System Administrators
 
Introduction to Globus for New Users
Introduction to Globus for New UsersIntroduction to Globus for New Users
Introduction to Globus for New Users
 
Working with Globus Platform Services and Portals
Working with Globus Platform Services and PortalsWorking with Globus Platform Services and Portals
Working with Globus Platform Services and Portals
 
Globus Automation
Globus AutomationGlobus Automation
Globus Automation
 
Advanced Globus System Administration
Advanced Globus System AdministrationAdvanced Globus System Administration
Advanced Globus System Administration
 

Dernier

Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension AidPhilip Schwarz
 
WSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security ProgramWSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security ProgramWSO2
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park masabamasaba
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplatePresentation.STUDIO
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareJim McKeeth
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba
 
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...masabamasaba
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Bert Jan Schrijver
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisamasabamasaba
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...masabamasaba
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park masabamasaba
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...masabamasaba
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024VictoriaMetrics
 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benonimasabamasaba
 

Dernier (20)

Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
Direct Style Effect Systems -The Print[A] Example- A Comprehension AidDirect Style Effect Systems -The Print[A] Example- A Comprehension Aid
Direct Style Effect Systems - The Print[A] Example - A Comprehension Aid
 
WSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security ProgramWSO2CON 2024 - How to Run a Security Program
WSO2CON 2024 - How to Run a Security Program
 
WSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - KeynoteWSO2Con204 - Hard Rock Presentation - Keynote
WSO2Con204 - Hard Rock Presentation - Keynote
 
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park %in kempton park+277-882-255-28 abortion pills for sale in kempton park
%in kempton park+277-882-255-28 abortion pills for sale in kempton park
 
AI & Machine Learning Presentation Template
AI & Machine Learning Presentation TemplateAI & Machine Learning Presentation Template
AI & Machine Learning Presentation Template
 
WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?WSO2CON 2024 - Does Open Source Still Matter?
WSO2CON 2024 - Does Open Source Still Matter?
 
Announcing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK SoftwareAnnouncing Codolex 2.0 from GDK Software
Announcing Codolex 2.0 from GDK Software
 
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
%in Bahrain+277-882-255-28 abortion pills for sale in Bahrain
 
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
%+27788225528 love spells in Toronto Psychic Readings, Attraction spells,Brin...
 
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
Devoxx UK 2024 - Going serverless with Quarkus, GraalVM native images and AWS...
 
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
Abortion Pills In Pretoria ](+27832195400*)[ 🏥 Women's Abortion Clinic In Pre...
 
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa%in tembisa+277-882-255-28 abortion pills for sale in tembisa
%in tembisa+277-882-255-28 abortion pills for sale in tembisa
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
%+27788225528 love spells in Boston Psychic Readings, Attraction spells,Bring...
 
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park %in ivory park+277-882-255-28 abortion pills for sale in ivory park
%in ivory park+277-882-255-28 abortion pills for sale in ivory park
 
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
WSO2CON 2024 - Building the API First Enterprise – Running an API Program, fr...
 
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
%+27788225528 love spells in Knoxville Psychic Readings, Attraction spells,Br...
 
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
Large-scale Logging Made Easy: Meetup at Deutsche Bank 2024
 
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
Abortion Pill Prices Tembisa [(+27832195400*)] 🏥 Women's Abortion Clinic in T...
 
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni%in Benoni+277-882-255-28 abortion pills for sale in Benoni
%in Benoni+277-882-255-28 abortion pills for sale in Benoni
 

GlobusWorld 2021: Saving Arecibo Observatory Data

  • 1. Arecibo Observatory Data Movement: so much more than data 2021.05.12 Julio Alvarado Negron Big Data Program Manager @ Arecibo Observatory George B. Robb III, EPOC - Performance Chaser ESnet - Infrastructure Team Globus World 2021
  • 2. What is Big Data? A collection of data that is huge in volume and yet growing exponentially with time. In short such data is so large and complex that limited traditional data management tools are able to store it or process it efficiently. Examples The New York Stock Exchange generates about 1TB of new trade data per day. Facebook generates over 500TB of data daily. A jet engine generates over 10TB of data in 30 minutes of flight. AO has the capability to generate over 80TB per day, with a total of over 3PB of data stored. Big Data @ AO - Data Management and Governance practices implementation - Facilitate access to community to Arecibo’s data - Enables access to High-Performing Computing - Implement best practices and lessons learned from partner observatories and research community Big Data Overview
  • 3. A Full Spectrum Pioneer of Sciences Since 1963 Is the study of radio waves produced by a astronomical objects such as Sun, planets, pulsars, stars, etc. Arecibo radio telescope sensitivity allows astronomers to detect faint radio signals from far-off regions of the universe. Fast Radio Bursts, Pulsars, Spectral line, Exoplanets, VLBI. More Info Here Radio Astronomy Is the investigation of the earth's gaseous envelope. The Arecibo Radio Telescope can measure the growth and decay of disturbances in ionosphere (altitudes above 30 miles). The "big dish" is also used to study plasma physics processes in the electrically charged regions where radio waves are influenced most. More Info Here Atmospheric Sciences The Arecibo Observatory was the world's most powerful planetary radar system. The 305 meter Arecibo telescope equipped with a 1 MW transmitter at S-band (12.6 cm, 2380 MHz) was used for studies of small bodies in the solar system, terrestrial planets, and planetary satellites including the Moon. Near Earth Asteroids characterization, Surface Structure (spacecrafts landing) More Info Here Planetary Radar
  • 4. ALFA The Arecibo L-band Feed Array (ALFA) is a seven feed system that allows large-scale surveys of the sky to be conducted with unprecedented sensitivity using the 305-m Arecibo telescope in Puerto Rico. ALFA, operating near 1.4 GHz, consists of a cluster of seven cooled dual-polarization feeds, a fiber-optical transmission system, and digital back-end signal processors. Most of this projects are considered “surveys” due to their nature. The radar is left static in a position while the Earth rotates, allowing to “drift scan” the sky above Arecibo. It could generate an aggregate of 875MB/s, 76TB per day. Knowing the Sources and Discoveries Using ALFA for ALFALFA
  • 5. Knowing the Sources and Discoveries Venus Characterization Venus is covered in a thick layer of clouds, but Arecibo’s radar beams were able to cut through that haze and bounce off of the rocky planet’s surface, allowing researchers to map the terrain. In the figures, we can compare the first large scale view of Venus (1971) and the 2015 image with improved equipments. - Arecibo Discoveries
  • 6. Knowing the Sources and Discoveries Fast Radio Bursts Fast radio bursts, or FRBs, are brief, brilliant blasts of radio waves with unknown origins. The first FRB known to give off multiple bursts was FRB 121102, which Arecibo first spotted in 2012 and again in 2015. Arecibo’s discovery backed up the theory from the Charles Parkes telescope in Australia that FRB’s are events that come farther than the Milky Way. Radio bursts are observed during 90 days followed by a silent period of 67 days. The same behaviour then repeats every 157 days. - Arecibo Discoveries
  • 7. 50+ Years of Contributions
  • 8. 50+ Years of Contributions
  • 9. First Cable Snaps On August 10th a first cable snaps causing damage to the dish. Second Cable Snaps On November 6th a second cable snaps causing major damage to the dish. December’s Check Mate A main support cable broke from Tower 4, causing the platform to fall over the dish. The team got together and realized that the data safety and integrity was a priority. A Sequence of Snaps
  • 10. The Big Picture Arecibo Observatory holds over 3PB of data onsite. This amount is spread between active hard drives, offline disks and the tape library. Arecibo also has copies of data stored on various institutions across the globe, to which we refer to as offsite data. Not enough fiber Arecibo’s Internet connection is limited to 1Gbps due to the condition of the infrastructure to the site. With the existing connection, transferring 3PB would need over 24 months. The Data in Numbers and Infra Limitations
  • 11. The Call for Help Right after the collapse, the team at Arecibo understood the urgency of adding redundancy and safekeep the data. Immediately, we reached the Office of Research at UCF. From there, the logistics were driven funneled through the Research community. Getting the Teams Together In a matter of days, Arecibo got connected to working teams, BIG THANKS: - EPOC/ESnet - transfer optimization and hardware - CICoE - data management practices - TACC - high performance computing and storage - Univ of Puerto Rico HPCf - 10Gbps connectivity (I2 - AMPATH) - Engine-4 - 10Gbps connectivity - Globus - data transfer optimization The SOS Call THANK YOU!
  • 12. Data Migration Once the working groups worked intensely to establish the processes and the mechanisms, the team at Arecibo proceeded to load the data to the NAS boxes. Those boxes are being taken to our partners, University of Puerto Rico at Mayaguez (UPRM) and Engine-4 (E4) in Bayamon. From there, the data is uploaded to the TACC via 10Gbps links. The UPRM has a 10Gbps via the AMPATH (I2) and E4 has a 10Gbps via commercial route. Benchmarks Before utilizing Globus, the team relied in rsync to move the data from Arecibo and the partners. That resulted in an avg transfer speed of 47MBps via 10Gbps wire. Once Globus Connect Personal was installed and configure in the NAS, the Effective Speed reported has been sustained at over 200MBps. The Data Transfer Arecibo Data Transfer Project Data uploaded to Computing Center 2 S t o r a g e t r a n s p o r t e d b a c k t o A O 3 1 D a t a t r a n s p o r t e d t o P a r t n e r