SlideShare une entreprise Scribd logo
1  sur  17
Télécharger pour lire hors ligne
November 13, 2014 I Las Vegas 
Matt Carroll, CTO, Defense & Intelligence 
CSC
The problem 
Over 400+ apps within its 
enterprise 
Over 1000+ active data 
sources consuming data on 
the order of TBs daily 
Network supports over 
230,000 daily users with 
mission and business needs 
Apps 
Data 
Users 
Network 
Security 
Multiple networks deployed 
worldwide on multiple continents 
Every capability runs through a 
lengthy certification and 
accreditation process (4–6 mo) 
Disparate activities across apps 
and data have left little 
quantitative data 
We faced a highly complex environment for a US Government customer that 
had a large dependency on legacy systems with a need to modernize quickly 
Metrics
Customer challenges 
Budget 
•Not enough money to transition every app to take advantage of Big Data or a distributed system 
•Outsourcing IaaSneeds to be monitored for accounting, security, scale, etcwithout complex software 
•Application elasticity is critical to understanding the true costs of operations and maintenance 
•Storage (data) is a much bigger cost than expected 
•Need to consolidate systems engineering support 
While we faced many challenges it became clear early on that budgetand ease of integration for appsmust be our two driving forces 
App migration is not simple 
•Most apps are CRUD based; write a report, find a report 
•Security business logic is baked into each app 
•Number one question: Why can’t I choose the technology that best fits my app? 
•Cannot disrupt operations by any means! 
•Applications must reside on multiple networks and work together 
•Takes too long to get started, laying down databases, web tiers, etc 
Security is the ultimate killer of time 
The process around security became complicated, burdensome and still insufficient to counter threats at scale
Our mission 
Our Missionis to facilitate Big Data analytics across the enterprise by providing the tools necessary to align the work of the application engineer, analytic developer, and data scientist —freeing them tofocus on end products, not infrastructure;we provide this through EzBake 
Big Data should be easy 
Big Data should drive insight 
Big Data should be ubiquitous 
Big Data should be secure
EzBake 
It’s all about making application transition easier!!! Rather than assembling your own big data stack, EzBakeprovides an integrated way to compose the different elements of your application: collecting, processing, storing, and querying data 
Ease of application development 
•Time to market of apps and reuse 
•Autodeploymentand high-availability scaling 
•Integrated analytics and audit trails for logs, metrics, data access, and security events 
Built-in security layer 
•Role-based access and complex policies 
•Down to the object / cell-level controls 
•Encryption in transit 
Data layer 
•Ubiquitous data access (no stovepipes!) 
•Simplified streaming / batch analytics 
•Tailorableand technology agnostic 
•Abstracted index patterns 
Data layer 
Custom applications 
Physical databases 
MongoDBAccumulo 
PostgreSQL(RDS) RedishBase 
ElasticsearchTitan +Custom 
Execution layer 
Stream Batch Query 
Events +More 
Security layer
Key features 
Scaled and commonly used thrift services, typically used during streaming ingest 
Interface for building data flow topologies which abstract physical stream processors 
Both direct access to indices and aggregate query across the various data sets 
Indexing patterns exposed as thrift services and abstracts the physical database 
Amazon Elastic MapReduce (EMR) abstractions that enable complex, multidimensional discovery 
Both at the data persistence and user access layers 
Automated elasticity through a GUI-based deployment 
Streaming ingest (Frack) 
Common services 
Data persistence 
Distributed query 
Security 
Batch analytics 
Deployment
Technology agnostic 
•Instead of a jack-of-all-trades indexing for free text search, geospatial search, etc, use mission-specific indices for specific application logic needs 
•Focus on storage patterns vice database specific operations, thereby enforcing data access standards across the enterprise 
•Allow for new cartridges for web frameworks including Node.js, Python, Ruby, etc. 
Each app has its own needs, and it is not on the platform builder to force the team into a particular technology, rather offer a solution to meet the use case
Easy to deploy and secure 
The platform provisions and scales, like classic PaaS, and embeds data layer connections and security on Amazon EC2 
•Developers pull-down sandbox from the collaboration environment to develop on their local box 
•App / service is output as a WAR and YML file (buildpack) 
•The app registration page allows engineers to deploy and register apps, data feeds, and services on the platform 
•EzDeployersupports dynamic resource management to all capabilities hosted and provisions through Amazon Elastic Compute Cloud (EC2)
App registration 
•Applications carry role-based access controls with human inserted deployment authorization 
•Registration to include data feeds, services, batch jobs, and intents. 
•Ability to assign other users as admin controllers through AWS Identity and Access Management (IAM) controls or other IdAM 
•Cuts down time to deploy and removes the need for app developers to write Puppet scripts 
•Build in account management policies for financial tracking of PaaSand IaaScosts 
Deploy with buildpackssecurely through the application registration page and provide elasticity as a service by abstracting Amazon EC2 services
Lab76: Collaborative development 
•Speed start of development from weeks to hours by enabling a truly agile development environment 
•GitLabwas exposed for source control and promoting the sharing of code across the enterprise through governance and oversight 
•Customized RedMinewas exposed for task management and to allow task oversight and alignment 
•DevOpscould clone an Amazon Virtual Private Cloud and stand up new environments in a day vs. months of setting up for each app or system 
The key to speeding transition was to remove redundancy; by providing a one-stop shop for devtools (Git, RedMine, Jenkins), a means to share code and common development environments, we gained months back from each development team
Leveraging a data layer on SQL and NoSQL the platform abstracts physical data stores and promotes storage patterns to enable ease of sharing, force object-level security, and provide the ability to plug and play databases 
Breaking-down disparate data stores 
•That’s not to say we implement Big Data SQL 
•Instead, we have the model that binds app development, BigData, and security 
•Focus developers towards database abstractions extensible toany database 
So what? 
•Move to production with Big Data without impacting existing SQL based production architecture (think PostgreSQL to RDS) 
•Brings data together across the enterprise helping customers with disparate engineering teams build to a standard
Distributed query 
We distribute object-specific queries across disparate data sets exposed through the data layer while controlling access through the service and at the data level 
•Migrate off-legacy data stores without disrupting production instances 
•Focus on object-based queries across many data sets as well as across Amazon VPC within an environment 
•Work with Clouderato modify Impala to run against multiple data stores 
•Common access controls across multiple data sets 
So what? 
•Common method to discover data across many apps, great for BI tools and third-party apps like Palantir, Tableau, etc. 
•Decreases the duplication of storage across the enterprise through common indexing patterns
Security becomes an API 
•All data is encrypted in transit 
•All transactions are authorized by the security service 
•All data is secured at the object level 
•Robust security service —scales horizontally and generated authorization tokens base on external IdAMproperties 
•Internal group management service scales to trillions of groups and beyond 
•Compressed bitvectorrepresentation of data visibility and access authorizations speeds security computations 
Following several zero-day attacks the enterprise is waking up to security but has no understanding of how to secure their Big Data platforms —a major reason many are not in production 
Bob 
Bob has authorizations: 
X, Y, and Z 
Data 
Data is tagged as: X, Y, and R. Sorry Bob! Only X and Y for you! 
Query 
Object-level security across all data stores through a common API will provide dramatic efficiencies as it decreases time to model data across multiple data stores
Metering and monitoring 
•JavascriptAPI for web apps, Thrift API for services, and REST for others 
•Improve application usability and usefulness by examining analytics on usage patterns 
•Diagnose issues with system, services, and apps 
•Determine cost allocation based on what agencies and organizations are using the system 
To bring back focus on understanding the environment we needed the platform to provide a comprehensive visualization to monitor users, data and services on AWS
Batch (Amino) 
•Removes complexity of Amazon EMR for the average engineer 
•Crowd source microanalyticsthrough analysts and engineers 
•Data agnostic 
•Not a black box 
•Fully scalable 
•Inherent cross-data source linked indexes 
•Encourages sharing of knowledge, discovery 
•Index built to support machine learning 
•Security considered up front —index is in Accumulo 
•Utilized AWS to enable rapid load-balancing to support demand based on data and usage 
Developers can write Amazon Elastic MapReduce(EMR) code to analyze data, but don’t know what to look for; the analysts know what to look for, but don’t know how to write code. Technology is not the problem.It’s enabling the analyst to effectively leverage technology and reuse it.
The impact 
So What? What were the overall accomplishments to date? Well… 
Time: The platform and the development model decreased the development time from 6–8 months to production to 3–4 weeks. 
Lean and Mean: Application teams went from being heavy on DevOps, security, testing to smaller, more agile teams focused on specific-mission use cases 
Most importantly… 
We revectoredteams back to their users, providing more capabilities in less time, thereby saving lives and protecting our country 
Data Shared: Legacy REST/SOAP interfaces have begun to die and time spent on sharing data is down significantly without impacting operations and more apps have more access to data 
Money: Removal of redundant code and system, faster app deployment, cuts in total storage costs, and decrease in team sizes led to a significant cost savings up front for the customer
http://bit.ly/awsevals

Contenu connexe

Tendances

GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017
Jeremy Maranitch
 

Tendances (20)

SnapLogic Cloud Integration
SnapLogic Cloud IntegrationSnapLogic Cloud Integration
SnapLogic Cloud Integration
 
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
Simplifying Your Cloud Architecture with a Logical Data Fabric (APAC)
 
Domain Driven Data: Apache Kafka® and the Data Mesh
Domain Driven Data: Apache Kafka® and the Data MeshDomain Driven Data: Apache Kafka® and the Data Mesh
Domain Driven Data: Apache Kafka® and the Data Mesh
 
Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)
 
Data Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation AnalyticsData Virtualization - Enabling Next Generation Analytics
Data Virtualization - Enabling Next Generation Analytics
 
Big Data Fabric for At-Scale Real-Time Analysis by Edwin Robbins
 Big Data Fabric for At-Scale Real-Time Analysis by Edwin Robbins Big Data Fabric for At-Scale Real-Time Analysis by Edwin Robbins
Big Data Fabric for At-Scale Real-Time Analysis by Edwin Robbins
 
Data Virtualization: From Zero to Hero (Middle East)
Data Virtualization: From Zero to Hero (Middle East)Data Virtualization: From Zero to Hero (Middle East)
Data Virtualization: From Zero to Hero (Middle East)
 
An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018An Introduction to Data Virtualization in 2018
An Introduction to Data Virtualization in 2018
 
Crimson 3 - Final case presentation
Crimson 3 - Final case presentationCrimson 3 - Final case presentation
Crimson 3 - Final case presentation
 
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)
 
Why Data Virtualization? An Introduction.
Why Data Virtualization? An Introduction.Why Data Virtualization? An Introduction.
Why Data Virtualization? An Introduction.
 
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017GigaOm-sector-roadmap-cloud-analytic-databases-2017
GigaOm-sector-roadmap-cloud-analytic-databases-2017
 
Datamesh community meetup 28th jan 2021
Datamesh community meetup 28th jan 2021Datamesh community meetup 28th jan 2021
Datamesh community meetup 28th jan 2021
 
Lessons learned from over 25 Data Virtualization implementations
Lessons learned from over 25 Data Virtualization implementationsLessons learned from over 25 Data Virtualization implementations
Lessons learned from over 25 Data Virtualization implementations
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & BénéficesVirtualisation de données : Enjeux, Usages & Bénéfices
Virtualisation de données : Enjeux, Usages & Bénéfices
 
Data Virtualization: The Agile Delivery Platform
Data Virtualization: The Agile Delivery PlatformData Virtualization: The Agile Delivery Platform
Data Virtualization: The Agile Delivery Platform
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesData Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
Cheetah:Data Warehouse on Top of MapReduce
Cheetah:Data Warehouse on Top of MapReduceCheetah:Data Warehouse on Top of MapReduce
Cheetah:Data Warehouse on Top of MapReduce
 
Cloud Storage Spring Cleaning: A Treasure Hunt
Cloud Storage Spring Cleaning: A Treasure HuntCloud Storage Spring Cleaning: A Treasure Hunt
Cloud Storage Spring Cleaning: A Treasure Hunt
 

En vedette

En vedette (14)

(ENT214) Flying Through Airport Security Using a Multiregion, Managed Solutio...
(ENT214) Flying Through Airport Security Using a Multiregion, Managed Solutio...(ENT214) Flying Through Airport Security Using a Multiregion, Managed Solutio...
(ENT214) Flying Through Airport Security Using a Multiregion, Managed Solutio...
 
HPC in AWS - Technical Workshop
HPC in AWS - Technical WorkshopHPC in AWS - Technical Workshop
HPC in AWS - Technical Workshop
 
AWS Webcast - Build Agile Applications in AWS Cloud for Government
AWS Webcast - Build Agile Applications in AWS Cloud for GovernmentAWS Webcast - Build Agile Applications in AWS Cloud for Government
AWS Webcast - Build Agile Applications in AWS Cloud for Government
 
AWS re:Invent 2016: Modernizing Government in the Cloud in Highly Regulated E...
AWS re:Invent 2016: Modernizing Government in the Cloud in Highly Regulated E...AWS re:Invent 2016: Modernizing Government in the Cloud in Highly Regulated E...
AWS re:Invent 2016: Modernizing Government in the Cloud in Highly Regulated E...
 
AWS Webcast - Webinar Series for State and Local Government #1: Discover Clou...
AWS Webcast - Webinar Series for State and Local Government #1: Discover Clou...AWS Webcast - Webinar Series for State and Local Government #1: Discover Clou...
AWS Webcast - Webinar Series for State and Local Government #1: Discover Clou...
 
(SEC204) AWS GovCloud (US): Not Just for Govies
(SEC204) AWS GovCloud (US): Not Just for Govies(SEC204) AWS GovCloud (US): Not Just for Govies
(SEC204) AWS GovCloud (US): Not Just for Govies
 
AWS Summit Stockholm 2014 – T4 – Continuous integration on AWS
AWS Summit Stockholm 2014 – T4 – Continuous integration on AWSAWS Summit Stockholm 2014 – T4 – Continuous integration on AWS
AWS Summit Stockholm 2014 – T4 – Continuous integration on AWS
 
Cloud eHealth in Medical Imaging & Radiology
Cloud eHealth in Medical Imaging & RadiologyCloud eHealth in Medical Imaging & Radiology
Cloud eHealth in Medical Imaging & Radiology
 
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
AWS re:Invent 2016: Using AWS to Meet Requirements for Education, Healthcare ...
 
HPC in the Cloud
HPC in the CloudHPC in the Cloud
HPC in the Cloud
 
Intro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS CloudIntro to High Performance Computing in the AWS Cloud
Intro to High Performance Computing in the AWS Cloud
 
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
AWS re:Invent 2016: High Performance Computing on AWS (CMP207)
 
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
Easy Analytics on AWS with Amazon Redshift, Amazon QuickSight, and Amazon Mac...
 
(ENT206) Migrating Thousands of Workloads to AWS at Enterprise Scale | AWS re...
(ENT206) Migrating Thousands of Workloads to AWS at Enterprise Scale | AWS re...(ENT206) Migrating Thousands of Workloads to AWS at Enterprise Scale | AWS re...
(ENT206) Migrating Thousands of Workloads to AWS at Enterprise Scale | AWS re...
 

Similaire à (ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014

Connect Ops and Security with Flexible Web App and API Protection
Connect Ops and Security with Flexible Web App and API ProtectionConnect Ops and Security with Flexible Web App and API Protection
Connect Ops and Security with Flexible Web App and API Protection
DevOps.com
 
AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...
AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...
AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...
Amazon Web Services
 

Similaire à (ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014 (20)

Cloudera federal summit
Cloudera federal summitCloudera federal summit
Cloudera federal summit
 
Cloudera Federal Forum 2014: EzBake, the DoDIIS App Engine
Cloudera Federal Forum 2014: EzBake, the DoDIIS App EngineCloudera Federal Forum 2014: EzBake, the DoDIIS App Engine
Cloudera Federal Forum 2014: EzBake, the DoDIIS App Engine
 
Introduction to Microsoft Azure
Introduction to Microsoft AzureIntroduction to Microsoft Azure
Introduction to Microsoft Azure
 
Cloud Computing Overview
Cloud Computing OverviewCloud Computing Overview
Cloud Computing Overview
 
CSC AWS re:Invent Enterprise DevOps session
CSC AWS re:Invent Enterprise DevOps sessionCSC AWS re:Invent Enterprise DevOps session
CSC AWS re:Invent Enterprise DevOps session
 
Migration Recipes for Success - AWS Summit Cape Town 2017
Migration Recipes for Success - AWS Summit Cape Town 2017 Migration Recipes for Success - AWS Summit Cape Town 2017
Migration Recipes for Success - AWS Summit Cape Town 2017
 
IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data IBM Relay 2015: Open for Data
IBM Relay 2015: Open for Data
 
NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...
NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...
NUS-ISS Learning Day 2018- Designing software to make the most of cloud platf...
 
Critical Considerations for Moving Your Core Business Applications to the Clo...
Critical Considerations for Moving Your Core Business Applications to the Clo...Critical Considerations for Moving Your Core Business Applications to the Clo...
Critical Considerations for Moving Your Core Business Applications to the Clo...
 
Connect Ops and Security with Flexible Web App and API Protection
Connect Ops and Security with Flexible Web App and API ProtectionConnect Ops and Security with Flexible Web App and API Protection
Connect Ops and Security with Flexible Web App and API Protection
 
2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics2022 Trends in Enterprise Analytics
2022 Trends in Enterprise Analytics
 
Estimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics PlatformEstimating the Total Costs of Your Cloud Analytics Platform
Estimating the Total Costs of Your Cloud Analytics Platform
 
Data Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWSData Driven Advanced Analytics using Denodo Platform on AWS
Data Driven Advanced Analytics using Denodo Platform on AWS
 
Boot camp - Migration to AWS
Boot camp - Migration to AWSBoot camp - Migration to AWS
Boot camp - Migration to AWS
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
AWS Webcast - Migrating your Data Center to the Cloud
AWS Webcast - Migrating your Data Center to the CloudAWS Webcast - Migrating your Data Center to the Cloud
AWS Webcast - Migrating your Data Center to the Cloud
 
Unit II Cloud Delivery Models.pptx
Unit II Cloud Delivery Models.pptxUnit II Cloud Delivery Models.pptx
Unit II Cloud Delivery Models.pptx
 
AWS Partner: Grindr: Aggregate, Analyze, and Act on 900M Daily API Calls
AWS Partner: Grindr: Aggregate, Analyze, and Act on 900M Daily API CallsAWS Partner: Grindr: Aggregate, Analyze, and Act on 900M Daily API Calls
AWS Partner: Grindr: Aggregate, Analyze, and Act on 900M Daily API Calls
 
AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...
AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...
AWS re:Invent 2016: Effective Application Data Analytics for Modern Applicati...
 
Microservices for Application Modernisation
Microservices for Application ModernisationMicroservices for Application Modernisation
Microservices for Application Modernisation
 

Plus de Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Plus de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 

Dernier (20)

GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 

(ENT211) Migrating the US Government to the Cloud | AWS re:Invent 2014

  • 1. November 13, 2014 I Las Vegas Matt Carroll, CTO, Defense & Intelligence CSC
  • 2. The problem Over 400+ apps within its enterprise Over 1000+ active data sources consuming data on the order of TBs daily Network supports over 230,000 daily users with mission and business needs Apps Data Users Network Security Multiple networks deployed worldwide on multiple continents Every capability runs through a lengthy certification and accreditation process (4–6 mo) Disparate activities across apps and data have left little quantitative data We faced a highly complex environment for a US Government customer that had a large dependency on legacy systems with a need to modernize quickly Metrics
  • 3. Customer challenges Budget •Not enough money to transition every app to take advantage of Big Data or a distributed system •Outsourcing IaaSneeds to be monitored for accounting, security, scale, etcwithout complex software •Application elasticity is critical to understanding the true costs of operations and maintenance •Storage (data) is a much bigger cost than expected •Need to consolidate systems engineering support While we faced many challenges it became clear early on that budgetand ease of integration for appsmust be our two driving forces App migration is not simple •Most apps are CRUD based; write a report, find a report •Security business logic is baked into each app •Number one question: Why can’t I choose the technology that best fits my app? •Cannot disrupt operations by any means! •Applications must reside on multiple networks and work together •Takes too long to get started, laying down databases, web tiers, etc Security is the ultimate killer of time The process around security became complicated, burdensome and still insufficient to counter threats at scale
  • 4. Our mission Our Missionis to facilitate Big Data analytics across the enterprise by providing the tools necessary to align the work of the application engineer, analytic developer, and data scientist —freeing them tofocus on end products, not infrastructure;we provide this through EzBake Big Data should be easy Big Data should drive insight Big Data should be ubiquitous Big Data should be secure
  • 5. EzBake It’s all about making application transition easier!!! Rather than assembling your own big data stack, EzBakeprovides an integrated way to compose the different elements of your application: collecting, processing, storing, and querying data Ease of application development •Time to market of apps and reuse •Autodeploymentand high-availability scaling •Integrated analytics and audit trails for logs, metrics, data access, and security events Built-in security layer •Role-based access and complex policies •Down to the object / cell-level controls •Encryption in transit Data layer •Ubiquitous data access (no stovepipes!) •Simplified streaming / batch analytics •Tailorableand technology agnostic •Abstracted index patterns Data layer Custom applications Physical databases MongoDBAccumulo PostgreSQL(RDS) RedishBase ElasticsearchTitan +Custom Execution layer Stream Batch Query Events +More Security layer
  • 6. Key features Scaled and commonly used thrift services, typically used during streaming ingest Interface for building data flow topologies which abstract physical stream processors Both direct access to indices and aggregate query across the various data sets Indexing patterns exposed as thrift services and abstracts the physical database Amazon Elastic MapReduce (EMR) abstractions that enable complex, multidimensional discovery Both at the data persistence and user access layers Automated elasticity through a GUI-based deployment Streaming ingest (Frack) Common services Data persistence Distributed query Security Batch analytics Deployment
  • 7. Technology agnostic •Instead of a jack-of-all-trades indexing for free text search, geospatial search, etc, use mission-specific indices for specific application logic needs •Focus on storage patterns vice database specific operations, thereby enforcing data access standards across the enterprise •Allow for new cartridges for web frameworks including Node.js, Python, Ruby, etc. Each app has its own needs, and it is not on the platform builder to force the team into a particular technology, rather offer a solution to meet the use case
  • 8. Easy to deploy and secure The platform provisions and scales, like classic PaaS, and embeds data layer connections and security on Amazon EC2 •Developers pull-down sandbox from the collaboration environment to develop on their local box •App / service is output as a WAR and YML file (buildpack) •The app registration page allows engineers to deploy and register apps, data feeds, and services on the platform •EzDeployersupports dynamic resource management to all capabilities hosted and provisions through Amazon Elastic Compute Cloud (EC2)
  • 9. App registration •Applications carry role-based access controls with human inserted deployment authorization •Registration to include data feeds, services, batch jobs, and intents. •Ability to assign other users as admin controllers through AWS Identity and Access Management (IAM) controls or other IdAM •Cuts down time to deploy and removes the need for app developers to write Puppet scripts •Build in account management policies for financial tracking of PaaSand IaaScosts Deploy with buildpackssecurely through the application registration page and provide elasticity as a service by abstracting Amazon EC2 services
  • 10. Lab76: Collaborative development •Speed start of development from weeks to hours by enabling a truly agile development environment •GitLabwas exposed for source control and promoting the sharing of code across the enterprise through governance and oversight •Customized RedMinewas exposed for task management and to allow task oversight and alignment •DevOpscould clone an Amazon Virtual Private Cloud and stand up new environments in a day vs. months of setting up for each app or system The key to speeding transition was to remove redundancy; by providing a one-stop shop for devtools (Git, RedMine, Jenkins), a means to share code and common development environments, we gained months back from each development team
  • 11. Leveraging a data layer on SQL and NoSQL the platform abstracts physical data stores and promotes storage patterns to enable ease of sharing, force object-level security, and provide the ability to plug and play databases Breaking-down disparate data stores •That’s not to say we implement Big Data SQL •Instead, we have the model that binds app development, BigData, and security •Focus developers towards database abstractions extensible toany database So what? •Move to production with Big Data without impacting existing SQL based production architecture (think PostgreSQL to RDS) •Brings data together across the enterprise helping customers with disparate engineering teams build to a standard
  • 12. Distributed query We distribute object-specific queries across disparate data sets exposed through the data layer while controlling access through the service and at the data level •Migrate off-legacy data stores without disrupting production instances •Focus on object-based queries across many data sets as well as across Amazon VPC within an environment •Work with Clouderato modify Impala to run against multiple data stores •Common access controls across multiple data sets So what? •Common method to discover data across many apps, great for BI tools and third-party apps like Palantir, Tableau, etc. •Decreases the duplication of storage across the enterprise through common indexing patterns
  • 13. Security becomes an API •All data is encrypted in transit •All transactions are authorized by the security service •All data is secured at the object level •Robust security service —scales horizontally and generated authorization tokens base on external IdAMproperties •Internal group management service scales to trillions of groups and beyond •Compressed bitvectorrepresentation of data visibility and access authorizations speeds security computations Following several zero-day attacks the enterprise is waking up to security but has no understanding of how to secure their Big Data platforms —a major reason many are not in production Bob Bob has authorizations: X, Y, and Z Data Data is tagged as: X, Y, and R. Sorry Bob! Only X and Y for you! Query Object-level security across all data stores through a common API will provide dramatic efficiencies as it decreases time to model data across multiple data stores
  • 14. Metering and monitoring •JavascriptAPI for web apps, Thrift API for services, and REST for others •Improve application usability and usefulness by examining analytics on usage patterns •Diagnose issues with system, services, and apps •Determine cost allocation based on what agencies and organizations are using the system To bring back focus on understanding the environment we needed the platform to provide a comprehensive visualization to monitor users, data and services on AWS
  • 15. Batch (Amino) •Removes complexity of Amazon EMR for the average engineer •Crowd source microanalyticsthrough analysts and engineers •Data agnostic •Not a black box •Fully scalable •Inherent cross-data source linked indexes •Encourages sharing of knowledge, discovery •Index built to support machine learning •Security considered up front —index is in Accumulo •Utilized AWS to enable rapid load-balancing to support demand based on data and usage Developers can write Amazon Elastic MapReduce(EMR) code to analyze data, but don’t know what to look for; the analysts know what to look for, but don’t know how to write code. Technology is not the problem.It’s enabling the analyst to effectively leverage technology and reuse it.
  • 16. The impact So What? What were the overall accomplishments to date? Well… Time: The platform and the development model decreased the development time from 6–8 months to production to 3–4 weeks. Lean and Mean: Application teams went from being heavy on DevOps, security, testing to smaller, more agile teams focused on specific-mission use cases Most importantly… We revectoredteams back to their users, providing more capabilities in less time, thereby saving lives and protecting our country Data Shared: Legacy REST/SOAP interfaces have begun to die and time spent on sharing data is down significantly without impacting operations and more apps have more access to data Money: Removal of redundant code and system, faster app deployment, cuts in total storage costs, and decrease in team sizes led to a significant cost savings up front for the customer