SlideShare une entreprise Scribd logo
1  sur  14
Télécharger pour lire hors ligne
Data warehousing infrastructure on Cloud
Praveen Hebbagodi
Director of Engineering, Akamai Technologies




                                               Akamai Confidential
Agenda


• Introduction to Akamai
• Akamai BI Solutions
• Data warehousing platform
       • Features
       • Architecture
       • Operations
• Conclusions




Akamai Confidential     Powering a Better Internet   ©2011 Akamai
The Akamai Network…
       … a large-scale on-demand distributed computing platform

                                                    Accelerating Daily Traffic of:
                                                    • 3+ Tbps
                                                    • 11+ million hits per second
                                                    • 10+ million concurrent streams
                                                    • 800+ billion deliveries/day
                                                    • 30+ petabytes/day

                                                    Connecting:
           -   88,000+ Servers                      • 465 million unique IP addresses
           -   1,100+ Networks                      • From 234 countries
           -   1,600+ Locations
           -   650+ Cities
                                                    Deflecting Attack Traffic:
           -   71 Countries
                                                    • From 198 countries
                                                    • Targeting 10,000 unique ports

Akamai Confidential               Powering a Better Internet                      ©2011 Akamai
Major services provided by Akamai
 Content delivery:
        • HTTP/S (15-30% of total HTTP traffic!)
        • Live and On-Demand Streaming

 Application delivery:
        •     Web Application Acceleration
        •     Dynamic Site Acceleration
        •     EdgeComputing
        •     IP Application Acceleration


 Example applications:
        • Online commerce, media delivery,
         B2B/B2C applications, software downloads,
         social networking sites, …
        • You likely use many of our services each day



Akamai Confidential                  Powering a Better Internet   ©2011 Akamai
Akamai BI Solutions

Helps our customers to get deeper insights into their audience and
          content usage in the context of their business
Example Solutions:
Media Analytics
• A comprehensive solution for
  content and audience intelligence
  for broadband media
• Features
     •    Dashboards for Engagement
          Overview, Ad Optimizations and
          Content Usage

     •    Standard Reports with detailed
          engagement and audinece
          information

     •    Custom dimensions and reports to
          suit business specific needs

Akamai Confidential                   Powering a Better Internet   ©2011 Akamai
Akamai BI Solutions: Examples

QOS Monitor
• Real-time quality of
  service monitoring
  solution for online media
  delivery
• Features
     •    Set thresholds for
          breaches, find root causes
          and resolve issues using
          “Notifications”

     •    Live Real-time monitoring
          console with data
          aggregation as fine as 30
          sec.

     •    20 standard reports &
          dashboards for historical
          diagnostics & debugging.


Akamai Confidential                    Powering a Better Internet   ©2011 Akamai
Features


• Intuitive dashboard & report builder UI with advanced
  visualizations
• Over 50 Standard dimensions & metrics
• Support for any customer specific dimensions
       • Regular expression extraction
       • Plug-in API
•    Ad-hoc query, drill-down
•    Lookup tables
•    Dashboards & reports can be provisioned on the fly
•    Real-time notifications
•    Data access via web interface, SOAP API, Email & download
     (CSV, PDF, HTML)



Akamai Confidential              Powering a Better Internet   ©2011 Akamai
Data warehousing Platform



                                         xml
                              xml
Reporting                Data    Data                                   xml
                         Storage Processing                    Data
                                                               Collection

                                                                                 Data Sources
  Analytics workflow programmable via portal
          • Data sources, filters, metrics, dimensions, reports, dashboards configured via
            xml metadata

  Distributed data collection in the Cloud
          • Data sources: end user machines (beacons), edge server logs, agents
          • Filtering, and partial aggregation at the source and in collection layers
          • Facilitates scalability and better utilization of resources



   Akamai Confidential                    Powering a Better Internet               ©2011 Akamai
Data warehousing Platform



                                        xml
                              xml
Reporting                Data    Data                                  xml
                         Storage Processing                   Data
                                                              Collection

                                                                             Data Sources

  Data Processing is a flexible map-reduce framework
          • Dataflow graph of map-reduce operations
          • Enhancements for better latencies, scheduling optimizations
          • Faster message passing interfaces (network, in-memory)




   Akamai Confidential                   Powering a Better Internet           ©2011 Akamai
Data-warehousing Platform




                                                                xml
                                                                          Processing
                  Reporting
                                                  Data
                        Artist Name
                                                  Storage

                                      Time

Data abstraction is a set of data cubes
   • Supports fast slice-and-dice, drill-down operations,…
Data cubes are physically realized in distributed columnar DB
   • SQL interface, column compression, bitmap indexes
   • In-situ updates, write-optimized store
   • Sharding and cluster management
Akamai Confidential                          Powering a Better Internet        ©2011 Akamai
Approach to Operations

• Treat failures as normal

• Build in layers of redundancy
       • At all levels: geo/network, with in a cluster
       • Multi-path communications

• Weaker data consistency models

• Zoning
       • Dynamic Configuration
       • Software Installs

• Design systems that run themselves
       • Autonomic response where appropriate




Akamai Confidential                Powering a Better Internet   ©2011 Akamai
Conclusions
•    Being on cloud facilitates in building highly scalable platform for “big
     data” applications

•    Design for failures
       • Build redundant systems at all levels
       • Multiple levels of fault-tolerance

•    Automation, autonomics, more automation…
       • Avoid “manual changes”
                • They will happen, so have good process to minimize/track


•    Deterministic software and config management system
       • Converges to consistent state & built-in safe roll-back

•    Good tools for understanding system behavior and data quality

•    Sophisticated tools for capacity management and performance
     monitoring


Akamai Confidential                       Powering a Better Internet         ©2011 Akamai
To find out more about Akamai…

More info:                www.akamai.com

Contacting me:
                                           Praveen Hebbagodi
                                         phebbagodi@akamai.com

Technical publications:
                        http://www.akamai.com/html/perspectives/techpubs.html


Jobs:
                      http://www.akamai.com/html/careers/index.html
                           http://twitter.com/akamaijobsindia


Questions?


Akamai Confidential                         Powering a Better Internet          ©2011 Akamai
Thank you




            Akamai Confidential

Contenu connexe

Tendances

Webxpress Home Delivery Solution
Webxpress Home Delivery SolutionWebxpress Home Delivery Solution
Webxpress Home Delivery SolutionWebXpress
 
Hot Cloud Companies: Tap In Systems - The Problem: Managing Cloud Complexities
Hot Cloud Companies: Tap In Systems - The Problem: Managing Cloud ComplexitiesHot Cloud Companies: Tap In Systems - The Problem: Managing Cloud Complexities
Hot Cloud Companies: Tap In Systems - The Problem: Managing Cloud ComplexitiesOpSource
 
A scalable server environment for your applications
A scalable server environment for your applicationsA scalable server environment for your applications
A scalable server environment for your applicationsGigaSpaces
 
SAP HANA INFRA - Amazon Web Services - Cloud
SAP HANA INFRA - Amazon Web Services - CloudSAP HANA INFRA - Amazon Web Services - Cloud
SAP HANA INFRA - Amazon Web Services - CloudSandeep Mahindra
 
WebXpress @ ReTechCon 2012
WebXpress @ ReTechCon 2012WebXpress @ ReTechCon 2012
WebXpress @ ReTechCon 2012WebXpress
 
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?Christopher Foot
 
NephoScale Elastic Networking
NephoScale Elastic NetworkingNephoScale Elastic Networking
NephoScale Elastic NetworkingNephoScale
 
Australia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance SolutionsAustralia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance SolutionsChris McNulty
 
AWS Webcast - Introducing Amazon Redshift
AWS Webcast - Introducing Amazon RedshiftAWS Webcast - Introducing Amazon Redshift
AWS Webcast - Introducing Amazon RedshiftAmazon Web Services
 
Managing Performance in the Cloud
Managing Performance in the CloudManaging Performance in the Cloud
Managing Performance in the CloudDevOpsGroup
 
Dynamics Day 2012: NAV Update and Roadmap
Dynamics Day 2012: NAV Update and RoadmapDynamics Day 2012: NAV Update and Roadmap
Dynamics Day 2012: NAV Update and RoadmapIntergen
 
(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014
(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014
(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014Amazon Web Services
 
AzureSQL Managed Instance (SQLKonferenz 2018)
AzureSQL Managed Instance (SQLKonferenz 2018)AzureSQL Managed Instance (SQLKonferenz 2018)
AzureSQL Managed Instance (SQLKonferenz 2018)Jovan Popovic
 
AWS Partner Presentation - Aspera
AWS Partner Presentation - Aspera AWS Partner Presentation - Aspera
AWS Partner Presentation - Aspera Amazon Web Services
 
A Step By Step Guide To Put DB2 On Amazon Cloud
A Step By Step Guide To Put DB2 On Amazon CloudA Step By Step Guide To Put DB2 On Amazon Cloud
A Step By Step Guide To Put DB2 On Amazon CloudDeepak Rao
 
SoftLayer API 12032015
SoftLayer API  12032015SoftLayer API  12032015
SoftLayer API 12032015Nacho Daza
 
Kuali update v4 - mw
Kuali update   v4 - mwKuali update   v4 - mw
Kuali update v4 - mwsarnoa
 
Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015Vinay Mistry
 

Tendances (20)

Webxpress Home Delivery Solution
Webxpress Home Delivery SolutionWebxpress Home Delivery Solution
Webxpress Home Delivery Solution
 
Hot Cloud Companies: Tap In Systems - The Problem: Managing Cloud Complexities
Hot Cloud Companies: Tap In Systems - The Problem: Managing Cloud ComplexitiesHot Cloud Companies: Tap In Systems - The Problem: Managing Cloud Complexities
Hot Cloud Companies: Tap In Systems - The Problem: Managing Cloud Complexities
 
A scalable server environment for your applications
A scalable server environment for your applicationsA scalable server environment for your applications
A scalable server environment for your applications
 
SAP HANA INFRA - Amazon Web Services - Cloud
SAP HANA INFRA - Amazon Web Services - CloudSAP HANA INFRA - Amazon Web Services - Cloud
SAP HANA INFRA - Amazon Web Services - Cloud
 
WebXpress @ ReTechCon 2012
WebXpress @ ReTechCon 2012WebXpress @ ReTechCon 2012
WebXpress @ ReTechCon 2012
 
2018 jk
2018 jk2018 jk
2018 jk
 
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
Selecting a SQL Server Cloud Platform - IaaS, Amazon RDS or Azure SQL DB?
 
NephoScale Elastic Networking
NephoScale Elastic NetworkingNephoScale Elastic Networking
NephoScale Elastic Networking
 
Australia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance SolutionsAustralia SharePoint Conference 2012 - Quest Governance Solutions
Australia SharePoint Conference 2012 - Quest Governance Solutions
 
AWS Webcast - Introducing Amazon Redshift
AWS Webcast - Introducing Amazon RedshiftAWS Webcast - Introducing Amazon Redshift
AWS Webcast - Introducing Amazon Redshift
 
Managing Performance in the Cloud
Managing Performance in the CloudManaging Performance in the Cloud
Managing Performance in the Cloud
 
Dynamics Day 2012: NAV Update and Roadmap
Dynamics Day 2012: NAV Update and RoadmapDynamics Day 2012: NAV Update and Roadmap
Dynamics Day 2012: NAV Update and Roadmap
 
(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014
(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014
(BIZ401) Kellogg Company Runs SAP in a Hybrid Environment | AWS re:Invent 2014
 
AzureSQL Managed Instance (SQLKonferenz 2018)
AzureSQL Managed Instance (SQLKonferenz 2018)AzureSQL Managed Instance (SQLKonferenz 2018)
AzureSQL Managed Instance (SQLKonferenz 2018)
 
AWS Partner Presentation - Aspera
AWS Partner Presentation - Aspera AWS Partner Presentation - Aspera
AWS Partner Presentation - Aspera
 
A Step By Step Guide To Put DB2 On Amazon Cloud
A Step By Step Guide To Put DB2 On Amazon CloudA Step By Step Guide To Put DB2 On Amazon Cloud
A Step By Step Guide To Put DB2 On Amazon Cloud
 
SoftLayer API 12032015
SoftLayer API  12032015SoftLayer API  12032015
SoftLayer API 12032015
 
Kuali update v4 - mw
Kuali update   v4 - mwKuali update   v4 - mw
Kuali update v4 - mw
 
Scalability Design Principles - Internal Session
Scalability Design Principles - Internal SessionScalability Design Principles - Internal Session
Scalability Design Principles - Internal Session
 
Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015Nippon It Solutions Data services offering 2015
Nippon It Solutions Data services offering 2015
 

En vedette

Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...
Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...
Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...Mia Yuan Cao
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseSnowflake Computing
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingAmazon Web Services
 
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...Amazon Web Services
 
Cloud Computing and your Data Warehouse
Cloud Computing and your Data WarehouseCloud Computing and your Data Warehouse
Cloud Computing and your Data Warehousedrluckyspin
 
Cloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and AnalyticsCloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and AnalyticsSeeling Cheung
 

En vedette (9)

Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...
Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...
Real-Time Analytics and Visualization of Streaming Big Data with JReport & Sc...
 
Introducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data WarehouseIntroducing the Snowflake Computing Cloud Data Warehouse
Introducing the Snowflake Computing Cloud Data Warehouse
 
SendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data WarehousingSendGrid Improves Email Delivery with Hybrid Data Warehousing
SendGrid Improves Email Delivery with Hybrid Data Warehousing
 
Deep Dive on Amazon Redshift
Deep Dive on Amazon RedshiftDeep Dive on Amazon Redshift
Deep Dive on Amazon Redshift
 
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
AWS Summit 2013 | India - Petabyte Scale Data Warehousing at Low Cost, Abhish...
 
Amazon Redshift Masterclass
Amazon Redshift MasterclassAmazon Redshift Masterclass
Amazon Redshift Masterclass
 
Life of data from generation to visualization using big data
Life of data from generation to visualization using big dataLife of data from generation to visualization using big data
Life of data from generation to visualization using big data
 
Cloud Computing and your Data Warehouse
Cloud Computing and your Data WarehouseCloud Computing and your Data Warehouse
Cloud Computing and your Data Warehouse
 
Cloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and AnalyticsCloud Based Data Warehousing and Analytics
Cloud Based Data Warehousing and Analytics
 

Similaire à Data Warehousing Infrastructure on Cloud

Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...
Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...
Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...Odinot Stanislas
 
國際頂尖製造業品牌應用Akamai案例分享
國際頂尖製造業品牌應用Akamai案例分享國際頂尖製造業品牌應用Akamai案例分享
國際頂尖製造業品牌應用Akamai案例分享併力科技 JFT
 
Optimizing Cloud Computing with IPv6
Optimizing Cloud Computing with IPv6Optimizing Cloud Computing with IPv6
Optimizing Cloud Computing with IPv6John Rhoton
 
클라우드 컴퓨팅에 따른 데이터센터의 변화
클라우드 컴퓨팅에 따른 데이터센터의 변화클라우드 컴퓨팅에 따른 데이터센터의 변화
클라우드 컴퓨팅에 따른 데이터센터의 변화Fanny Lee
 
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...confluent
 
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDATAVERSITY
 
A non-technical introduction to Cloud Computing
A non-technical introduction to Cloud ComputingA non-technical introduction to Cloud Computing
A non-technical introduction to Cloud ComputingWilliam Pourmajidi
 
Building real time data-driven products
Building real time data-driven productsBuilding real time data-driven products
Building real time data-driven productsLars Albertsson
 
Oracle Application Management Suite
Oracle Application Management SuiteOracle Application Management Suite
Oracle Application Management SuiteOracleVolutionSeries
 
Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018
Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018
Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018Amazon Web Services
 
Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...
Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...
Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...Chris McNulty
 
CloudOpen Japan - Controlling the cost of your first cloud
CloudOpen Japan - Controlling the cost of your first cloudCloudOpen Japan - Controlling the cost of your first cloud
CloudOpen Japan - Controlling the cost of your first cloudTim Mackey
 
SoftLayer Cloud Services
SoftLayer Cloud ServicesSoftLayer Cloud Services
SoftLayer Cloud ServicesAnna Landolfi
 
SKALI On The Cloud
SKALI On The CloudSKALI On The Cloud
SKALI On The CloudSKALI Group
 
Private Clouds - Business Agility Seminar
Private Clouds - Business Agility SeminarPrivate Clouds - Business Agility Seminar
Private Clouds - Business Agility SeminarExponential_e
 
Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014Tim Mackey
 
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011Antonio Alba
 
Top 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridTop 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridScaleOut Software
 
Building_a_Modern_Data_Platform_in_the_Cloud.pdf
Building_a_Modern_Data_Platform_in_the_Cloud.pdfBuilding_a_Modern_Data_Platform_in_the_Cloud.pdf
Building_a_Modern_Data_Platform_in_the_Cloud.pdfAmazon Web Services
 

Similaire à Data Warehousing Infrastructure on Cloud (20)

Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...
Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...
Scale-out Storage on Intel® Architecture Based Platforms: Characterizing and ...
 
國際頂尖製造業品牌應用Akamai案例分享
國際頂尖製造業品牌應用Akamai案例分享國際頂尖製造業品牌應用Akamai案例分享
國際頂尖製造業品牌應用Akamai案例分享
 
Optimizing Cloud Computing with IPv6
Optimizing Cloud Computing with IPv6Optimizing Cloud Computing with IPv6
Optimizing Cloud Computing with IPv6
 
클라우드 컴퓨팅에 따른 데이터센터의 변화
클라우드 컴퓨팅에 따른 데이터센터의 변화클라우드 컴퓨팅에 따른 데이터센터의 변화
클라우드 컴퓨팅에 따른 데이터센터의 변화
 
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
Billions of Messages in Real Time: Why Paypal & LinkedIn Trust an Engagement ...
 
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled ArchitectureDM Radio Webinar: Adopting a Streaming-Enabled Architecture
DM Radio Webinar: Adopting a Streaming-Enabled Architecture
 
A non-technical introduction to Cloud Computing
A non-technical introduction to Cloud ComputingA non-technical introduction to Cloud Computing
A non-technical introduction to Cloud Computing
 
Building real time data-driven products
Building real time data-driven productsBuilding real time data-driven products
Building real time data-driven products
 
Oracle Application Management Suite
Oracle Application Management SuiteOracle Application Management Suite
Oracle Application Management Suite
 
Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018
Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018
Create a Serverless Web Event Pipeline (ADT301) - AWS re:Invent 2018
 
Cloud computing
Cloud computing Cloud computing
Cloud computing
 
Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...
Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...
Australia SharePoint Conference 2012 - SharePoint Performance - Tales from th...
 
CloudOpen Japan - Controlling the cost of your first cloud
CloudOpen Japan - Controlling the cost of your first cloudCloudOpen Japan - Controlling the cost of your first cloud
CloudOpen Japan - Controlling the cost of your first cloud
 
SoftLayer Cloud Services
SoftLayer Cloud ServicesSoftLayer Cloud Services
SoftLayer Cloud Services
 
SKALI On The Cloud
SKALI On The CloudSKALI On The Cloud
SKALI On The Cloud
 
Private Clouds - Business Agility Seminar
Private Clouds - Business Agility SeminarPrivate Clouds - Business Agility Seminar
Private Clouds - Business Agility Seminar
 
Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014Taming the cost of your first cloud - CCCEU 2014
Taming the cost of your first cloud - CCCEU 2014
 
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
Part 2 OCLC Strategic Presentation Bruce Crocco ACURIL 2011
 
Top 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data GridTop 6 Reasons to Use a Distributed Data Grid
Top 6 Reasons to Use a Distributed Data Grid
 
Building_a_Modern_Data_Platform_in_the_Cloud.pdf
Building_a_Modern_Data_Platform_in_the_Cloud.pdfBuilding_a_Modern_Data_Platform_in_the_Cloud.pdf
Building_a_Modern_Data_Platform_in_the_Cloud.pdf
 

Plus de tdwiindia

TDWI Speaker profiles
TDWI Speaker profilesTDWI Speaker profiles
TDWI Speaker profilestdwiindia
 
TDWI Inda BI on Cloud Future State Vision
TDWI Inda BI on Cloud Future State VisionTDWI Inda BI on Cloud Future State Vision
TDWI Inda BI on Cloud Future State Visiontdwiindia
 
Tdwi event summary
Tdwi event summaryTdwi event summary
Tdwi event summarytdwiindia
 
Intel IT Cloud Strategy
Intel IT Cloud StrategyIntel IT Cloud Strategy
Intel IT Cloud Strategytdwiindia
 
BI on Cloud - Perspective from SAP
BI on Cloud - Perspective from SAPBI on Cloud - Perspective from SAP
BI on Cloud - Perspective from SAPtdwiindia
 
Big data appliances for BI on Cloud
Big data appliances for BI on CloudBig data appliances for BI on Cloud
Big data appliances for BI on Cloudtdwiindia
 
BI on Cloud Computing
BI on Cloud ComputingBI on Cloud Computing
BI on Cloud Computingtdwiindia
 
What is BI on Cloud
What is BI on CloudWhat is BI on Cloud
What is BI on Cloudtdwiindia
 
Business Intelligence on Cloud: A Business Perspective
Business Intelligence on Cloud: A Business PerspectiveBusiness Intelligence on Cloud: A Business Perspective
Business Intelligence on Cloud: A Business Perspectivetdwiindia
 

Plus de tdwiindia (9)

TDWI Speaker profiles
TDWI Speaker profilesTDWI Speaker profiles
TDWI Speaker profiles
 
TDWI Inda BI on Cloud Future State Vision
TDWI Inda BI on Cloud Future State VisionTDWI Inda BI on Cloud Future State Vision
TDWI Inda BI on Cloud Future State Vision
 
Tdwi event summary
Tdwi event summaryTdwi event summary
Tdwi event summary
 
Intel IT Cloud Strategy
Intel IT Cloud StrategyIntel IT Cloud Strategy
Intel IT Cloud Strategy
 
BI on Cloud - Perspective from SAP
BI on Cloud - Perspective from SAPBI on Cloud - Perspective from SAP
BI on Cloud - Perspective from SAP
 
Big data appliances for BI on Cloud
Big data appliances for BI on CloudBig data appliances for BI on Cloud
Big data appliances for BI on Cloud
 
BI on Cloud Computing
BI on Cloud ComputingBI on Cloud Computing
BI on Cloud Computing
 
What is BI on Cloud
What is BI on CloudWhat is BI on Cloud
What is BI on Cloud
 
Business Intelligence on Cloud: A Business Perspective
Business Intelligence on Cloud: A Business PerspectiveBusiness Intelligence on Cloud: A Business Perspective
Business Intelligence on Cloud: A Business Perspective
 

Dernier

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 

Dernier (20)

"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 

Data Warehousing Infrastructure on Cloud

  • 1. Data warehousing infrastructure on Cloud Praveen Hebbagodi Director of Engineering, Akamai Technologies Akamai Confidential
  • 2. Agenda • Introduction to Akamai • Akamai BI Solutions • Data warehousing platform • Features • Architecture • Operations • Conclusions Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 3. The Akamai Network… … a large-scale on-demand distributed computing platform Accelerating Daily Traffic of: • 3+ Tbps • 11+ million hits per second • 10+ million concurrent streams • 800+ billion deliveries/day • 30+ petabytes/day Connecting: - 88,000+ Servers • 465 million unique IP addresses - 1,100+ Networks • From 234 countries - 1,600+ Locations - 650+ Cities Deflecting Attack Traffic: - 71 Countries • From 198 countries • Targeting 10,000 unique ports Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 4. Major services provided by Akamai Content delivery: • HTTP/S (15-30% of total HTTP traffic!) • Live and On-Demand Streaming Application delivery: • Web Application Acceleration • Dynamic Site Acceleration • EdgeComputing • IP Application Acceleration Example applications: • Online commerce, media delivery, B2B/B2C applications, software downloads, social networking sites, … • You likely use many of our services each day Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 5. Akamai BI Solutions Helps our customers to get deeper insights into their audience and content usage in the context of their business Example Solutions: Media Analytics • A comprehensive solution for content and audience intelligence for broadband media • Features • Dashboards for Engagement Overview, Ad Optimizations and Content Usage • Standard Reports with detailed engagement and audinece information • Custom dimensions and reports to suit business specific needs Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 6. Akamai BI Solutions: Examples QOS Monitor • Real-time quality of service monitoring solution for online media delivery • Features • Set thresholds for breaches, find root causes and resolve issues using “Notifications” • Live Real-time monitoring console with data aggregation as fine as 30 sec. • 20 standard reports & dashboards for historical diagnostics & debugging. Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 7. Features • Intuitive dashboard & report builder UI with advanced visualizations • Over 50 Standard dimensions & metrics • Support for any customer specific dimensions • Regular expression extraction • Plug-in API • Ad-hoc query, drill-down • Lookup tables • Dashboards & reports can be provisioned on the fly • Real-time notifications • Data access via web interface, SOAP API, Email & download (CSV, PDF, HTML) Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 8. Data warehousing Platform xml xml Reporting Data Data xml Storage Processing Data Collection Data Sources Analytics workflow programmable via portal • Data sources, filters, metrics, dimensions, reports, dashboards configured via xml metadata Distributed data collection in the Cloud • Data sources: end user machines (beacons), edge server logs, agents • Filtering, and partial aggregation at the source and in collection layers • Facilitates scalability and better utilization of resources Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 9. Data warehousing Platform xml xml Reporting Data Data xml Storage Processing Data Collection Data Sources Data Processing is a flexible map-reduce framework • Dataflow graph of map-reduce operations • Enhancements for better latencies, scheduling optimizations • Faster message passing interfaces (network, in-memory) Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 10. Data-warehousing Platform xml Processing Reporting Data Artist Name Storage Time Data abstraction is a set of data cubes • Supports fast slice-and-dice, drill-down operations,… Data cubes are physically realized in distributed columnar DB • SQL interface, column compression, bitmap indexes • In-situ updates, write-optimized store • Sharding and cluster management Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 11. Approach to Operations • Treat failures as normal • Build in layers of redundancy • At all levels: geo/network, with in a cluster • Multi-path communications • Weaker data consistency models • Zoning • Dynamic Configuration • Software Installs • Design systems that run themselves • Autonomic response where appropriate Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 12. Conclusions • Being on cloud facilitates in building highly scalable platform for “big data” applications • Design for failures • Build redundant systems at all levels • Multiple levels of fault-tolerance • Automation, autonomics, more automation… • Avoid “manual changes” • They will happen, so have good process to minimize/track • Deterministic software and config management system • Converges to consistent state & built-in safe roll-back • Good tools for understanding system behavior and data quality • Sophisticated tools for capacity management and performance monitoring Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 13. To find out more about Akamai… More info: www.akamai.com Contacting me: Praveen Hebbagodi phebbagodi@akamai.com Technical publications: http://www.akamai.com/html/perspectives/techpubs.html Jobs: http://www.akamai.com/html/careers/index.html http://twitter.com/akamaijobsindia Questions? Akamai Confidential Powering a Better Internet ©2011 Akamai
  • 14. Thank you Akamai Confidential