Soumettre la recherche
Mettre en ligne
Handling Data in Mega Scale Web Systems
•
Télécharger en tant que PPT, PDF
•
7 j'aime
•
1,063 vues
V
Vineet Gupta
Suivre
Technologie
Signaler
Partager
Signaler
Partager
1 sur 54
Télécharger maintenant
Recommandé
Spark
Spark
Nitish Upreti
Hadoop tutorial for beginners-tibacademy.in
Hadoop tutorial for beginners-tibacademy.in
TIB Academy
Distributed Computing with Apache Hadoop: Technology Overview
Distributed Computing with Apache Hadoop: Technology Overview
Konstantin V. Shvachko
The Secrets of Building Realtime Big Data Systems
The Secrets of Building Realtime Big Data Systems
nathanmarz
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Spark Summit
Bhupeshbansal bigdata
Bhupeshbansal bigdata
Bhupesh Bansal
Real-Time Big Data at In-Memory Speed, Using Storm
Real-Time Big Data at In-Memory Speed, Using Storm
Nati Shalom
Meetup ml spark_ppt
Meetup ml spark_ppt
Snehal Nagmote
Recommandé
Spark
Spark
Nitish Upreti
Hadoop tutorial for beginners-tibacademy.in
Hadoop tutorial for beginners-tibacademy.in
TIB Academy
Distributed Computing with Apache Hadoop: Technology Overview
Distributed Computing with Apache Hadoop: Technology Overview
Konstantin V. Shvachko
The Secrets of Building Realtime Big Data Systems
The Secrets of Building Realtime Big Data Systems
nathanmarz
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Spark Summit
Bhupeshbansal bigdata
Bhupeshbansal bigdata
Bhupesh Bansal
Real-Time Big Data at In-Memory Speed, Using Storm
Real-Time Big Data at In-Memory Speed, Using Storm
Nati Shalom
Meetup ml spark_ppt
Meetup ml spark_ppt
Snehal Nagmote
Jstorm introduction-0.9.6
Jstorm introduction-0.9.6
longda feng
Hadoop
Hadoop
Ramakrishna Reddy Bijjam
PHP Backends for Real-Time User Interaction using Apache Storm.
PHP Backends for Real-Time User Interaction using Apache Storm.
DECK36
getFamiliarWithHadoop
getFamiliarWithHadoop
AmirReza Mohammadi
Hadoop fault tolerance
Hadoop fault tolerance
Pallav Jha
Sector Sphere 2009
Sector Sphere 2009
lilyco
HUG Nov 2010: HDFS Raid - Facebook
HUG Nov 2010: HDFS Raid - Facebook
Yahoo Developer Network
Hdfs high availability
Hdfs high availability
Hadoop User Group
Hadoop training-in-hyderabad
Hadoop training-in-hyderabad
sreehari orienit
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
DataStax Academy
Spark vs storm
Spark vs storm
Trong Ton
S4: Distributed Stream Computing Platform
S4: Distributed Stream Computing Platform
Farzad Nozarian
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Data Con LA
Presentation on Hadoop Technology
Presentation on Hadoop Technology
OpenDev
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Sid Anand
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Jen Aman
Yahoo compares Storm and Spark
Yahoo compares Storm and Spark
Chicago Hadoop Users Group
Document Similarity with Cloud Computing
Document Similarity with Cloud Computing
Bryan Bende
Distributed Caching - Cache Unleashed
Distributed Caching - Cache Unleashed
Avishek Patra
Handling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
Directi Group
Hpts 2011 flexible_oltp
Hpts 2011 flexible_oltp
Jags Ramnarayan
Reduce Side Joins
Reduce Side Joins
Edureka!
Contenu connexe
Tendances
Jstorm introduction-0.9.6
Jstorm introduction-0.9.6
longda feng
Hadoop
Hadoop
Ramakrishna Reddy Bijjam
PHP Backends for Real-Time User Interaction using Apache Storm.
PHP Backends for Real-Time User Interaction using Apache Storm.
DECK36
getFamiliarWithHadoop
getFamiliarWithHadoop
AmirReza Mohammadi
Hadoop fault tolerance
Hadoop fault tolerance
Pallav Jha
Sector Sphere 2009
Sector Sphere 2009
lilyco
HUG Nov 2010: HDFS Raid - Facebook
HUG Nov 2010: HDFS Raid - Facebook
Yahoo Developer Network
Hdfs high availability
Hdfs high availability
Hadoop User Group
Hadoop training-in-hyderabad
Hadoop training-in-hyderabad
sreehari orienit
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
DataStax Academy
Spark vs storm
Spark vs storm
Trong Ton
S4: Distributed Stream Computing Platform
S4: Distributed Stream Computing Platform
Farzad Nozarian
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Data Con LA
Presentation on Hadoop Technology
Presentation on Hadoop Technology
OpenDev
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Sid Anand
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Jen Aman
Yahoo compares Storm and Spark
Yahoo compares Storm and Spark
Chicago Hadoop Users Group
Document Similarity with Cloud Computing
Document Similarity with Cloud Computing
Bryan Bende
Distributed Caching - Cache Unleashed
Distributed Caching - Cache Unleashed
Avishek Patra
Tendances
(19)
Jstorm introduction-0.9.6
Jstorm introduction-0.9.6
Hadoop
Hadoop
PHP Backends for Real-Time User Interaction using Apache Storm.
PHP Backends for Real-Time User Interaction using Apache Storm.
getFamiliarWithHadoop
getFamiliarWithHadoop
Hadoop fault tolerance
Hadoop fault tolerance
Sector Sphere 2009
Sector Sphere 2009
HUG Nov 2010: HDFS Raid - Facebook
HUG Nov 2010: HDFS Raid - Facebook
Hdfs high availability
Hdfs high availability
Hadoop training-in-hyderabad
Hadoop training-in-hyderabad
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
IBM Spark Technology Center: Real-time Advanced Analytics and Machine Learnin...
Spark vs storm
Spark vs storm
S4: Distributed Stream Computing Platform
S4: Distributed Stream Computing Platform
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Real time big data analytics with Storm by Ron Bodkin of Think Big Analytics
Presentation on Hadoop Technology
Presentation on Hadoop Technology
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Building & Operating High-Fidelity Data Streams - QCon Plus 2021
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Deep Learning on Apache® Spark™ : Workflows and Best Practices
Yahoo compares Storm and Spark
Yahoo compares Storm and Spark
Document Similarity with Cloud Computing
Document Similarity with Cloud Computing
Distributed Caching - Cache Unleashed
Distributed Caching - Cache Unleashed
En vedette
Handling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
Directi Group
Hpts 2011 flexible_oltp
Hpts 2011 flexible_oltp
Jags Ramnarayan
Reduce Side Joins
Reduce Side Joins
Edureka!
Introduction to Tokenization
Introduction to Tokenization
Nabeel Yoosuf
Denormalization
Denormalization
Sohail Haider
Efficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data Sets
Pradeeban Kathiravelu, Ph.D.
What is Payment Tokenization?
What is Payment Tokenization?
Rambus Inc
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
DataStax Academy
Overview of AWS Services for your Enterprise
Overview of AWS Services for your Enterprise
Blazeclan Technologies Private Limited
Tuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreduce
datasalt
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
Vladi Vexler
En vedette
(11)
Handling Data in Mega Scale Systems
Handling Data in Mega Scale Systems
Hpts 2011 flexible_oltp
Hpts 2011 flexible_oltp
Reduce Side Joins
Reduce Side Joins
Introduction to Tokenization
Introduction to Tokenization
Denormalization
Denormalization
Efficient Duplicate Detection Over Massive Data Sets
Efficient Duplicate Detection Over Massive Data Sets
What is Payment Tokenization?
What is Payment Tokenization?
Cassandra @ Sony: The good, the bad, and the ugly part 2
Cassandra @ Sony: The good, the bad, and the ugly part 2
Overview of AWS Services for your Enterprise
Overview of AWS Services for your Enterprise
Tuple map reduce: beyond classic mapreduce
Tuple map reduce: beyond classic mapreduce
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
MySQL Visual Analysis and Scale-out Strategy definition - Webinar deck
Similaire à Handling Data in Mega Scale Web Systems
Front Range PHP NoSQL Databases
Front Range PHP NoSQL Databases
Jon Meredith
Modeling data and best practices for the Azure Cosmos DB.
Modeling data and best practices for the Azure Cosmos DB.
Mohammad Asif
Azure Cosmos DB - Technical Deep Dive
Azure Cosmos DB - Technical Deep Dive
Andre Essing
Distributed Systems: scalability and high availability
Distributed Systems: scalability and high availability
Renato Lucindo
Pnuts
Pnuts
Ruchika Mehresh
PNUTS
PNUTS
Ruchika Mehresh
Pnuts Review
Pnuts Review
Ruchika Mehresh
Cloud storage
Cloud storage
Zeeshan Bilal
CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduce
J Singh
Basics of Distributed Systems - Distributed Storage
Basics of Distributed Systems - Distributed Storage
Nilesh Salpe
Tech-Spark: Exploring the Cosmos DB
Tech-Spark: Exploring the Cosmos DB
Ralph Attard
Design Patterns For Distributed NO-reational databases
Design Patterns For Distributed NO-reational databases
lovingprince58
Big data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real time
Itai Yaffe
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
Naoki (Neo) SATO
17-NoSQL.pptx
17-NoSQL.pptx
levichan1
NOSQL Database: Apache Cassandra
NOSQL Database: Apache Cassandra
Folio3 Software
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Amazon Web Services
MYSQL
MYSQL
gilashikwa
Design Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational Databases
guestdfd1ec
Need for Time series Database
Need for Time series Database
Pramit Choudhary
Similaire à Handling Data in Mega Scale Web Systems
(20)
Front Range PHP NoSQL Databases
Front Range PHP NoSQL Databases
Modeling data and best practices for the Azure Cosmos DB.
Modeling data and best practices for the Azure Cosmos DB.
Azure Cosmos DB - Technical Deep Dive
Azure Cosmos DB - Technical Deep Dive
Distributed Systems: scalability and high availability
Distributed Systems: scalability and high availability
Pnuts
Pnuts
PNUTS
PNUTS
Pnuts Review
Pnuts Review
Cloud storage
Cloud storage
CS 542 Parallel DBs, NoSQL, MapReduce
CS 542 Parallel DBs, NoSQL, MapReduce
Basics of Distributed Systems - Distributed Storage
Basics of Distributed Systems - Distributed Storage
Tech-Spark: Exploring the Cosmos DB
Tech-Spark: Exploring the Cosmos DB
Design Patterns For Distributed NO-reational databases
Design Patterns For Distributed NO-reational databases
Big data serving: Processing and inference at scale in real time
Big data serving: Processing and inference at scale in real time
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
[db tech showcase Tokyo 2019] Azure Cosmos DB Deep Dive ~ Partitioning, Globa...
17-NoSQL.pptx
17-NoSQL.pptx
NOSQL Database: Apache Cassandra
NOSQL Database: Apache Cassandra
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
Getting Started with Amazon Redshift - AWS July 2016 Webinar Series
MYSQL
MYSQL
Design Patterns for Distributed Non-Relational Databases
Design Patterns for Distributed Non-Relational Databases
Need for Time series Database
Need for Time series Database
Dernier
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
Dilum Bandara
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
mohitsingh558521
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
Alfredo García Lavilla
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
LoriGlavin3
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Sergiu Bodiu
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
BookNet Canada
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
LoriGlavin3
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
Kalema Edgar
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
Stephanie Beckett
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
gvaughan
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
LoriGlavin3
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
Alan Dix
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
Dubai Multi Commodity Centre
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Mark Simos
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
2toLead Limited
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
MounikaPolabathina
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
LoriGlavin3
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Curtis Poe
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
Florian Wilhelm
Dernier
(20)
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
Handling Data in Mega Scale Web Systems
1.
Vineet Gupta |
GM – Software Engineering | Directi http://www.vineetgupta.com Licensed under Creative Commons Attribution Sharealike Noncommercial Intelligent People. Uncommon Ideas.
2.
3.
4.
5.
6.
7.
8.
Host App Server
DB Server RAM CPU CPU CPU RAM RAM
9.
Sunfire X4640 M2
8 x 6-core 2.6 GHz $ 27k to $ 170k PowerEdge R200 Dual core 2.8 GHz Around $ 550
10.
11.
T1, T2, T3,
T4 App Layer
12.
13.
14.
15.
16.
17.
18.
T1, T2, T3,
T4, T5 App Layer
19.
20.
T3 App Layer
T4 T5 T2 T1 First million rows T3 T4 T5 T2 T1 Second million rows T3 T4 T5 T2 T1 Third million rows
21.
22.
Source:
http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.20.1495
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
41.
42.
43.
44.
45.
46.
47.
48.
49.
50.
51.
52.
53.
54.
Intelligent People. Uncommon
Ideas. Licensed under Creative Commons Attribution Sharealike Noncommercial
Télécharger maintenant