SlideShare une entreprise Scribd logo
1  sur  36
Télécharger pour lire hors ligne
Googol records (with MySQL)
  IPC | October 2008 | Alex Aulbach
Definition: Googol



                       10100

                        or

10 000 000 000 000 000 000 000 000 000 000 000 000
 000 000 000 000 000 000 000 000 000 000 000 000
       000 000 000 000 000 000 000 000 000

                        or

              “Imaginable big number”


                                                     „Googol records“
                                                     © MAYFLOWER GmbH 2008 2
Overview



What will the future bring for databases?

Is the principal way to access data the best?

Patterns (or suggestions)

and showing how that could work with MySQL.

Discuss!




                                                „Googol records“
                                                © MAYFLOWER GmbH 2008 3
The (performance) future of the web


Only 10-20 % of world population are “in the Internet”.




How should it be with 80 % ?

                                                          „Googol records“
                                                          © MAYFLOWER GmbH 2008 4
The (performance) future of the web


World population is growing and people get older.




                                                    „Googol records“
                                                    © MAYFLOWER GmbH 2008 5
The (performance) future of the web


More specialized databases

More ways to access them

Much easier to access

Sharing knowledge vs. closed knowledge: Who wins?

Services become more dependent to others

The web grows faster than Moores Law!
 (Moores Law: “Only” Factor 1000 in 20 years.)


                                                    „Googol records“
                                                    © MAYFLOWER GmbH 2008 6
What does this mean us?



We will surely come into problems

But cannot say when, where and why

No Boss. The data belongs to everyone

It’s like new roads

                It’s “Real-live”!




                                        „Googol records“
                                        © MAYFLOWER GmbH 2008 7
Consequences of growth


New hardware will no longer solve speed problems

Even new database will not

Even a rewrite of the application won’t




      Need to rethink the problems from scratch!




                                                   „Googol records“
                                                   © MAYFLOWER GmbH 2008 8
Of course...


… need for splitting, sharding, partitioning, cluster etc.

… need to plan growth from beginning of the project.

… hardware resources can no longer be planned.

… distinct importance of data.

… estimate instead of being correct.




                                                             „Googol records“
                                                             © MAYFLOWER GmbH 2008 9
But ...




          Is this enough?



                            „Googol records“
                            © MAYFLOWER GmbH 2008 10
Patterns (or better: suggestions)


Brain storage engine.

Reading differs from writing.

Redundancy and specialization.

The storage itself can keep the information.

Time (and sleep).

The journey is the reward.




                                               „Googol records“
                                               © MAYFLOWER GmbH 2008 11
1 :: Brain storage engine :: 1


Short term memory (working memory)

  Unsorted, unfiltered, any data
  Fast read
  Very much fast updates/changes
  Remembers which data is changed/invalid
  Limited




                                            „Googol records“
                                            © MAYFLOWER GmbH 2008 12
1 :: Brain storage engine :: 2


Long-term memory

  Presorted, well filtered data
  Unlimited (well, more or less)
  Extremely fast read access (sometimes)
  Updates/inserts by repeating in working memory
  Sleep helps to better store




                                                   „Googol records“
                                                   © MAYFLOWER GmbH 2008 13
How does that model fit into real life?


Nobody awaits to find old things fast

Telephone-books

90/10-Problems




                                          „Googol records“
                                          © MAYFLOWER GmbH 2008 14
Show


Searching in long term memory.
Scaling of working/long-term memory
                   vs.
 one table with inserts/updates/deletes.




                                           „Googol records“
                                           © MAYFLOWER GmbH 2008 15
2 :: Reading differs from writing


Look at the physical processes

Reading with the fingertips:
 No read and write at the same time

Handling reading and writing as different aspects of the
 same thing is a compromise

Only specialization enables good optimization




                                                           „Googol records“
                                                           © MAYFLOWER GmbH 2008 16
Reader/Writer: Simplest layout




                                 „Googol records“
                                 © MAYFLOWER GmbH 2008 17
The web as storage?




                      „Googol records“
                      © MAYFLOWER GmbH 2008 18
Web can work like this




                         „Googol records“
                         © MAYFLOWER GmbH 2008 19
Recursive definition of the catalog




                                      „Googol records“
                                      © MAYFLOWER GmbH 2008 20
Scaling, setup as “black box”




                                „Googol records“
                                © MAYFLOWER GmbH 2008 21
Share everything




                   „Googol records“
                   © MAYFLOWER GmbH 2008 22
Comments


How does this scale?

What doesn’t work with this?




                               „Googol records“
                               © MAYFLOWER GmbH 2008 23
3 :: Redundancy and specialization :: 1


We cannot backup a googol

Nobody needs backup, but everybody needs to restore




                                                      „Googol records“
                                                      © MAYFLOWER GmbH 2008 24
3 :: Redundancy and specialization :: 2



Redundancy:

        Store the information on many places
        Store more important information on more places

Specialization:
         “Materialized views”
         EAV modeling and pivoting
         Take ideas from data warehouses
         and repositories



                                                          „Googol records“
                                                          © MAYFLOWER GmbH 2008 25
3 :: Redundancy and specialization :: 3




The wheel comes full circle:

   More important: more access.
   More access: More need for redundancy.
   More redundancy: more speed and reliability.
   More speed and reliability: more important.




                                                  „Googol records“
                                                  © MAYFLOWER GmbH 2008 26
Implementation with Reader/Writer




                                    „Googol records“
                                    © MAYFLOWER GmbH 2008 27
4 :: The storage itself
     can keep the information.



 “A storage has always physical limitations.
 A logical information of data which belongs together
 doesn't have any physical limitations.”

                             Alex Aulbach, Sept. 2008




                                                        „Googol records“
                                                        © MAYFLOWER GmbH 2008 28
The index is the problem!


The googol-universe is limited.
The index can take “half of the galaxies”.
Only the “rest” can be used for the data.

Less index means:
   Faster search in the “needed” index.
   Less time to write data and index.
   Less time to warm up.
   More space for the records.




                                             „Googol records“
                                             © MAYFLOWER GmbH 2008 29
Show


Access full table or split data into several parts.
   Index-size
   Write
   Presorted tables




                                                      „Googol records“
                                                      © MAYFLOWER GmbH 2008 30
5 :: Time (and sleep) :: 1


Human brain: Only three bits per second!

We all have been babies.

Trust! Just wait and see.

Developers (and customers) need to think in decades
 not in days till to the project-end.




                                                      „Googol records“
                                                      © MAYFLOWER GmbH 2008 31
5 :: Sleep (and time) :: 2


Again human brain: Learns while sleeping!
  Why not apply this for databases?

Premise: Redundancy!
   Dolphins sleep only with one hemisphere at a time.

The wheel comes full circle
   Redundancy.
   Distinct read and write.




                                                        „Googol records“
                                                        © MAYFLOWER GmbH 2008 32
Show


Well, I can’t show this, because it takes … time.




                                                    „Googol records“
                                                    © MAYFLOWER GmbH 2008 33
6 :: The journey is the reward


Future: Not so important how to search, but where.

Store step by step where to find the result, not the result.

You can find faster ways only by trying a shortcut.



It comes full circle:
    Search many different ways and take the fastest.
    While sleeping try out new things (dreaming).




                                                               „Googol records“
                                                               © MAYFLOWER GmbH 2008 34
Conclusion


Dreams may come true while sleeping.

We must invent now the tools
 to solve the problems of the future.

Speed is not a matter of hardware
 but of how things are done.

Never take speed as stated:
 In a googol-universe wormholes exists!

Moores Law may help, but do not trust em.



                                            „Googol records“
                                            © MAYFLOWER GmbH 2008 35
Thank you!


  Alex Aulbach
  Mayflower GmbH
  Pleichertorstr. 2
  97070 Würzburg, Germany
  +49 (931) 35 9 65 - 0
  alex.aulbach@mayflower.de

Contenu connexe

En vedette

Surviving architecture
Surviving architectureSurviving architecture
Surviving architectureMayflower GmbH
 
Android / iPhone Apps Mit Titanium Developer
Android / iPhone Apps Mit Titanium DeveloperAndroid / iPhone Apps Mit Titanium Developer
Android / iPhone Apps Mit Titanium DeveloperMayflower GmbH
 
Professional Refactoring
Professional RefactoringProfessional Refactoring
Professional RefactoringMayflower GmbH
 
Yii - Next level PHP Framework von Florian Facker
Yii - Next level PHP Framework von Florian FackerYii - Next level PHP Framework von Florian Facker
Yii - Next level PHP Framework von Florian FackerMayflower GmbH
 
Zend Framework and Dojo
Zend Framework and DojoZend Framework and Dojo
Zend Framework and DojoMayflower GmbH
 
Mit nginx und FastCGI skalieren
Mit nginx und FastCGI skalierenMit nginx und FastCGI skalieren
Mit nginx und FastCGI skalierenMayflower GmbH
 
Migration Concepts For Enterprise PHP Applications
Migration Concepts For Enterprise PHP ApplicationsMigration Concepts For Enterprise PHP Applications
Migration Concepts For Enterprise PHP ApplicationsMayflower GmbH
 
Native Cross-Platform-Apps mit Titanium Mobile und Alloy
Native Cross-Platform-Apps mit Titanium Mobile und AlloyNative Cross-Platform-Apps mit Titanium Mobile und Alloy
Native Cross-Platform-Apps mit Titanium Mobile und AlloyMayflower GmbH
 
Building a Cloud-based Social Network with Zend Framework and Doctrine 2
Building a Cloud-based Social Network with Zend Framework and Doctrine 2Building a Cloud-based Social Network with Zend Framework and Doctrine 2
Building a Cloud-based Social Network with Zend Framework and Doctrine 2Mayflower GmbH
 
Request Lifecycle im Zend Framework
Request Lifecycle im Zend FrameworkRequest Lifecycle im Zend Framework
Request Lifecycle im Zend FrameworkMayflower GmbH
 

En vedette (11)

Surviving architecture
Surviving architectureSurviving architecture
Surviving architecture
 
Android / iPhone Apps Mit Titanium Developer
Android / iPhone Apps Mit Titanium DeveloperAndroid / iPhone Apps Mit Titanium Developer
Android / iPhone Apps Mit Titanium Developer
 
Professional Refactoring
Professional RefactoringProfessional Refactoring
Professional Refactoring
 
Yii - Next level PHP Framework von Florian Facker
Yii - Next level PHP Framework von Florian FackerYii - Next level PHP Framework von Florian Facker
Yii - Next level PHP Framework von Florian Facker
 
Zend Framework and Dojo
Zend Framework and DojoZend Framework and Dojo
Zend Framework and Dojo
 
Unit Test Fun
Unit Test FunUnit Test Fun
Unit Test Fun
 
Mit nginx und FastCGI skalieren
Mit nginx und FastCGI skalierenMit nginx und FastCGI skalieren
Mit nginx und FastCGI skalieren
 
Migration Concepts For Enterprise PHP Applications
Migration Concepts For Enterprise PHP ApplicationsMigration Concepts For Enterprise PHP Applications
Migration Concepts For Enterprise PHP Applications
 
Native Cross-Platform-Apps mit Titanium Mobile und Alloy
Native Cross-Platform-Apps mit Titanium Mobile und AlloyNative Cross-Platform-Apps mit Titanium Mobile und Alloy
Native Cross-Platform-Apps mit Titanium Mobile und Alloy
 
Building a Cloud-based Social Network with Zend Framework and Doctrine 2
Building a Cloud-based Social Network with Zend Framework and Doctrine 2Building a Cloud-based Social Network with Zend Framework and Doctrine 2
Building a Cloud-based Social Network with Zend Framework and Doctrine 2
 
Request Lifecycle im Zend Framework
Request Lifecycle im Zend FrameworkRequest Lifecycle im Zend Framework
Request Lifecycle im Zend Framework
 

Similaire à Googol Data

Gluecon miller horizon
Gluecon miller horizonGluecon miller horizon
Gluecon miller horizonMike Miller
 
Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017
Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017
Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017Codemotion
 
Christophe Jolif - Flex Data Visualization going one step further with IBM IL...
Christophe Jolif - Flex Data Visualization going one step further with IBM IL...Christophe Jolif - Flex Data Visualization going one step further with IBM IL...
Christophe Jolif - Flex Data Visualization going one step further with IBM IL...360|Conferences
 
When Conversion Makes Sense Following the Trends: Is your content ready?
When Conversion Makes Sense Following the Trends: Is your content ready?When Conversion Makes Sense Following the Trends: Is your content ready?
When Conversion Makes Sense Following the Trends: Is your content ready?dclsocialmedia
 
Stopping Storage Hardware Sprawl
Stopping Storage Hardware SprawlStopping Storage Hardware Sprawl
Stopping Storage Hardware SprawlStorage Switzerland
 
An Introduction To Monitoring With Nagios PowerPoint Presentation Slides
An Introduction To Monitoring With Nagios PowerPoint Presentation SlidesAn Introduction To Monitoring With Nagios PowerPoint Presentation Slides
An Introduction To Monitoring With Nagios PowerPoint Presentation SlidesSlideTeam
 
Nimble storage
Nimble storageNimble storage
Nimble storagedvmug1
 
Live CEO Interview and Webinar Update on the State of Deduplication
 Live CEO Interview and Webinar Update on the State of Deduplication Live CEO Interview and Webinar Update on the State of Deduplication
Live CEO Interview and Webinar Update on the State of DeduplicationStorage Switzerland
 
Nagios An Open Source Network Management System Powerpoint Presentation Slides
Nagios An Open Source Network Management System Powerpoint Presentation SlidesNagios An Open Source Network Management System Powerpoint Presentation Slides
Nagios An Open Source Network Management System Powerpoint Presentation SlidesSlideTeam
 
Enterprise Search Summit - Speeding Up Search
Enterprise Search Summit - Speeding Up SearchEnterprise Search Summit - Speeding Up Search
Enterprise Search Summit - Speeding Up SearchAzul Systems Inc.
 
All Flash Arrays: Transforming storage, data center economics and business p...
All Flash Arrays:  Transforming storage, data center economics and business p...All Flash Arrays:  Transforming storage, data center economics and business p...
All Flash Arrays: Transforming storage, data center economics and business p...Violin Memory
 
Jonathon Rochelle @ FOWA Feb 07
Jonathon Rochelle @ FOWA Feb 07Jonathon Rochelle @ FOWA Feb 07
Jonathon Rochelle @ FOWA Feb 07carsonsystems
 
Murli Thirumale, CEO Ocarina Networks
Murli Thirumale, CEO Ocarina NetworksMurli Thirumale, CEO Ocarina Networks
Murli Thirumale, CEO Ocarina NetworksEntrepreneurTrek
 
The New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the CloudThe New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the CloudInside Analysis
 
Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014
Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014
Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014Dave Stokes
 
Developing Robust IoT Gateway Applications from Building Blocks
Developing Robust IoT Gateway Applications from Building BlocksDeveloping Robust IoT Gateway Applications from Building Blocks
Developing Robust IoT Gateway Applications from Building BlocksFrank Alexander Kraemer
 
System Monitoring With Nagios PowerPoint Presentation Slides
System Monitoring With Nagios PowerPoint Presentation SlidesSystem Monitoring With Nagios PowerPoint Presentation Slides
System Monitoring With Nagios PowerPoint Presentation SlidesSlideTeam
 

Similaire à Googol Data (20)

Gluecon miller horizon
Gluecon miller horizonGluecon miller horizon
Gluecon miller horizon
 
Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017
Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017
Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017
 
Javascript Library
Javascript LibraryJavascript Library
Javascript Library
 
Cohodatawebinar
Cohodatawebinar Cohodatawebinar
Cohodatawebinar
 
Christophe Jolif - Flex Data Visualization going one step further with IBM IL...
Christophe Jolif - Flex Data Visualization going one step further with IBM IL...Christophe Jolif - Flex Data Visualization going one step further with IBM IL...
Christophe Jolif - Flex Data Visualization going one step further with IBM IL...
 
When Conversion Makes Sense Following the Trends: Is your content ready?
When Conversion Makes Sense Following the Trends: Is your content ready?When Conversion Makes Sense Following the Trends: Is your content ready?
When Conversion Makes Sense Following the Trends: Is your content ready?
 
Stopping Storage Hardware Sprawl
Stopping Storage Hardware SprawlStopping Storage Hardware Sprawl
Stopping Storage Hardware Sprawl
 
An Introduction To Monitoring With Nagios PowerPoint Presentation Slides
An Introduction To Monitoring With Nagios PowerPoint Presentation SlidesAn Introduction To Monitoring With Nagios PowerPoint Presentation Slides
An Introduction To Monitoring With Nagios PowerPoint Presentation Slides
 
Nimble storage
Nimble storageNimble storage
Nimble storage
 
Live CEO Interview and Webinar Update on the State of Deduplication
 Live CEO Interview and Webinar Update on the State of Deduplication Live CEO Interview and Webinar Update on the State of Deduplication
Live CEO Interview and Webinar Update on the State of Deduplication
 
Nagios An Open Source Network Management System Powerpoint Presentation Slides
Nagios An Open Source Network Management System Powerpoint Presentation SlidesNagios An Open Source Network Management System Powerpoint Presentation Slides
Nagios An Open Source Network Management System Powerpoint Presentation Slides
 
Enterprise Search Summit - Speeding Up Search
Enterprise Search Summit - Speeding Up SearchEnterprise Search Summit - Speeding Up Search
Enterprise Search Summit - Speeding Up Search
 
All Flash Arrays: Transforming storage, data center economics and business p...
All Flash Arrays:  Transforming storage, data center economics and business p...All Flash Arrays:  Transforming storage, data center economics and business p...
All Flash Arrays: Transforming storage, data center economics and business p...
 
Hello Gumbo
Hello GumboHello Gumbo
Hello Gumbo
 
Jonathon Rochelle @ FOWA Feb 07
Jonathon Rochelle @ FOWA Feb 07Jonathon Rochelle @ FOWA Feb 07
Jonathon Rochelle @ FOWA Feb 07
 
Murli Thirumale, CEO Ocarina Networks
Murli Thirumale, CEO Ocarina NetworksMurli Thirumale, CEO Ocarina Networks
Murli Thirumale, CEO Ocarina Networks
 
The New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the CloudThe New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the Cloud
 
Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014
Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014
Scaling MySQl 1 to N Servers -- Los Angelese MySQL User Group Feb 2014
 
Developing Robust IoT Gateway Applications from Building Blocks
Developing Robust IoT Gateway Applications from Building BlocksDeveloping Robust IoT Gateway Applications from Building Blocks
Developing Robust IoT Gateway Applications from Building Blocks
 
System Monitoring With Nagios PowerPoint Presentation Slides
System Monitoring With Nagios PowerPoint Presentation SlidesSystem Monitoring With Nagios PowerPoint Presentation Slides
System Monitoring With Nagios PowerPoint Presentation Slides
 

Plus de Mayflower GmbH

Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...
Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...
Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...Mayflower GmbH
 
JavaScript Days 2015: Security
JavaScript Days 2015: SecurityJavaScript Days 2015: Security
JavaScript Days 2015: SecurityMayflower GmbH
 
Vom Entwickler zur Führungskraft
Vom Entwickler zur FührungskraftVom Entwickler zur Führungskraft
Vom Entwickler zur FührungskraftMayflower GmbH
 
Salt and pepper — native code in the browser Browser using Google native Client
Salt and pepper — native code in the browser Browser using Google native ClientSalt and pepper — native code in the browser Browser using Google native Client
Salt and pepper — native code in the browser Browser using Google native ClientMayflower GmbH
 
Plugging holes — javascript memory leak debugging
Plugging holes — javascript memory leak debuggingPlugging holes — javascript memory leak debugging
Plugging holes — javascript memory leak debuggingMayflower GmbH
 
50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...
50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...
50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...Mayflower GmbH
 
Pair Programming Mythbusters
Pair Programming MythbustersPair Programming Mythbusters
Pair Programming MythbustersMayflower GmbH
 
Shoeism - Frau im Glück
Shoeism - Frau im GlückShoeism - Frau im Glück
Shoeism - Frau im GlückMayflower GmbH
 
Bessere Software schneller liefern
Bessere Software schneller liefernBessere Software schneller liefern
Bessere Software schneller liefernMayflower GmbH
 
Von 0 auf 100 in 2 Sprints
Von 0 auf 100 in 2 SprintsVon 0 auf 100 in 2 Sprints
Von 0 auf 100 in 2 SprintsMayflower GmbH
 
Piwik anpassen und skalieren
Piwik anpassen und skalierenPiwik anpassen und skalieren
Piwik anpassen und skalierenMayflower GmbH
 
Agilitaet im E-Commerce - E-Commerce Breakfast
Agilitaet im E-Commerce - E-Commerce BreakfastAgilitaet im E-Commerce - E-Commerce Breakfast
Agilitaet im E-Commerce - E-Commerce BreakfastMayflower GmbH
 
Mongo DB - Segen oder Fluch
Mongo DB - Segen oder FluchMongo DB - Segen oder Fluch
Mongo DB - Segen oder FluchMayflower GmbH
 

Plus de Mayflower GmbH (20)

Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...
Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...
Mit Maintenance umgehen können- Fixt du noch Bugs oder lieferst du schon neue...
 
Why and what is go
Why and what is goWhy and what is go
Why and what is go
 
Agile Anti-Patterns
Agile Anti-PatternsAgile Anti-Patterns
Agile Anti-Patterns
 
JavaScript Days 2015: Security
JavaScript Days 2015: SecurityJavaScript Days 2015: Security
JavaScript Days 2015: Security
 
Vom Entwickler zur Führungskraft
Vom Entwickler zur FührungskraftVom Entwickler zur Führungskraft
Vom Entwickler zur Führungskraft
 
Produktive teams
Produktive teamsProduktive teams
Produktive teams
 
Salt and pepper — native code in the browser Browser using Google native Client
Salt and pepper — native code in the browser Browser using Google native ClientSalt and pepper — native code in the browser Browser using Google native Client
Salt and pepper — native code in the browser Browser using Google native Client
 
Plugging holes — javascript memory leak debugging
Plugging holes — javascript memory leak debuggingPlugging holes — javascript memory leak debugging
Plugging holes — javascript memory leak debugging
 
Usability im web
Usability im webUsability im web
Usability im web
 
Rewrites überleben
Rewrites überlebenRewrites überleben
Rewrites überleben
 
JavaScript Security
JavaScript SecurityJavaScript Security
JavaScript Security
 
50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...
50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...
50 mal produktiver - oder warum ich gute Teams brauche und nicht gute Entwick...
 
Responsive Webdesign
Responsive WebdesignResponsive Webdesign
Responsive Webdesign
 
Pair Programming Mythbusters
Pair Programming MythbustersPair Programming Mythbusters
Pair Programming Mythbusters
 
Shoeism - Frau im Glück
Shoeism - Frau im GlückShoeism - Frau im Glück
Shoeism - Frau im Glück
 
Bessere Software schneller liefern
Bessere Software schneller liefernBessere Software schneller liefern
Bessere Software schneller liefern
 
Von 0 auf 100 in 2 Sprints
Von 0 auf 100 in 2 SprintsVon 0 auf 100 in 2 Sprints
Von 0 auf 100 in 2 Sprints
 
Piwik anpassen und skalieren
Piwik anpassen und skalierenPiwik anpassen und skalieren
Piwik anpassen und skalieren
 
Agilitaet im E-Commerce - E-Commerce Breakfast
Agilitaet im E-Commerce - E-Commerce BreakfastAgilitaet im E-Commerce - E-Commerce Breakfast
Agilitaet im E-Commerce - E-Commerce Breakfast
 
Mongo DB - Segen oder Fluch
Mongo DB - Segen oder FluchMongo DB - Segen oder Fluch
Mongo DB - Segen oder Fluch
 

Dernier

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 

Dernier (20)

DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 

Googol Data

  • 1. Googol records (with MySQL) IPC | October 2008 | Alex Aulbach
  • 2. Definition: Googol 10100 or 10 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 000 or “Imaginable big number” „Googol records“ © MAYFLOWER GmbH 2008 2
  • 3. Overview What will the future bring for databases? Is the principal way to access data the best? Patterns (or suggestions) and showing how that could work with MySQL. Discuss! „Googol records“ © MAYFLOWER GmbH 2008 3
  • 4. The (performance) future of the web Only 10-20 % of world population are “in the Internet”. How should it be with 80 % ? „Googol records“ © MAYFLOWER GmbH 2008 4
  • 5. The (performance) future of the web World population is growing and people get older. „Googol records“ © MAYFLOWER GmbH 2008 5
  • 6. The (performance) future of the web More specialized databases More ways to access them Much easier to access Sharing knowledge vs. closed knowledge: Who wins? Services become more dependent to others The web grows faster than Moores Law! (Moores Law: “Only” Factor 1000 in 20 years.) „Googol records“ © MAYFLOWER GmbH 2008 6
  • 7. What does this mean us? We will surely come into problems But cannot say when, where and why No Boss. The data belongs to everyone It’s like new roads It’s “Real-live”! „Googol records“ © MAYFLOWER GmbH 2008 7
  • 8. Consequences of growth New hardware will no longer solve speed problems Even new database will not Even a rewrite of the application won’t Need to rethink the problems from scratch! „Googol records“ © MAYFLOWER GmbH 2008 8
  • 9. Of course... … need for splitting, sharding, partitioning, cluster etc. … need to plan growth from beginning of the project. … hardware resources can no longer be planned. … distinct importance of data. … estimate instead of being correct. „Googol records“ © MAYFLOWER GmbH 2008 9
  • 10. But ... Is this enough? „Googol records“ © MAYFLOWER GmbH 2008 10
  • 11. Patterns (or better: suggestions) Brain storage engine. Reading differs from writing. Redundancy and specialization. The storage itself can keep the information. Time (and sleep). The journey is the reward. „Googol records“ © MAYFLOWER GmbH 2008 11
  • 12. 1 :: Brain storage engine :: 1 Short term memory (working memory) Unsorted, unfiltered, any data Fast read Very much fast updates/changes Remembers which data is changed/invalid Limited „Googol records“ © MAYFLOWER GmbH 2008 12
  • 13. 1 :: Brain storage engine :: 2 Long-term memory Presorted, well filtered data Unlimited (well, more or less) Extremely fast read access (sometimes) Updates/inserts by repeating in working memory Sleep helps to better store „Googol records“ © MAYFLOWER GmbH 2008 13
  • 14. How does that model fit into real life? Nobody awaits to find old things fast Telephone-books 90/10-Problems „Googol records“ © MAYFLOWER GmbH 2008 14
  • 15. Show Searching in long term memory. Scaling of working/long-term memory vs. one table with inserts/updates/deletes. „Googol records“ © MAYFLOWER GmbH 2008 15
  • 16. 2 :: Reading differs from writing Look at the physical processes Reading with the fingertips: No read and write at the same time Handling reading and writing as different aspects of the same thing is a compromise Only specialization enables good optimization „Googol records“ © MAYFLOWER GmbH 2008 16
  • 17. Reader/Writer: Simplest layout „Googol records“ © MAYFLOWER GmbH 2008 17
  • 18. The web as storage? „Googol records“ © MAYFLOWER GmbH 2008 18
  • 19. Web can work like this „Googol records“ © MAYFLOWER GmbH 2008 19
  • 20. Recursive definition of the catalog „Googol records“ © MAYFLOWER GmbH 2008 20
  • 21. Scaling, setup as “black box” „Googol records“ © MAYFLOWER GmbH 2008 21
  • 22. Share everything „Googol records“ © MAYFLOWER GmbH 2008 22
  • 23. Comments How does this scale? What doesn’t work with this? „Googol records“ © MAYFLOWER GmbH 2008 23
  • 24. 3 :: Redundancy and specialization :: 1 We cannot backup a googol Nobody needs backup, but everybody needs to restore „Googol records“ © MAYFLOWER GmbH 2008 24
  • 25. 3 :: Redundancy and specialization :: 2 Redundancy: Store the information on many places Store more important information on more places Specialization: “Materialized views” EAV modeling and pivoting Take ideas from data warehouses and repositories „Googol records“ © MAYFLOWER GmbH 2008 25
  • 26. 3 :: Redundancy and specialization :: 3 The wheel comes full circle: More important: more access. More access: More need for redundancy. More redundancy: more speed and reliability. More speed and reliability: more important. „Googol records“ © MAYFLOWER GmbH 2008 26
  • 27. Implementation with Reader/Writer „Googol records“ © MAYFLOWER GmbH 2008 27
  • 28. 4 :: The storage itself can keep the information. “A storage has always physical limitations. A logical information of data which belongs together doesn't have any physical limitations.” Alex Aulbach, Sept. 2008 „Googol records“ © MAYFLOWER GmbH 2008 28
  • 29. The index is the problem! The googol-universe is limited. The index can take “half of the galaxies”. Only the “rest” can be used for the data. Less index means: Faster search in the “needed” index. Less time to write data and index. Less time to warm up. More space for the records. „Googol records“ © MAYFLOWER GmbH 2008 29
  • 30. Show Access full table or split data into several parts. Index-size Write Presorted tables „Googol records“ © MAYFLOWER GmbH 2008 30
  • 31. 5 :: Time (and sleep) :: 1 Human brain: Only three bits per second! We all have been babies. Trust! Just wait and see. Developers (and customers) need to think in decades not in days till to the project-end. „Googol records“ © MAYFLOWER GmbH 2008 31
  • 32. 5 :: Sleep (and time) :: 2 Again human brain: Learns while sleeping! Why not apply this for databases? Premise: Redundancy! Dolphins sleep only with one hemisphere at a time. The wheel comes full circle Redundancy. Distinct read and write. „Googol records“ © MAYFLOWER GmbH 2008 32
  • 33. Show Well, I can’t show this, because it takes … time. „Googol records“ © MAYFLOWER GmbH 2008 33
  • 34. 6 :: The journey is the reward Future: Not so important how to search, but where. Store step by step where to find the result, not the result. You can find faster ways only by trying a shortcut. It comes full circle: Search many different ways and take the fastest. While sleeping try out new things (dreaming). „Googol records“ © MAYFLOWER GmbH 2008 34
  • 35. Conclusion Dreams may come true while sleeping. We must invent now the tools to solve the problems of the future. Speed is not a matter of hardware but of how things are done. Never take speed as stated: In a googol-universe wormholes exists! Moores Law may help, but do not trust em. „Googol records“ © MAYFLOWER GmbH 2008 35
  • 36. Thank you! Alex Aulbach Mayflower GmbH Pleichertorstr. 2 97070 Würzburg, Germany +49 (931) 35 9 65 - 0 alex.aulbach@mayflower.de