ANALYZING BIG DATA IS PROGRAMMING FOR THE CLOUD
Chris Boos (@boosc)
boos@arago.de
CloudCamp Frankfurt, 24.5.2012




Thursday, 24 May 2012
Data, lots of it
Even in simple datasets, common statistics fail (avg, min, max, distribution)
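A minimal sketch of how summary statistics can hide the shape of the data. The two datasets are invented for illustration and are not taken from the talk; they share the same average, minimum, and maximum, yet a simple histogram shows they are distributed very differently.

```python
from collections import Counter
from statistics import mean

# Two hypothetical datasets, invented for illustration only.
uniform = [0, 2, 4, 5, 5, 6, 8, 10]      # values spread across the range
bimodal = [0, 0, 0, 0, 10, 10, 10, 10]   # values piled up at both extremes


def summary(values):
    """Return the common statistics named on the slide: avg, min, max."""
    return (mean(values), min(values), max(values))


def histogram(values, bin_width=5):
    """Count values per bin so the shape of the distribution becomes visible."""
    bins = Counter(v // bin_width * bin_width for v in values)
    return dict(sorted(bins.items()))


print(summary(uniform), summary(bimodal))  # identical summaries
print(histogram(uniform))                  # {0: 3, 5: 4, 10: 1}
print(histogram(bimodal))                  # {0: 4, 10: 4}
```

The summaries cannot tell the two datasets apart; only looking at the distribution itself does.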
One iPhone has 79 times more CPU power than was used in the Apollo missions
Why you need big data

Decade         Discipline               Yield
2010s          Systems Thinking         Wisdom         (You are here!)
2000s          Knowledge Ecology        Intelligence
1990s          Knowledge Management     Knowledge
1970s-1980s    Information Management   Information
1950s-1960s    Data Processing          Data
Finding clusters, evaluating outliers, and interpreting white noise
You are not looking for patterns, you are looking for anomalies
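One possible way to look for anomalies rather than patterns is a robust outlier score. The sketch below uses the modified z-score based on the median absolute deviation; this is a technique chosen here for illustration, not one named in the talk, and the readings and the threshold of 3.5 are assumptions.

```python
from statistics import median


def find_anomalies(values, threshold=3.5):
    """Flag values whose modified z-score exceeds the threshold.

    The score is built on the median and the median absolute deviation (MAD),
    so the anomalies themselves barely distort the baseline.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # More than half the values are identical, so this simple score
        # is undefined; report nothing rather than divide by zero.
        return []
    return [v for v in values if 0.6745 * abs(v - med) / mad > threshold]


# Hypothetical response times in milliseconds, invented for illustration.
readings = [101, 99, 100, 102, 98, 100, 250, 101, 99]
print(find_anomalies(readings))  # [250]
```

A plain mean and standard deviation would be pulled toward the outlier, which is why the median-based variant is used in this sketch.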
Cloud Computing 1.0 is when the IT guys are finally able to explain to business people what they were talking about 20 years ago!
=



Computation on demand + pay as you go
Cloud Computing 2.0 is when the IT guys realize that using this scalable resource also calls for new ways of programming
=



Go beyond IaaS and start thinking parallel
and



BASE (Basically Available, Soft State, Eventual Consistency)
not
ACID (Atomicity, Consistency, Isolation, Durability)
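To make the BASE idea concrete, here is a toy sketch of two replicas that accept writes independently and converge later. The `Replica` class, the last-write-wins rule, and the caller-supplied timestamps are all assumptions made for this illustration; real stores use more careful conflict resolution.

```python
class Replica:
    """A toy key-value replica: always accepts local writes (basically
    available) and reconciles with peers later (soft state, eventual
    consistency)."""

    def __init__(self, name):
        self.name = name
        self.store = {}  # key -> (timestamp, value)

    def write(self, key, value, timestamp):
        current = self.store.get(key)
        # Last-write-wins: keep whichever entry has the newer timestamp.
        if current is None or timestamp > current[0]:
            self.store[key] = (timestamp, value)

    def read(self, key):
        entry = self.store.get(key)
        return entry[1] if entry else None

    def sync_from(self, other):
        """Anti-entropy step: replay every entry the other replica holds."""
        for key, (timestamp, value) in other.store.items():
            self.write(key, value, timestamp)


a, b = Replica("a"), Replica("b")
a.write("likes", 10, timestamp=1)
b.write("likes", 12, timestamp=2)
stale = (a.read("likes"), b.read("likes"))      # replicas disagree for a while
a.sync_from(b)
b.sync_from(a)
converged = (a.read("likes"), b.read("likes"))  # both now hold the newest write
print(stale, converged)  # (10, 12) (12, 12)
```

An ACID system would refuse or block one of the two writes; the BASE trade-off is to accept both and tolerate a temporarily stale read.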

How to scale (AWS example)
• Do not allocate instances manually
• Each component needs to be independent
• Plan for failure
• Actively provoke failure
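The last two points can be sketched in a few lines: one wrapper injects failures on purpose, and one caller is written to survive them. Everything here (`chaotic`, `call_with_retry`, `InjectedFailure`, the 50% failure rate) is hypothetical and runs in a single process; it does not call any AWS API.

```python
import random


class InjectedFailure(Exception):
    """Raised on purpose to simulate a component that has gone away."""


def chaotic(func, failure_rate, rng):
    """Actively provoke failure: wrap func so it fails at the given rate."""
    def wrapper(*args, **kwargs):
        if rng.random() < failure_rate:
            raise InjectedFailure(func.__name__)
        return func(*args, **kwargs)
    return wrapper


def call_with_retry(func, *args, attempts=5):
    """Plan for failure: retry a bounded number of times, then give up."""
    for attempt in range(1, attempts + 1):
        try:
            return func(*args)
        except InjectedFailure:
            if attempt == attempts:
                raise


def square(x):
    return x * x


rng = random.Random(42)  # fixed seed so the run is repeatable
flaky_square = chaotic(square, failure_rate=0.5, rng=rng)
results = [call_with_retry(flaky_square, n, attempts=20) for n in range(5)]
print(results)
```

Because `square` is independent of any shared state, retrying it is safe; that is the practical reason the slide asks for independent components.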

Human Software
• Click workers and Mechanical Turks are not just cheap labour
• They let programmers hand tasks that cannot be handled algorithmically to humans
• Use them to
    • Do things too complicated for machine learning
    • Pre-populate machine learning spaces
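A rough sketch of what treating humans as a software component might look like: the program answers on its own when it is confident and otherwise queues the task for a person, whose answer is kept as labelled data. The classifier stub, the in-memory queue, and the 0.8 confidence cutoff are all assumptions for illustration; no real crowd-work API is used.

```python
from collections import deque

human_queue = deque()    # stand-in for a crowd-work task queue (hypothetical)
training_examples = []   # human-labelled data that can later seed a learner


def machine_guess(text):
    """Hypothetical classifier stub: returns (label, confidence)."""
    if "refund" in text.lower():
        return ("billing", 0.95)
    return ("unknown", 0.30)


def classify(text, min_confidence=0.8):
    """Answer algorithmically when confident, otherwise hand the task to a human."""
    label, confidence = machine_guess(text)
    if confidence >= min_confidence:
        return label
    human_queue.append(text)
    return None  # answer pending until a person responds


def record_human_answer(text, label):
    """Keep the human's label so it can pre-populate a training set."""
    training_examples.append((text, label))


print(classify("Please refund my order"))      # billing
print(classify("My cat sat on the keyboard"))  # None, queued for a human
```

Each answer recorded through `record_human_answer` grows the labelled set, which is one reading of "pre-populate machine learning spaces".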



Old Style (Imperative) Programming
• Step by step explanation what to do
• Explaining WHAT to do rather than RESULTS you want
• Always necessary for basic algorithms
One New Style (Functional) Programming I
• Combine results to become a program
• Allows dynamic distribution
• Map-Reduce is only one way of doing it!
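As one illustration of combining results into a program, here is a small map-reduce word count. The input chunks are invented, and a thread pool stands in for the machines that a real system would distribute the map step across.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def count_words(chunk):
    """Map step: each chunk is counted independently of the others."""
    return Counter(chunk.lower().split())


def merge(left, right):
    """Reduce step: two partial results combine into another partial result."""
    return left + right


# Hypothetical input, invented for illustration.
chunks = ["big data big cloud", "cloud data", "big ideas"]

# The map calls share no state, so the executor is free to run them anywhere.
with ThreadPoolExecutor(max_workers=3) as pool:
    partial_counts = list(pool.map(count_words, chunks))

totals = reduce(merge, partial_counts, Counter())
print(dict(totals))
```

Nothing in `count_words` or `merge` says where or in what order the work happens, which is what makes dynamic distribution possible.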




Functional Programming II

    F(G(H(A, B), C), D)

    getMusicLikes(getFriends(facebookID))

Instead of

    for i in getFriends(facebookID):
        getMusicLikes(i)
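A runnable sketch of the composed form, assuming in-memory dictionaries in place of a real social network API. The names `get_friends` and `get_music_likes` mirror the slide's pseudocode; the data and the thread pool are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical in-memory data standing in for a social network API.
FRIENDS = {"alice": ["bob", "carol"]}
MUSIC_LIKES = {"bob": ["jazz"], "carol": ["jazz", "techno"]}


def get_friends(user_id):
    """Return the list of friend IDs for a user (empty if unknown)."""
    return FRIENDS.get(user_id, [])


def get_music_likes(user_ids):
    """Take the whole result of get_friends at once.

    Because the caller never spells out a loop, this function is free to
    decide how the lookups are spread out, here across a thread pool.
    """
    with ThreadPoolExecutor() as pool:
        return list(pool.map(lambda uid: MUSIC_LIKES.get(uid, []), user_ids))


likes = get_music_likes(get_friends("alice"))
print(likes)  # [['jazz'], ['jazz', 'techno']]
```

The explicit `for` loop fixes the order of execution in the caller; the composed call leaves that decision to the function being called.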



Check out my tool list:
                         http://www.hcboos.net/100-links/




2 Examples




The AMP3 Platform at Senzari.com
                         Adaptable Music Parallel Processing Platform



MARS-o-Matic at arago.de
                         Big data based IT modelling and pricing app

Thank You for Your Time
Credits
• "Big Data Just Beginning to Explode" by CSC, http://www.csc.com/insights/flxwd/78931-big_data_just_beginning_to_explode
• "Social media network connections among twitter users" by Marc Smith, http://www.flickr.com/photos/marc_smith/
• Asteroid Datasets by Bruce Gary, http://brucegary.net/POVENMIRE/x.htm


Chris Boos, arago AG: Big Data means new programs
