SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
Big Data
       Little 	

 Tests	

              John Heintz	

           Founder, Gist Labs	

Technical Consultant, Cutter Consortium	

                     	

      john@gistlabs.com @jheintz	

           http://gistlabs.com
About John Heintz	

       •  Developer since 1995	

       •  Agilist since 1999	

       •  Founded Gist Labs in 2008	


       •  Developer, Mentor, Consultant	

       •  Intuitive, Abstract, Precise	

                  Kool-Aids I’ve drank:	

                  Agile/Lean/Kanban, OO, TDD, REST, Mentoring, Craftsmanship,
                  Emergent/Progressive Design, InnovationGames®, Systems and
                  Complexity Theory	

                  	

2	

                                                                © 2012 Gist Labs, LLC
My Goals for You	



       •  Demystify test automation for Big Data	

       •  Provide executable examples	



3	

                                        © 2012 Gist Labs, LLC
What you shouldn’t
             expect…	


       •  Barely introduce Big Data concepts	

       •  No performance tuning	



4	

                                        © 2012 Gist Labs, LLC
Simple Code, Config	


       •  I went as simple and clear as possible	

       •  Java, JUnit4	

       •  Maven… okay maybe not simple :-	


5	

                                          © 2012 Gist Labs, LLC
Mostly Code	


       •  Remember the Law of Two Feet	

       •  If code isn’t what you were looking for I
         totally respect you finding something better
         for your time J	




6	

                                        © 2012 Gist Labs, LLC
•  Everything available from
             	

http://gistlabs.com/2012/08/big-data-little-tests/ 	

       	

       •  The entire command script is there…	

             	

so you can take notes assuming that’s available	




7	

                                                    © 2012 Gist Labs, LLC
My Soapboxes…	


       These are topics I’ll repeat myself on	



       •  Fast test execution	

       •  One-click build	


8	

                                          © 2012 Gist Labs, LLC
Big Data	


       •  Too much	

       •  Too fast	

       •  Not trivially structured	


9	

                                    © 2012 Gist Labs, LLC
Map Reduce	

        •  Map from one input to one output	

        •  Reduce from many inputs to one output	


        •  Can be run in parallel	

        •  Crude, but massive	


10	

                                      © 2012 Gist Labs, LLC
CAP Theorem	


        •  Consistency	

        •  Availability	

        •  Partition Tolerance	


11	

                               © 2012 Gist Labs, LLC
Big Data Ecosystem	

        •  Hadoop: A giant among giants	

          	

    	

(Tons of projects on this platform!!)	

        •  Cassandra: Feels like a weird RDBMS	

        •  Riak: An elegant key/value/search store	

        •  MongoDB: Document store	

12	

                                              © 2012 Gist Labs, LLC
Let’s Run Some Code	




13	

                    © 2012 Gist Labs, LLC
Hadoop Tests	




14	

                © 2012 Gist Labs, LLC
Riak tests	




15	

                   © 2012 Gist Labs, LLC
Other Frameworks	

        •  CassandraUnit	

          https://github.com/jsevellec/cassandra-unit	

          	

        •  PigUnit, Hadoop Query Language	

          http://pig.apache.org/docs/r0.8.1/pigunit.html	

          	


16	

                                          © 2012 Gist Labs, LLC
Code Questions?	



        •  Fast test execution?	

        •  One-click build?	



17	

                                © 2012 Gist Labs, LLC
What about Big Tests?	



        •  Real test data	

        •  Realistic cluster	



18	

                             © 2012 Gist Labs, LLC
Real Test Data	


        My favorite strategy is to:	

          •  Develop with small, crafted data	

          •  Build/test the same way	

          •  Run another test on top of real prod data	


19	

                                        © 2012 Gist Labs, LLC
Production


                               Continuous Integration Servers
                         Continuous Deployment Servers

                                             Build
               Test1

                                            Cluster
             Cluster
                    Test2

                                                                                            Cluster
                       Staging


        Developers



                               Version Control





        Developers





                                      Virtual vs Physical Servers

                                                                                                                             Private vs Public Cloud


                                                                                 Developer Sandboxes

                      Network Infrastructure



                                                                                                            Self-service Provisioning

                                                       Storage Infrastructure





20	

                                                                                                                        © 2012 Gist Labs, LLC
Realistic Cluster	


        •  Use a CI/DevOps environment	

        •  Virtualize, “X as a Service”	

          •  Virtual Machines	

          •  Virtual Infrastructure (Network, Storage)	


21	

                                        © 2012 Gist Labs, LLC
Jenkins CI Server	

        •  Master/slave clusters	

        •  Plugins for Hadoop and VMWare	

        •  http://jenkins-ci.org/	





22	

                                     © 2012 Gist Labs, LLC
Big Questions?	





23	

                  © 2012 Gist Labs, LLC
Thank you!	


        •  Everything available from:	

          	

http://gistlabs.com/2012/08/big-data-little-tests/ 	



        •  John Heintz, @jheintz, http://gistlabs.com	


24	

                                                © 2012 Gist Labs, LLC

Contenu connexe

Tendances

Cloud is such stuff as dreams are made on
Cloud is such stuff as dreams are made onCloud is such stuff as dreams are made on
Cloud is such stuff as dreams are made onPatrick Chanezon
 
DockerCon EU 2018 "Continuous Delivery with Docker and Java"
DockerCon EU 2018 "Continuous Delivery with Docker and Java"DockerCon EU 2018 "Continuous Delivery with Docker and Java"
DockerCon EU 2018 "Continuous Delivery with Docker and Java"Daniel Bryant
 
London k8s "Ambassador: Open Source Kubernetes Edge Gateway"
London k8s "Ambassador: Open Source Kubernetes Edge Gateway"London k8s "Ambassador: Open Source Kubernetes Edge Gateway"
London k8s "Ambassador: Open Source Kubernetes Edge Gateway"Daniel Bryant
 
Integration in the Age of DevOps
Integration in the Age of DevOpsIntegration in the Age of DevOps
Integration in the Age of DevOpsBrian Ashburn
 
Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...
Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...
Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...{code} by Dell EMC
 
Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...
Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...
Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...{code} by Dell EMC
 
Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...
Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...
Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...{code} by Dell EMC
 
JBoss Architect Meetup - November 2013 - 'Play By The Rules'
JBoss Architect Meetup - November 2013 - 'Play By The Rules'JBoss Architect Meetup - November 2013 - 'Play By The Rules'
JBoss Architect Meetup - November 2013 - 'Play By The Rules'JBossArchitectForum
 
JBoss Architect Forum London - October 2013 - Platform as a What?
JBoss Architect Forum London - October 2013 - Platform as a What?JBoss Architect Forum London - October 2013 - Platform as a What?
JBoss Architect Forum London - October 2013 - Platform as a What?JBossArchitectForum
 
DCSF 19 Modern Orchestrated IT for Enterprise CMS
DCSF 19  Modern Orchestrated IT for Enterprise CMSDCSF 19  Modern Orchestrated IT for Enterprise CMS
DCSF 19 Modern Orchestrated IT for Enterprise CMSDocker, Inc.
 
DBTA Data Summit : Eliminating the data constraint in Application Development
DBTA Data Summit : Eliminating the data constraint in Application DevelopmentDBTA Data Summit : Eliminating the data constraint in Application Development
DBTA Data Summit : Eliminating the data constraint in Application DevelopmentKyle Hailey
 
A No-Nonsense Approach to Hyperconvergence
A No-Nonsense Approach to HyperconvergenceA No-Nonsense Approach to Hyperconvergence
A No-Nonsense Approach to HyperconvergenceChris Wahl
 
3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppo
3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppo3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppo
3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppoJoe Clarke
 
The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017
The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017
The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017{code} by Dell EMC
 
Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...
Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...
Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...{code} by Dell EMC
 
Cloud Security - I ain’t rocket science @ Club.cloud 20211103
Cloud Security - I ain’t rocket science @ Club.cloud 20211103Cloud Security - I ain’t rocket science @ Club.cloud 20211103
Cloud Security - I ain’t rocket science @ Club.cloud 20211103Edzo Botjes
 
VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...
VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...
VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...VMworld
 
AllDayDevOps: "Microservices: The People and Organisational Impact"
AllDayDevOps: "Microservices: The People and Organisational Impact"AllDayDevOps: "Microservices: The People and Organisational Impact"
AllDayDevOps: "Microservices: The People and Organisational Impact"Daniel Bryant
 

Tendances (20)

Cloud is such stuff as dreams are made on
Cloud is such stuff as dreams are made onCloud is such stuff as dreams are made on
Cloud is such stuff as dreams are made on
 
DockerCon EU 2018 "Continuous Delivery with Docker and Java"
DockerCon EU 2018 "Continuous Delivery with Docker and Java"DockerCon EU 2018 "Continuous Delivery with Docker and Java"
DockerCon EU 2018 "Continuous Delivery with Docker and Java"
 
London k8s "Ambassador: Open Source Kubernetes Edge Gateway"
London k8s "Ambassador: Open Source Kubernetes Edge Gateway"London k8s "Ambassador: Open Source Kubernetes Edge Gateway"
London k8s "Ambassador: Open Source Kubernetes Edge Gateway"
 
Integration in the Age of DevOps
Integration in the Age of DevOpsIntegration in the Age of DevOps
Integration in the Age of DevOps
 
Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...
Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...
Reduce Your Configuration Management Nightmare with Docker - Jonah Horowitz -...
 
Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...
Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...
Large Scale Cassandra Made Better in Containers - Chris Duchesne and Aaron Sp...
 
Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...
Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...
Docker for Private Clouds with RackHD - Justin Kenney and Aaron Spiegel - Del...
 
JBoss Architect Meetup - November 2013 - 'Play By The Rules'
JBoss Architect Meetup - November 2013 - 'Play By The Rules'JBoss Architect Meetup - November 2013 - 'Play By The Rules'
JBoss Architect Meetup - November 2013 - 'Play By The Rules'
 
JBoss Architect Forum London - October 2013 - Platform as a What?
JBoss Architect Forum London - October 2013 - Platform as a What?JBoss Architect Forum London - October 2013 - Platform as a What?
JBoss Architect Forum London - October 2013 - Platform as a What?
 
DCSF 19 Modern Orchestrated IT for Enterprise CMS
DCSF 19  Modern Orchestrated IT for Enterprise CMSDCSF 19  Modern Orchestrated IT for Enterprise CMS
DCSF 19 Modern Orchestrated IT for Enterprise CMS
 
DBTA Data Summit : Eliminating the data constraint in Application Development
DBTA Data Summit : Eliminating the data constraint in Application DevelopmentDBTA Data Summit : Eliminating the data constraint in Application Development
DBTA Data Summit : Eliminating the data constraint in Application Development
 
A No-Nonsense Approach to Hyperconvergence
A No-Nonsense Approach to HyperconvergenceA No-Nonsense Approach to Hyperconvergence
A No-Nonsense Approach to Hyperconvergence
 
3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppo
3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppo3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppo
3V0-622 objective-3.1-logical-physical with Joe Clarke @elgwhoppo
 
The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017
The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017
The Open Source Effect on Dell EMC - Joshua Bernstein - Dell EMC World 2017
 
Core Principles Of Ci
Core Principles Of CiCore Principles Of Ci
Core Principles Of Ci
 
Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...
Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...
Google does containers: Hello Kubernetes - Steve Wong and Vladimir Vivien - D...
 
Cloud Security - I ain’t rocket science @ Club.cloud 20211103
Cloud Security - I ain’t rocket science @ Club.cloud 20211103Cloud Security - I ain’t rocket science @ Club.cloud 20211103
Cloud Security - I ain’t rocket science @ Club.cloud 20211103
 
VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...
VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...
VMworld 2013: How UC San Francisco Delivered ‘Science as a Service’ with Priv...
 
AllDayDevOps: "Microservices: The People and Organisational Impact"
AllDayDevOps: "Microservices: The People and Organisational Impact"AllDayDevOps: "Microservices: The People and Organisational Impact"
AllDayDevOps: "Microservices: The People and Organisational Impact"
 
Javantura v7 - Behaviour Driven Development with Cucumber - Ivan Lozić
Javantura v7 - Behaviour Driven Development with Cucumber - Ivan LozićJavantura v7 - Behaviour Driven Development with Cucumber - Ivan Lozić
Javantura v7 - Behaviour Driven Development with Cucumber - Ivan Lozić
 

En vedette

Simple methods for_reliable_user_involvement
Simple methods for_reliable_user_involvementSimple methods for_reliable_user_involvement
Simple methods for_reliable_user_involvementdrewz lin
 
Agile and the nature of decision making
Agile and the nature of decision makingAgile and the nature of decision making
Agile and the nature of decision makingdrewz lin
 
Athrun instrument driver
Athrun instrument driverAthrun instrument driver
Athrun instrument driverdrewz lin
 
Scaling scrum-mega-framework
Scaling scrum-mega-frameworkScaling scrum-mega-framework
Scaling scrum-mega-frameworkdrewz lin
 
4 exercises for part 1
4   exercises for part 14   exercises for part 1
4 exercises for part 1drewz lin
 
Djt22 justinliu djt.qq.com
Djt22 justinliu djt.qq.comDjt22 justinliu djt.qq.com
Djt22 justinliu djt.qq.comdrewz lin
 
Top100summit 互联网发布与实验体系建设
Top100summit 互联网发布与实验体系建设 Top100summit 互联网发布与实验体系建设
Top100summit 互联网发布与实验体系建设 drewz lin
 
豆瓣I os自动化测试实践和经验
豆瓣I os自动化测试实践和经验豆瓣I os自动化测试实践和经验
豆瓣I os自动化测试实践和经验drewz lin
 
敏捷模式下的百度研发基础环境建设 1
敏捷模式下的百度研发基础环境建设 1敏捷模式下的百度研发基础环境建设 1
敏捷模式下的百度研发基础环境建设 1drewz lin
 
Soundararajan arthurbalci agile2012_handout_v1_0
Soundararajan arthurbalci agile2012_handout_v1_0Soundararajan arthurbalci agile2012_handout_v1_0
Soundararajan arthurbalci agile2012_handout_v1_0drewz lin
 
Hands onagile2012shared
Hands onagile2012sharedHands onagile2012shared
Hands onagile2012shareddrewz lin
 
Test design problems investigation taixiaomei 20120807
Test design problems investigation taixiaomei 20120807Test design problems investigation taixiaomei 20120807
Test design problems investigation taixiaomei 20120807drewz lin
 
Buildinga teamthroughfeedback 0
Buildinga teamthroughfeedback 0Buildinga teamthroughfeedback 0
Buildinga teamthroughfeedback 0drewz lin
 
Tcpcopy 阿里技术沙龙
Tcpcopy 阿里技术沙龙Tcpcopy 阿里技术沙龙
Tcpcopy 阿里技术沙龙drewz lin
 
15 ma cs and signatures
15   ma cs and signatures15   ma cs and signatures
15 ma cs and signaturesdrewz lin
 
9 password security
9   password security9   password security
9 password securitydrewz lin
 
大规模网站架构
大规模网站架构大规模网站架构
大规模网站架构drewz lin
 
Agile 2012 - leadership agility workshop slides -- final.pptx
Agile 2012  - leadership agility workshop slides -- final.pptxAgile 2012  - leadership agility workshop slides -- final.pptx
Agile 2012 - leadership agility workshop slides -- final.pptxdrewz lin
 
13 asymmetric key cryptography
13   asymmetric key cryptography13   asymmetric key cryptography
13 asymmetric key cryptographydrewz lin
 

En vedette (19)

Simple methods for_reliable_user_involvement
Simple methods for_reliable_user_involvementSimple methods for_reliable_user_involvement
Simple methods for_reliable_user_involvement
 
Agile and the nature of decision making
Agile and the nature of decision makingAgile and the nature of decision making
Agile and the nature of decision making
 
Athrun instrument driver
Athrun instrument driverAthrun instrument driver
Athrun instrument driver
 
Scaling scrum-mega-framework
Scaling scrum-mega-frameworkScaling scrum-mega-framework
Scaling scrum-mega-framework
 
4 exercises for part 1
4   exercises for part 14   exercises for part 1
4 exercises for part 1
 
Djt22 justinliu djt.qq.com
Djt22 justinliu djt.qq.comDjt22 justinliu djt.qq.com
Djt22 justinliu djt.qq.com
 
Top100summit 互联网发布与实验体系建设
Top100summit 互联网发布与实验体系建设 Top100summit 互联网发布与实验体系建设
Top100summit 互联网发布与实验体系建设
 
豆瓣I os自动化测试实践和经验
豆瓣I os自动化测试实践和经验豆瓣I os自动化测试实践和经验
豆瓣I os自动化测试实践和经验
 
敏捷模式下的百度研发基础环境建设 1
敏捷模式下的百度研发基础环境建设 1敏捷模式下的百度研发基础环境建设 1
敏捷模式下的百度研发基础环境建设 1
 
Soundararajan arthurbalci agile2012_handout_v1_0
Soundararajan arthurbalci agile2012_handout_v1_0Soundararajan arthurbalci agile2012_handout_v1_0
Soundararajan arthurbalci agile2012_handout_v1_0
 
Hands onagile2012shared
Hands onagile2012sharedHands onagile2012shared
Hands onagile2012shared
 
Test design problems investigation taixiaomei 20120807
Test design problems investigation taixiaomei 20120807Test design problems investigation taixiaomei 20120807
Test design problems investigation taixiaomei 20120807
 
Buildinga teamthroughfeedback 0
Buildinga teamthroughfeedback 0Buildinga teamthroughfeedback 0
Buildinga teamthroughfeedback 0
 
Tcpcopy 阿里技术沙龙
Tcpcopy 阿里技术沙龙Tcpcopy 阿里技术沙龙
Tcpcopy 阿里技术沙龙
 
15 ma cs and signatures
15   ma cs and signatures15   ma cs and signatures
15 ma cs and signatures
 
9 password security
9   password security9   password security
9 password security
 
大规模网站架构
大规模网站架构大规模网站架构
大规模网站架构
 
Agile 2012 - leadership agility workshop slides -- final.pptx
Agile 2012  - leadership agility workshop slides -- final.pptxAgile 2012  - leadership agility workshop slides -- final.pptx
Agile 2012 - leadership agility workshop slides -- final.pptx
 
13 asymmetric key cryptography
13   asymmetric key cryptography13   asymmetric key cryptography
13 asymmetric key cryptography
 

Similaire à Big datalittletests heintz

Integrating Big Data Technologies
Integrating Big Data TechnologiesIntegrating Big Data Technologies
Integrating Big Data TechnologiesDATAVERSITY
 
Mobile Application Testing in the Cloud - Oct 2012
Mobile Application Testing in the Cloud - Oct 2012Mobile Application Testing in the Cloud - Oct 2012
Mobile Application Testing in the Cloud - Oct 2012CloudBees
 
Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...
Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...
Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...CloudBees
 
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12Puppet
 
Cloud Foundry, the Open Platform as a Service - Oscon - July 2012
Cloud Foundry, the Open Platform as a Service - Oscon - July 2012Cloud Foundry, the Open Platform as a Service - Oscon - July 2012
Cloud Foundry, the Open Platform as a Service - Oscon - July 2012Patrick Chanezon
 
How bigtop leveraged docker for build automation and one click hadoop provis...
How bigtop leveraged docker for build automation and  one click hadoop provis...How bigtop leveraged docker for build automation and  one click hadoop provis...
How bigtop leveraged docker for build automation and one click hadoop provis...Evans Ye
 
Cinder OpenStack -- Meetup Austin, TX
Cinder OpenStack -- Meetup Austin, TX Cinder OpenStack -- Meetup Austin, TX
Cinder OpenStack -- Meetup Austin, TX NetApp
 
"Portrait of the developer as The Artist" Lockheed Architect Workshop
"Portrait of the developer as The Artist" Lockheed Architect Workshop"Portrait of the developer as The Artist" Lockheed Architect Workshop
"Portrait of the developer as The Artist" Lockheed Architect WorkshopPatrick Chanezon
 
Scaling With Sun Systems For MySQL Jan09
Scaling With Sun Systems For MySQL Jan09Scaling With Sun Systems For MySQL Jan09
Scaling With Sun Systems For MySQL Jan09Steve Staso
 
DevOps for Big Data - Data 360 2014 Conference
DevOps for Big Data - Data 360 2014 ConferenceDevOps for Big Data - Data 360 2014 Conference
DevOps for Big Data - Data 360 2014 ConferenceGrid Dynamics
 
Deploying your SaaS stack OnPrem
Deploying your SaaS stack OnPremDeploying your SaaS stack OnPrem
Deploying your SaaS stack OnPremKris Buytaert
 
Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...
Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...
Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...Mike Dorman
 
Node.js what's next (Index 2018)
Node.js what's next (Index 2018)Node.js what's next (Index 2018)
Node.js what's next (Index 2018)Gibson Fahnestock
 
Nagios Conference 2012 - Nathan Vonnahme - Monitoring the User Experience
Nagios Conference 2012 - Nathan Vonnahme - Monitoring the User ExperienceNagios Conference 2012 - Nathan Vonnahme - Monitoring the User Experience
Nagios Conference 2012 - Nathan Vonnahme - Monitoring the User ExperienceNagios
 
Basics About Git & GitHub
Basics About Git & GitHubBasics About Git & GitHub
Basics About Git & GitHubRaiful Hasan
 
Design For Testability
Design For TestabilityDesign For Testability
Design For TestabilityWill Iverson
 

Similaire à Big datalittletests heintz (20)

Integrating Big Data Technologies
Integrating Big Data TechnologiesIntegrating Big Data Technologies
Integrating Big Data Technologies
 
Mobile Application Testing in the Cloud - Oct 2012
Mobile Application Testing in the Cloud - Oct 2012Mobile Application Testing in the Cloud - Oct 2012
Mobile Application Testing in the Cloud - Oct 2012
 
Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...
Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...
Jumping from Continuous Integration to Continuous Delivery with Jenkins Enter...
 
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
Continuous Development with Jenkins - Stephen Connolly at PuppetCamp Dublin '12
 
Cloud foundry and openstackcloud
Cloud foundry and openstackcloudCloud foundry and openstackcloud
Cloud foundry and openstackcloud
 
Cloud Foundry, the Open Platform as a Service - Oscon - July 2012
Cloud Foundry, the Open Platform as a Service - Oscon - July 2012Cloud Foundry, the Open Platform as a Service - Oscon - July 2012
Cloud Foundry, the Open Platform as a Service - Oscon - July 2012
 
How bigtop leveraged docker for build automation and one click hadoop provis...
How bigtop leveraged docker for build automation and  one click hadoop provis...How bigtop leveraged docker for build automation and  one click hadoop provis...
How bigtop leveraged docker for build automation and one click hadoop provis...
 
OWF12/Java Sacha labourey
OWF12/Java Sacha laboureyOWF12/Java Sacha labourey
OWF12/Java Sacha labourey
 
Cinder OpenStack -- Meetup Austin, TX
Cinder OpenStack -- Meetup Austin, TX Cinder OpenStack -- Meetup Austin, TX
Cinder OpenStack -- Meetup Austin, TX
 
DevOps Days Ohio
DevOps Days OhioDevOps Days Ohio
DevOps Days Ohio
 
"Portrait of the developer as The Artist" Lockheed Architect Workshop
"Portrait of the developer as The Artist" Lockheed Architect Workshop"Portrait of the developer as The Artist" Lockheed Architect Workshop
"Portrait of the developer as The Artist" Lockheed Architect Workshop
 
Scaling With Sun Systems For MySQL Jan09
Scaling With Sun Systems For MySQL Jan09Scaling With Sun Systems For MySQL Jan09
Scaling With Sun Systems For MySQL Jan09
 
DevOps for Big Data - Data 360 2014 Conference
DevOps for Big Data - Data 360 2014 ConferenceDevOps for Big Data - Data 360 2014 Conference
DevOps for Big Data - Data 360 2014 Conference
 
Deploying your SaaS stack OnPrem
Deploying your SaaS stack OnPremDeploying your SaaS stack OnPrem
Deploying your SaaS stack OnPrem
 
Implementing your own Google App Engine
Implementing your own Google App Engine Implementing your own Google App Engine
Implementing your own Google App Engine
 
Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...
Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...
Don't Repeat Our Mistakes! Lessons Learned from Running Go Daddy's Private Cl...
 
Node.js what's next (Index 2018)
Node.js what's next (Index 2018)Node.js what's next (Index 2018)
Node.js what's next (Index 2018)
 
Nagios Conference 2012 - Nathan Vonnahme - Monitoring the User Experience
Nagios Conference 2012 - Nathan Vonnahme - Monitoring the User ExperienceNagios Conference 2012 - Nathan Vonnahme - Monitoring the User Experience
Nagios Conference 2012 - Nathan Vonnahme - Monitoring the User Experience
 
Basics About Git & GitHub
Basics About Git & GitHubBasics About Git & GitHub
Basics About Git & GitHub
 
Design For Testability
Design For TestabilityDesign For Testability
Design For Testability
 

Plus de drewz lin

Web security-–-everything-we-know-is-wrong-eoin-keary
Web security-–-everything-we-know-is-wrong-eoin-kearyWeb security-–-everything-we-know-is-wrong-eoin-keary
Web security-–-everything-we-know-is-wrong-eoin-kearydrewz lin
 
Via forensics appsecusa-nov-2013
Via forensics appsecusa-nov-2013Via forensics appsecusa-nov-2013
Via forensics appsecusa-nov-2013drewz lin
 
Phu appsec13
Phu appsec13Phu appsec13
Phu appsec13drewz lin
 
Owasp2013 johannesullrich
Owasp2013 johannesullrichOwasp2013 johannesullrich
Owasp2013 johannesullrichdrewz lin
 
Owasp advanced mobile-application-code-review-techniques-v0.2
Owasp advanced mobile-application-code-review-techniques-v0.2Owasp advanced mobile-application-code-review-techniques-v0.2
Owasp advanced mobile-application-code-review-techniques-v0.2drewz lin
 
I mas appsecusa-nov13-v2
I mas appsecusa-nov13-v2I mas appsecusa-nov13-v2
I mas appsecusa-nov13-v2drewz lin
 
Defeating xss-and-xsrf-with-my faces-frameworks-steve-wolf
Defeating xss-and-xsrf-with-my faces-frameworks-steve-wolfDefeating xss-and-xsrf-with-my faces-frameworks-steve-wolf
Defeating xss-and-xsrf-with-my faces-frameworks-steve-wolfdrewz lin
 
Csrf not-all-defenses-are-created-equal
Csrf not-all-defenses-are-created-equalCsrf not-all-defenses-are-created-equal
Csrf not-all-defenses-are-created-equaldrewz lin
 
Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21
Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21
Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21drewz lin
 
Appsec usa roberthansen
Appsec usa roberthansenAppsec usa roberthansen
Appsec usa roberthansendrewz lin
 
Appsec usa2013 js_libinsecurity_stefanodipaola
Appsec usa2013 js_libinsecurity_stefanodipaolaAppsec usa2013 js_libinsecurity_stefanodipaola
Appsec usa2013 js_libinsecurity_stefanodipaoladrewz lin
 
Appsec2013 presentation-dickson final-with_all_final_edits
Appsec2013 presentation-dickson final-with_all_final_editsAppsec2013 presentation-dickson final-with_all_final_edits
Appsec2013 presentation-dickson final-with_all_final_editsdrewz lin
 
Appsec2013 presentation
Appsec2013 presentationAppsec2013 presentation
Appsec2013 presentationdrewz lin
 
Appsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitations
Appsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitationsAppsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitations
Appsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitationsdrewz lin
 
Appsec2013 assurance tagging-robert martin
Appsec2013 assurance tagging-robert martinAppsec2013 assurance tagging-robert martin
Appsec2013 assurance tagging-robert martindrewz lin
 
Amol scadaowasp
Amol scadaowaspAmol scadaowasp
Amol scadaowaspdrewz lin
 
Agile sdlc-v1.1-owasp-app sec-usa
Agile sdlc-v1.1-owasp-app sec-usaAgile sdlc-v1.1-owasp-app sec-usa
Agile sdlc-v1.1-owasp-app sec-usadrewz lin
 
Vulnex app secusa2013
Vulnex app secusa2013Vulnex app secusa2013
Vulnex app secusa2013drewz lin
 
基于虚拟化技术的分布式软件测试框架
基于虚拟化技术的分布式软件测试框架基于虚拟化技术的分布式软件测试框架
基于虚拟化技术的分布式软件测试框架drewz lin
 
新浪微博稳定性经验谈
新浪微博稳定性经验谈新浪微博稳定性经验谈
新浪微博稳定性经验谈drewz lin
 

Plus de drewz lin (20)

Web security-–-everything-we-know-is-wrong-eoin-keary
Web security-–-everything-we-know-is-wrong-eoin-kearyWeb security-–-everything-we-know-is-wrong-eoin-keary
Web security-–-everything-we-know-is-wrong-eoin-keary
 
Via forensics appsecusa-nov-2013
Via forensics appsecusa-nov-2013Via forensics appsecusa-nov-2013
Via forensics appsecusa-nov-2013
 
Phu appsec13
Phu appsec13Phu appsec13
Phu appsec13
 
Owasp2013 johannesullrich
Owasp2013 johannesullrichOwasp2013 johannesullrich
Owasp2013 johannesullrich
 
Owasp advanced mobile-application-code-review-techniques-v0.2
Owasp advanced mobile-application-code-review-techniques-v0.2Owasp advanced mobile-application-code-review-techniques-v0.2
Owasp advanced mobile-application-code-review-techniques-v0.2
 
I mas appsecusa-nov13-v2
I mas appsecusa-nov13-v2I mas appsecusa-nov13-v2
I mas appsecusa-nov13-v2
 
Defeating xss-and-xsrf-with-my faces-frameworks-steve-wolf
Defeating xss-and-xsrf-with-my faces-frameworks-steve-wolfDefeating xss-and-xsrf-with-my faces-frameworks-steve-wolf
Defeating xss-and-xsrf-with-my faces-frameworks-steve-wolf
 
Csrf not-all-defenses-are-created-equal
Csrf not-all-defenses-are-created-equalCsrf not-all-defenses-are-created-equal
Csrf not-all-defenses-are-created-equal
 
Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21
Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21
Chuck willis-owaspbwa-beyond-1.0-app secusa-2013-11-21
 
Appsec usa roberthansen
Appsec usa roberthansenAppsec usa roberthansen
Appsec usa roberthansen
 
Appsec usa2013 js_libinsecurity_stefanodipaola
Appsec usa2013 js_libinsecurity_stefanodipaolaAppsec usa2013 js_libinsecurity_stefanodipaola
Appsec usa2013 js_libinsecurity_stefanodipaola
 
Appsec2013 presentation-dickson final-with_all_final_edits
Appsec2013 presentation-dickson final-with_all_final_editsAppsec2013 presentation-dickson final-with_all_final_edits
Appsec2013 presentation-dickson final-with_all_final_edits
 
Appsec2013 presentation
Appsec2013 presentationAppsec2013 presentation
Appsec2013 presentation
 
Appsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitations
Appsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitationsAppsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitations
Appsec 2013-krehel-ondrej-forensic-investigations-of-web-exploitations
 
Appsec2013 assurance tagging-robert martin
Appsec2013 assurance tagging-robert martinAppsec2013 assurance tagging-robert martin
Appsec2013 assurance tagging-robert martin
 
Amol scadaowasp
Amol scadaowaspAmol scadaowasp
Amol scadaowasp
 
Agile sdlc-v1.1-owasp-app sec-usa
Agile sdlc-v1.1-owasp-app sec-usaAgile sdlc-v1.1-owasp-app sec-usa
Agile sdlc-v1.1-owasp-app sec-usa
 
Vulnex app secusa2013
Vulnex app secusa2013Vulnex app secusa2013
Vulnex app secusa2013
 
基于虚拟化技术的分布式软件测试框架
基于虚拟化技术的分布式软件测试框架基于虚拟化技术的分布式软件测试框架
基于虚拟化技术的分布式软件测试框架
 
新浪微博稳定性经验谈
新浪微博稳定性经验谈新浪微博稳定性经验谈
新浪微博稳定性经验谈
 

Big datalittletests heintz

  • 1. Big Data Little Tests John Heintz Founder, Gist Labs Technical Consultant, Cutter Consortium john@gistlabs.com @jheintz http://gistlabs.com
  • 2. About John Heintz •  Developer since 1995 •  Agilist since 1999 •  Founded Gist Labs in 2008 •  Developer, Mentor, Consultant •  Intuitive, Abstract, Precise Kool-Aids I’ve drank: Agile/Lean/Kanban, OO, TDD, REST, Mentoring, Craftsmanship, Emergent/Progressive Design, InnovationGames®, Systems and Complexity Theory 2 © 2012 Gist Labs, LLC
  • 3. My Goals for You •  Demystify test automation for Big Data •  Provide executable examples 3 © 2012 Gist Labs, LLC
  • 4. What you shouldn’t expect… •  Barely introduce Big Data concepts •  No performance tuning 4 © 2012 Gist Labs, LLC
  • 5. Simple Code, Config •  I went as simple and clear as possible •  Java, JUnit4 •  Maven… okay maybe not simple :- 5 © 2012 Gist Labs, LLC
  • 6. Mostly Code •  Remember the Law of Two Feet •  If code isn’t what you were looking for I totally respect you finding something better for your time J 6 © 2012 Gist Labs, LLC
  • 7. •  Everything available from http://gistlabs.com/2012/08/big-data-little-tests/ •  The entire command script is there… so you can take notes assuming that’s available 7 © 2012 Gist Labs, LLC
  • 8. My Soapboxes… These are topics I’ll repeat myself on •  Fast test execution •  One-click build 8 © 2012 Gist Labs, LLC
  • 9. Big Data •  Too much •  Too fast •  Not trivially structured 9 © 2012 Gist Labs, LLC
  • 10. Map Reduce •  Map from one input to one output •  Reduce from many inputs to one output •  Can be run in parallel •  Crude, but massive 10 © 2012 Gist Labs, LLC
  • 11. CAP Theorem •  Consistency •  Availability •  Partition Tolerance 11 © 2012 Gist Labs, LLC
  • 12. Big Data Ecosystem •  Hadoop: A giant among giants (Tons of projects on this platform!!) •  Cassandra: Feels like a weird RDBMS •  Riak: An elegant key/value/search store •  MongoDB: Document store 12 © 2012 Gist Labs, LLC
  • 13. Let’s Run Some Code 13 © 2012 Gist Labs, LLC
  • 14. Hadoop Tests 14 © 2012 Gist Labs, LLC
  • 15. Riak tests 15 © 2012 Gist Labs, LLC
  • 16. Other Frameworks •  CassandraUnit https://github.com/jsevellec/cassandra-unit •  PigUnit, Hadoop Query Language http://pig.apache.org/docs/r0.8.1/pigunit.html 16 © 2012 Gist Labs, LLC
  • 17. Code Questions? •  Fast test execution? •  One-click build? 17 © 2012 Gist Labs, LLC
  • 18. What about Big Tests? •  Real test data •  Realistic cluster 18 © 2012 Gist Labs, LLC
  • 19. Real Test Data My favorite strategy is to: •  Develop with small, crafted data •  Build/test the same way •  Run another test on top of real prod data 19 © 2012 Gist Labs, LLC
  • 20. Production Continuous Integration Servers Continuous Deployment Servers Build Test1 Cluster Cluster Test2 Cluster Staging Developers Version Control Developers Virtual vs Physical Servers Private vs Public Cloud Developer Sandboxes Network Infrastructure Self-service Provisioning Storage Infrastructure 20 © 2012 Gist Labs, LLC
  • 21. Realistic Cluster •  Use a CI/DevOps environment •  Virtualize, “X as a Service” •  Virtual Machines •  Virtual Infrastructure (Network, Storage) 21 © 2012 Gist Labs, LLC
  • 22. Jenkins CI Server •  Master/slave clusters •  Plugins for Hadoop and VMWare •  http://jenkins-ci.org/ 22 © 2012 Gist Labs, LLC
  • 23. Big Questions? 23 © 2012 Gist Labs, LLC
  • 24. Thank you! •  Everything available from: http://gistlabs.com/2012/08/big-data-little-tests/ •  John Heintz, @jheintz, http://gistlabs.com 24 © 2012 Gist Labs, LLC