SlideShare a Scribd company logo
1 of 15
Download to read offline
Big Data at News International!
Welcome !
Big Data at News International!
!
Big Data Event - 29/05/2013!
Mike Keating: Product Owner!
@mikerkeating!
	
  
Slide 01/14!
Big Data at News International!
Introduction!
•  Where to start!
•  Data and decisions!
•  Our technology choices!
•  Lessons!
!
Big Data Event - 29/05/2013!Slide 02/14!
Big Data at News International!
Where to start…!
Big Data Event - 29/05/2013!Slide 03/14!
Lots of data, suppliers, technology and teams!
!
Take control, bring digital data into one place!
Link data sets and look for new approaches!
!
Make data/ information/ knowledge/ insight available!
Build awareness and decision making capability!
	
  
Big Data at News International!
Making the basics available via dashboards!
Big Data Event - 29/05/2013!Slide 04/14!
Big Data at News International!
Understanding our content and consumption!
Big Data Event - 29/05/2013!Slide 07/14!
Visits per Visitor
ViewsperVisit
Behaviours Across Website Sections
Big Data at News International!
Understanding our content in a social world!
Big Data Event - 29/05/2013!Slide 08/14!
Big Data at News International!
Tracking	
  subscrip.on	
  growth	
  across	
  products!
Big Data Event - 29/05/2013!Slide 05/14!
Days Following First Subscription
Subscriptions
Product Growth Following Launch
Big Data at News International!
Analysing attributes that indicate churn!
Big Data Event - 29/05/2013!Slide 06/14!
Big Data at News International!
Designing products based on patterns in navigation !
The	
  iPhone	
  –	
  Octopus	
  Naviga4on	
  
The	
  Website	
  –	
  Flower	
  Petal	
  Naviga4on	
  
The	
  iPad	
  Edi4on	
  -­‐	
  Linear	
  Naviga4on	
  
Big Data Event - 29/05/2013!Slide 10/14!
Big Data at News International!
Our technology choices – what?!
Big Data Event - 29/05/2013!Slide 11/14!
Infrastructure: AWS EC2, S3, RDS, EMR, Cloudformation, Vagrant!
Ops: Jenkins, Anthill Pro, Maven, Nexus, Zabbix, CloudWatch!
Code & Config: Puppet, Github!
Data Retrieval: Java & Python!
Data Pipeline: Java Map Reduce, Apache Crunch, !
Spark, MRUnit, Celery, RabbitMQ, Python, Flume!
Data Storage: HDFS, HBase, AWS S3, MySQL, Redis!
Data Schema: Avro!
Data Access: Python APIs, Tornado, S3, Hive & AWS EMR!
Analysis: R, Pandas, Excel -> analyst’s choice!
Visualisation: R, D3, Highcharts, Google Charts!
Products: APIs, JS+HTML+CSS!
Big Data at News International!
Our technology choices – why?!
Big Data Event - 29/05/2013!Slide 11/14!
•  Team: Build on team’s skillsets and knowledge. !
•  Recruitment: Be conscious of “hire-ability”!
•  Open Source: Big wins from usage; great
communities; contribute back!
•  Versions: use what works; work with alpha releases
but not as production code!
•  Consistency: Try and use what you do today, e.g.
AWS!
•  Flexibility: Use a “better” product where it makes
sense!
Big Data at News International!
Our technology choices – who?!
Big Data Event - 29/05/2013!Slide 11/14!
•  Tech Lead: Architecture; Design; Hands-On!
•  Delivery Manager: Project Management w Agile!
•  Hadoop: build from Java Map Reduce !
•  Python: Tornado, Native Python, real-time processing!
•  Data Science: Hive, R, Modeling!
•  DevOps: AWS, Vagrant, Puppet!
•  Experience: Practical experience of Hadoop in production!
•  Capability: Ability to learn new tech, design and build!
•  Demo: contributions to projects, working examples!
Big Data at News International!
Lessons!
Big Data Event - 29/05/2013!Slide 12/14!
•  Building awareness & common knowledge!
•  Building on existing teams, systems and their work!
•  Looking for extra capability and output!
•  Focus on visuals – it needs to be sharable/ visible!
•  Working with range of teams to share outputs!
•  Making good tech choices!
Big Data at News International!
Thanks!
Big Data Event - 29/05/2013!Slide 13/14!
Big Data Team – DevOps, Hadoop, Python, UI,
Analysts, Test!
!
Technology Teams – Design, Production Ops, Perf
Testing, Security, Products, Platforms, Service Desk!
!
Editorial, Marketing, Commercial, Finance Teams!
	
  
Big Data at News International!
!
!
Thanks!
Big Data at News International	
  
	
  
	
  	
  	
  	
  	
  Contact	
  Us	
  
News	
  Interna.onal	
  Technology	
  @techatni	
  
Mike	
  Kea.ng;	
  Product	
  Owner	
  @mikerkea.ng	
  
Jobs	
  via	
  hBp://joinnitech.co.uk/	
  
Big Data Event - 29/05/2013!Slide 14/14!

More Related Content

Similar to Mike keating - News Int - 18th BDL meetup

Building intelligent applications, experimental ML with Uber’s Data Science W...
Building intelligent applications, experimental ML with Uber’s Data Science W...Building intelligent applications, experimental ML with Uber’s Data Science W...
Building intelligent applications, experimental ML with Uber’s Data Science W...
DataWorks Summit
 
Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...
Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...
Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...
Karthik Murugesan
 
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
Databricks
 
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Sarah Aerni
 

Similar to Mike keating - News Int - 18th BDL meetup (20)

Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Building intelligent applications, experimental ML with Uber’s Data Science W...
Building intelligent applications, experimental ML with Uber’s Data Science W...Building intelligent applications, experimental ML with Uber’s Data Science W...
Building intelligent applications, experimental ML with Uber’s Data Science W...
 
Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...
Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...
Uber - Building Intelligent Applications, Experimental ML with Uber’s Data Sc...
 
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
Building Intelligent Applications, Experimental ML with Uber’s Data Science W...
 
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
Data Science as a Commodity: Use MADlib, R, & other OSS Tools for Data Scienc...
 
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
Big Data Everywhere Chicago: Leading a Healthcare Company to the Big Data Pro...
 
Getting Started with Splunk Breakout Session
Getting Started with Splunk Breakout SessionGetting Started with Splunk Breakout Session
Getting Started with Splunk Breakout Session
 
Splunk hunkbeta
Splunk hunkbetaSplunk hunkbeta
Splunk hunkbeta
 
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...Geschäftliches Potential für System-Integratoren und Berater -  Graphdatenban...
Geschäftliches Potential für System-Integratoren und Berater - Graphdatenban...
 
Hadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big DataHadoop and the Future of SQL: Using BI Tools with Big Data
Hadoop and the Future of SQL: Using BI Tools with Big Data
 
HadoopWorkshopJuly2014
HadoopWorkshopJuly2014HadoopWorkshopJuly2014
HadoopWorkshopJuly2014
 
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
Solutions Linux 2013: SpagoBI and Talend jointly support Big Data scenarios
 
Journées SQL Server 2014 - Keynote Jour 2
Journées SQL Server 2014 - Keynote Jour 2Journées SQL Server 2014 - Keynote Jour 2
Journées SQL Server 2014 - Keynote Jour 2
 
Data Science with Hadoop - A primer
Data Science with Hadoop - A primerData Science with Hadoop - A primer
Data Science with Hadoop - A primer
 
Snowplow presentation for Amsterdam Meetup #3
Snowplow presentation for Amsterdam Meetup #3Snowplow presentation for Amsterdam Meetup #3
Snowplow presentation for Amsterdam Meetup #3
 
Big Data World Singapore 2017 - Moving Towards Digitization & Artificial Inte...
Big Data World Singapore 2017 - Moving Towards Digitization & Artificial Inte...Big Data World Singapore 2017 - Moving Towards Digitization & Artificial Inte...
Big Data World Singapore 2017 - Moving Towards Digitization & Artificial Inte...
 
Data Science with Hadoop: A Primer
Data Science with Hadoop: A PrimerData Science with Hadoop: A Primer
Data Science with Hadoop: A Primer
 
Big Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-AriBig Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-Ari
 
An Overview of BigData
An Overview of BigDataAn Overview of BigData
An Overview of BigData
 
Big Data in Action – Real-World Solution Showcase
 Big Data in Action – Real-World Solution Showcase Big Data in Action – Real-World Solution Showcase
Big Data in Action – Real-World Solution Showcase
 

Recently uploaded

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 

Mike keating - News Int - 18th BDL meetup

  • 1. Big Data at News International! Welcome ! Big Data at News International! ! Big Data Event - 29/05/2013! Mike Keating: Product Owner! @mikerkeating!   Slide 01/14!
  • 2. Big Data at News International! Introduction! •  Where to start! •  Data and decisions! •  Our technology choices! •  Lessons! ! Big Data Event - 29/05/2013!Slide 02/14!
  • 3. Big Data at News International! Where to start…! Big Data Event - 29/05/2013!Slide 03/14! Lots of data, suppliers, technology and teams! ! Take control, bring digital data into one place! Link data sets and look for new approaches! ! Make data/ information/ knowledge/ insight available! Build awareness and decision making capability!  
  • 4. Big Data at News International! Making the basics available via dashboards! Big Data Event - 29/05/2013!Slide 04/14!
  • 5. Big Data at News International! Understanding our content and consumption! Big Data Event - 29/05/2013!Slide 07/14! Visits per Visitor ViewsperVisit Behaviours Across Website Sections
  • 6. Big Data at News International! Understanding our content in a social world! Big Data Event - 29/05/2013!Slide 08/14!
  • 7. Big Data at News International! Tracking  subscrip.on  growth  across  products! Big Data Event - 29/05/2013!Slide 05/14! Days Following First Subscription Subscriptions Product Growth Following Launch
  • 8. Big Data at News International! Analysing attributes that indicate churn! Big Data Event - 29/05/2013!Slide 06/14!
  • 9. Big Data at News International! Designing products based on patterns in navigation ! The  iPhone  –  Octopus  Naviga4on   The  Website  –  Flower  Petal  Naviga4on   The  iPad  Edi4on  -­‐  Linear  Naviga4on   Big Data Event - 29/05/2013!Slide 10/14!
  • 10. Big Data at News International! Our technology choices – what?! Big Data Event - 29/05/2013!Slide 11/14! Infrastructure: AWS EC2, S3, RDS, EMR, Cloudformation, Vagrant! Ops: Jenkins, Anthill Pro, Maven, Nexus, Zabbix, CloudWatch! Code & Config: Puppet, Github! Data Retrieval: Java & Python! Data Pipeline: Java Map Reduce, Apache Crunch, ! Spark, MRUnit, Celery, RabbitMQ, Python, Flume! Data Storage: HDFS, HBase, AWS S3, MySQL, Redis! Data Schema: Avro! Data Access: Python APIs, Tornado, S3, Hive & AWS EMR! Analysis: R, Pandas, Excel -> analyst’s choice! Visualisation: R, D3, Highcharts, Google Charts! Products: APIs, JS+HTML+CSS!
  • 11. Big Data at News International! Our technology choices – why?! Big Data Event - 29/05/2013!Slide 11/14! •  Team: Build on team’s skillsets and knowledge. ! •  Recruitment: Be conscious of “hire-ability”! •  Open Source: Big wins from usage; great communities; contribute back! •  Versions: use what works; work with alpha releases but not as production code! •  Consistency: Try and use what you do today, e.g. AWS! •  Flexibility: Use a “better” product where it makes sense!
  • 12. Big Data at News International! Our technology choices – who?! Big Data Event - 29/05/2013!Slide 11/14! •  Tech Lead: Architecture; Design; Hands-On! •  Delivery Manager: Project Management w Agile! •  Hadoop: build from Java Map Reduce ! •  Python: Tornado, Native Python, real-time processing! •  Data Science: Hive, R, Modeling! •  DevOps: AWS, Vagrant, Puppet! •  Experience: Practical experience of Hadoop in production! •  Capability: Ability to learn new tech, design and build! •  Demo: contributions to projects, working examples!
  • 13. Big Data at News International! Lessons! Big Data Event - 29/05/2013!Slide 12/14! •  Building awareness & common knowledge! •  Building on existing teams, systems and their work! •  Looking for extra capability and output! •  Focus on visuals – it needs to be sharable/ visible! •  Working with range of teams to share outputs! •  Making good tech choices!
  • 14. Big Data at News International! Thanks! Big Data Event - 29/05/2013!Slide 13/14! Big Data Team – DevOps, Hadoop, Python, UI, Analysts, Test! ! Technology Teams – Design, Production Ops, Perf Testing, Security, Products, Platforms, Service Desk! ! Editorial, Marketing, Commercial, Finance Teams!  
  • 15. Big Data at News International! ! ! Thanks! Big Data at News International              Contact  Us   News  Interna.onal  Technology  @techatni   Mike  Kea.ng;  Product  Owner  @mikerkea.ng   Jobs  via  hBp://joinnitech.co.uk/   Big Data Event - 29/05/2013!Slide 14/14!