Talking about the ease of use and handling Big Data technologies in the Cloud. Using Google Cloud Platform and Amazon Web Services and all of the tools around it.
Showing the problems and how we can solve them with simple tools.
Big Data made easy in the era of the Cloud - Demi Ben-Ari
1. Big Data made easy in the era of Cloud
Demi Ben-Ari - VP R&D @ Panorays
2. About Me
Demi Ben-Ari, Co-Founder & VP R&D @ Panorays
! Google Developer Expert
! Co-Founder of Communities:
○ “Big Things” - Big Data, Data Science, DevOps
○ Google Developer Group Cloud
○ Ofek Alumni Association
In the Past:
! Sr. Data Engineer - Windward
! Team Leader & Sr. Java Software Engineer,
Missile defence and Alert System - “Ofek” – IAF
5. What is Big Data (IMHO)? And What to Monitor?
! Systems involving the “3 Vs”:
What are the right questions we want to ask?
○ Volume - How much?
○ Velocity - How fast?
○ Variety - What kind? (Difference)
6. What had happened in the last years?
! Storage got cheaper
! The capacity of Data grew exponentially
! Cloud service providers grew rapidly
! Connectivity got much easier
! Cloud made “by demand” computation possible
! “Compute” started moving to the “Data” and not the other way.
16. Structure of the Data
! Maritime Analytics Platform
! Geo Locations + Metadata
! Arriving over time
! Different types of messages being reported by satellites
! Encoded (For compression reasons)
! Might arrive later than actually transmitted
20. Data Questions? What should be measure
! Did all of the computation occur?
○ Are there any data layers missing?
! How much data do we have? (Volume)
! Is all of the data in the Database?
! Data Quality Assurance
21. Conclusions
! Keep all of the Data that you can
! In its most raw form
! Duplicating Data is not a bad thing
! By demand compute with save you much time and money
! Find the relevant tool to solve each problem
! Not one tool that will solve all of them (No such thing)
! Use the cloud as an auxiliary tool
! Will boost your productivity by much