SlideShare a Scribd company logo
1 of 15
Download to read offline
BIG DATA FOR SQL DEVELOPERS:
GET STARTED (FOR FREE)
JULY 6, 2017
BIG DATA FOR SQL DEVELOPERS
I KNOW SQL, BUT…
1. How can I run a cluster without setting one
up at home, or paying for expensive cloud
services?
2. Where do I find “big data” to analyze?
3. Do I need to learn a different programming
language?
BIG DATA FOR SQL DEVELOPERS
YOUR RDBMS - HOW WE COMMONLY VIEW IT
RDBMS
SELECT … FROM …
BIG DATA FOR SQL DEVELOPERS
YOUR RDBMS - A COLLECTION OF SYSTEMS
SELECT * FROM …
QUERY LANGUAGE INTERPRETER
QUERY PLANNER & OPTIMIZER
SECURITY
I/O
MEMORY CACHE
DATA STORAGE
DISASTER
RECOVERY
LOGGING
CONCURRENCY&TRANSACTIONALCONSISTENCY
BIG DATA FOR SQL DEVELOPERS
YOUR RDBMS - A COLLECTION OF SYSTEMS
SELECT … FROM …
OR
SQLCONTEXT.SQL(“
SELECT…”)
YARN
HDFS
CONSISTENCY?SECURITY?
BIG DATA FOR SQL DEVELOPERS
QUESTION #1 - HOW DO I GET MY OWN CLUSTER?
‣ Use IaaS
‣ AWS Free Tier
‣ Use Managed Services (AWS EMR, Azure
HDInsight)
‣ Can, but have to wait for them to contact you.
‣ Databricks Community Edition
BIG DATA FOR SQL DEVELOPERS
QUESTION #2 - WHERE CAN I FIND DATA?
‣ Census.gov
‣ The CIA world Factbook 
‣ HealthData.gov
‣ World Health Organization
‣ AWS Public Datasets
‣ Facebook Graph API
‣ Google Public Data 
‣ Databricks
YOU DON’T NEED
BIG DATA TO
LEARN BIG DATA.
- Mark Smith
BIG DATA FOR SQL DEVELOPERS
QUESTION #3 - DO I NEED TO LEARN ANOTHER LANGUAGE?
‣Yes, but you don’t have to be a software dev
‣ Python, Java, or Scala (my choice: Python)
‣ Good news: Your SQL can still help you!
BIG DATA FOR SQL DEVELOPERS
WHY SPARK?
‣Popular, vital
‣A framework for processing distributed datasets
‣Has a SQL Implementation
BIG DATA FOR SQL DEVELOPERS
WHY SPARK?
BIG DATA FOR SQL DEVELOPERS
YOUR NEXT STEPS
1. Sign up for Databricks Community Edition
2. Read and complete “A Gentle Introduction to
Apache Spark on Databricks”
3. Read and complete “Apache Spark on
Databricks for Data Engineers”
4. Read a book on the RDBMS you use most often.
BIG DATA FOR SQL DEVELOPERS
DEMO
https://community.cloud.databricks.com/?
o=8158027403376652#notebook/
3636183528035570/command/3636183528035585
DO YOU KNOW DATA, OR
DO YOU KNOW A FLAVOR
OF SQL?
BIG DATA FOR SQL DEVELOPERS
RESOURCES & CONTACT
Brent Lightsey

http://firstlightanalytics.com

brent@firstlightanalytics.com

405-295-5502

https://www.linkedin.com/in/brentlightsey/

More Related Content

Recently uploaded

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 

Featured

Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Saba Software
 
Introduction to C Programming Language
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming Language
Simplilearn
 

Featured (20)

How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
 
The six step guide to practical project management
The six step guide to practical project managementThe six step guide to practical project management
The six step guide to practical project management
 
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
Beginners Guide to TikTok for Search - Rachel Pearson - We are Tilt __ Bright...
 
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
Unlocking the Power of ChatGPT and AI in Testing - A Real-World Look, present...
 
12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work12 Ways to Increase Your Influence at Work
12 Ways to Increase Your Influence at Work
 
ChatGPT webinar slides
ChatGPT webinar slidesChatGPT webinar slides
ChatGPT webinar slides
 
More than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike RoutesMore than Just Lines on a Map: Best Practices for U.S Bike Routes
More than Just Lines on a Map: Best Practices for U.S Bike Routes
 
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
Ride the Storm: Navigating Through Unstable Periods / Katerina Rudko (Belka G...
 
Barbie - Brand Strategy Presentation
Barbie - Brand Strategy PresentationBarbie - Brand Strategy Presentation
Barbie - Brand Strategy Presentation
 
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them wellGood Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
Good Stuff Happens in 1:1 Meetings: Why you need them and how to do them well
 
Introduction to C Programming Language
Introduction to C Programming LanguageIntroduction to C Programming Language
Introduction to C Programming Language
 

Big Data for SQL Developers: Get Started for Free

  • 1. BIG DATA FOR SQL DEVELOPERS: GET STARTED (FOR FREE) JULY 6, 2017
  • 2. BIG DATA FOR SQL DEVELOPERS I KNOW SQL, BUT… 1. How can I run a cluster without setting one up at home, or paying for expensive cloud services? 2. Where do I find “big data” to analyze? 3. Do I need to learn a different programming language?
  • 3. BIG DATA FOR SQL DEVELOPERS YOUR RDBMS - HOW WE COMMONLY VIEW IT RDBMS SELECT … FROM …
  • 4. BIG DATA FOR SQL DEVELOPERS YOUR RDBMS - A COLLECTION OF SYSTEMS SELECT * FROM … QUERY LANGUAGE INTERPRETER QUERY PLANNER & OPTIMIZER SECURITY I/O MEMORY CACHE DATA STORAGE DISASTER RECOVERY LOGGING CONCURRENCY&TRANSACTIONALCONSISTENCY
  • 5. BIG DATA FOR SQL DEVELOPERS YOUR RDBMS - A COLLECTION OF SYSTEMS SELECT … FROM … OR SQLCONTEXT.SQL(“ SELECT…”) YARN HDFS CONSISTENCY?SECURITY?
  • 6. BIG DATA FOR SQL DEVELOPERS QUESTION #1 - HOW DO I GET MY OWN CLUSTER? ‣ Use IaaS ‣ AWS Free Tier ‣ Use Managed Services (AWS EMR, Azure HDInsight) ‣ Can, but have to wait for them to contact you. ‣ Databricks Community Edition
  • 7. BIG DATA FOR SQL DEVELOPERS QUESTION #2 - WHERE CAN I FIND DATA? ‣ Census.gov ‣ The CIA world Factbook  ‣ HealthData.gov ‣ World Health Organization ‣ AWS Public Datasets ‣ Facebook Graph API ‣ Google Public Data  ‣ Databricks
  • 8. YOU DON’T NEED BIG DATA TO LEARN BIG DATA. - Mark Smith
  • 9. BIG DATA FOR SQL DEVELOPERS QUESTION #3 - DO I NEED TO LEARN ANOTHER LANGUAGE? ‣Yes, but you don’t have to be a software dev ‣ Python, Java, or Scala (my choice: Python) ‣ Good news: Your SQL can still help you!
  • 10. BIG DATA FOR SQL DEVELOPERS WHY SPARK? ‣Popular, vital ‣A framework for processing distributed datasets ‣Has a SQL Implementation
  • 11. BIG DATA FOR SQL DEVELOPERS WHY SPARK?
  • 12. BIG DATA FOR SQL DEVELOPERS YOUR NEXT STEPS 1. Sign up for Databricks Community Edition 2. Read and complete “A Gentle Introduction to Apache Spark on Databricks” 3. Read and complete “Apache Spark on Databricks for Data Engineers” 4. Read a book on the RDBMS you use most often.
  • 13. BIG DATA FOR SQL DEVELOPERS DEMO https://community.cloud.databricks.com/? o=8158027403376652#notebook/ 3636183528035570/command/3636183528035585
  • 14. DO YOU KNOW DATA, OR DO YOU KNOW A FLAVOR OF SQL?
  • 15. BIG DATA FOR SQL DEVELOPERS RESOURCES & CONTACT Brent Lightsey
 http://firstlightanalytics.com
 brent@firstlightanalytics.com
 405-295-5502
 https://www.linkedin.com/in/brentlightsey/