An short introduction on Big Query. With this presentation you'll quickly discover :
How load data in BigQuery
How to build dashboard using BigQuery
How to work with BigQuery
and, at last but not least, we've added some best practices
We hope you'll enjoy this presentation and that it will help you to start exploring this wonderful solution. Don't hesitate to send us your feedbacks or questions
2. SUMMARY
HOW TO LOAD DATA TO BIGQUERY ?
01
PAGE 2
HOW TO WORK WITH BIGQUERY ?
02
PAGE 2
HOW TO BUILD DASHBOARDS USING BIGQUERY ?
03
PAGE 2
SOME BEST PRACTICES
04
PAGE 2
4. BIGQUERY - NON GEEKY DEFINITION
This is a place where you can stock and query your data for 0,02 $ per GB (~256 MP3 audio files) and then create nice (and
free!) dashboards in Google Data Studio.
It is a part of Google Cloud Platform.
You’ll need some basic SQL knowledge to work with it.
(Officially : BigQuery is a Web service from Google that is used for handling or analyzing big data. It is part of the Google
Cloud Platform.)
6. SQL
Language that allows you to communicate with your database.
Example :
SELECT name, job, salary
FROM people
WHERE salary IS NULL
Learn basic SQL with free courses :
https://campus.datacamp.com/courses/intro-to-sql-for-data-science/
https://www.codecademy.com/courses/sql-analyzing-business-metrics/
https://www.w3schools.com/sql/
7. ADVANTAGES OF BIGQUERY
2
1 3
MANAGEMENT SIMPLICITY
Easy Data management
No administration
Fully-managed
(no worries)
DATA PERFORMANCE
No limits
Fast imports
Scans TBs in seconds
GDPR - COMPLIANCE
Data storage in EU/US
Encryption
Access Controls
2−step verification
Data Loss prevention
etc.
Integration with other Google tools…
300$ to test GBQ…
Large community..
Education tools (courses, labs,etc.)
1 TB (125GB) per month is Free
14. WORK WITG GOOGLE BIGQUERY
Into (Before)
Put your data into BigQuery
Out (After)
Extract your prepared data to
your dashboard
Inside
Work with your data in
BigQuery01
02
03
16. BIGQUERY INGESTION - GOOGLE SERVICES
BigQuery Data Transfer Service
Google Analytics 360
Google Play, Google Ads(2.5$ per month per customerID), Youtube (5$ per month per channel)
Cloud Storage (No charges, only for storage
+ Using Google Cloud Dataflow (pipeline to write data to BigQuery)
etc…
18. BIGQUERY INGESTION - LOCAL MACHINE
Load data from a readable data source
- You can load data manually
- You can use API or Client Librairies
(create connectors or use paid connectors)
API is a set of clearly defined methods of communication among
various components - applications, software
BATCH task automation without manual work
19. BIGQUERY INGESTION - STREAMING
Load data immediately without delaying (for real-time reports)
- You can create your custom solution
- Use Google solutions (Google Dataflow)
- Or use paid solutions
Use-cases : mobile application - sending errors events in real time
32. BEST PRACTICES TO WORK WITH GBQ
Control costs (custom quotas)
- Set maximum per query cost limit
- Set per user daily budget
- Set per project daily budget
Use denormalized tables
Use preview
Use Query Validator
NO * SELECT (!)
Use table expiration
Use partition
Use pricing calculator
33. SOURCES AND TOOLS
- Coursera “ From Data to Insights with Google Cloud Platform” from Google Cloud
- Google Cloud Summit Prezentations
- Google BigQuery Documentation
- Qwiklabs
- StitchData / Owox / Fivetran
- dbt (command line tool)
- SinterData
- cron/crontab
34. THANKS !
7 place du Griffon | 69001 Lyon | France
+33 4 28 29 07 52 | contact@better-stronger.com
Khrystyna GRYNKO
khrystyna@better-stronger.com | +33 4 28 29 07 92