This deck covers the journey of starting with BigQuery, adding more data sources and building a process around your data warehouse. It covers the three phases greenfield, dashboards and operational analytics and the necessary data components.
The code for uploading your product feed can be found here:
https://gist.github.com/ChrisGutknecht/fde93092e21039299ab76715596eac01
If you have any questions, reach out to me on Linkedin!
3. What You Will Take Away from this Session
1. When and why you should invest in a Marketing DWH
3. Interesting use cases by combining data sources
5. Design outcome-oriented questions for analytics projects
2. Learn the data ecosystem and the benefits of BigQuery
4. Many tactical tips for daily use
4. About Chris: Head of Acquisition & Optimization
Digital Marketer
Tech nerd
Climber
1997 2008 2013 2020
Dad of 2
Big Thanks to Steffi,
our Data Scientist
5. Bergzeit: Combining Love for Mountains & Data
Online Store for Mountain Gear
122 M Revenue in Financial Year 20/21
14 Countries & 5 Languages
Commerce. Content. Guided Tours
6. Let’s Set Clear Expectations for this Session
Technical deep dive
Intro to data ecosystem
What this session IS What it’s NOT
Machine learning
Customer data focused
Google Cloud focused
Practical tips & mistakes
Ecommerce use cases
7. Why Is Data Knowledge Important for You?
Behavioural data -> digital success
Operational analytics on the rise
Requirement Privacy by design
16. Today: The Cool Cloud Kids on the Block
100% cloud, no Ops
Seamless scaling
Instantly ready
17. The Components of a Modern Data Platform
Data Warehouse
Data Ingestion
Data Catalog & Governance
Activation
Data Quality
Job Orchestration
Visualization
Transformation
18. Analysts are Turning into Analytics Engineers
Data Warehouse
Data Ingestion
Data Catalog & Governance
Activation
Data Quality
Job Orchestration
Visualization
Transformation
20. Let’s Start from The Beginning: Data Sources
Data Ingestion
Google Ads Data Transfer
GA4 Export
Google Merchant Center
Paid Connectors, e.g. Fivetran
Custom Ingestion Scripts
Google Sheets
Cost Connectors, e.g. Funnel.io
21. 1. Navigate to Data transfers
Set up a Big Query Ads Transfer in One Minute
2. Configure the Transfer details
22. 2. Storage & Transfer
Get Your GA4 Data into BQ in Two Minutes
1. BQ Linking in Admin UI
25. Suggestions for Interesting Data Sources
Domain Data Source Available in Data Loaders?
SEO Google Search Console
SEO Pagespeed Insights & Lighthouse
SEO Google Bot Logfiles
Ecom Inventory Data & Attributes
Ecom Trusted Shops Reviews
Ecom Awin Open Orders
Social Instagram
Social Facebook
.... ...
26. 2. Storage & Transfer
The Best Custom Way to Ingest Data into BQ
Data Source 3. BigQuery
1. Cloud Function
Data Transfer handles ingest job
Observability via alerts
Data Fetch with Python and Pandas
27. How Do We Batch-Ingest New Data?
Change Data Capture
Snapshots Copies
Full History Daily State
very easy
For <300k rows
rather easy
duplicate rows
storage efficient
complex architecture
31. Price Discounts Data Model Sources:
date
sku
detail_views
product order value
ga sessions
date
sku
price
sale_price
diff_price_to_sale
diff_price_to_sale_grouped
products
35. Category Rev & Ratings Data Model Sources:
date
sku
top_category
product order value
ga sessions
date
sku
rating_count
rating_count_grouped
products ratings
54. What is Operational Analytics for Bergzeit?
ML
Products
Profit Bidding
Rule-based
Products
Data Uploads Attribution Model
Updating Affiliate Sales
55. Case: Upload Your Shopping Feed Every 10 Min
2. Cloud Function
with 15 lines of code
3. Schedule
Cronjob
1. Get GCS Bucket
Name
Code samples: https://gist.github.com/ChrisGutknecht/fde93092e21039299ab76715596eac01
56. Case: Profit Bidding & Report
More Details: https://www.slideshare.net/ChristopherGutknecht/gross-profit-bidding-for-ecommerce-smx-virtual-2021
65. How Can We Generate Value? Focus on Actions
1. Define Actions 3. Factors
2. Success Metrics
What Will You
Do Differently If
You Have the
Data?
What Would
Success in
Metrics Look
Like?
Which Factors
Influence
Success?
4. Tests
How Can We
Test Actions on
These Factors?
66. Who Should Be Your Data Hire?
1. Focus:
SQL & Warehouse
3. Focus: ML Models
2. Focus: Data Pipelines
67. Your Takeaways from this Session
1. When and why you should invest in a Marketing DWH
3. How to explore use cases by combining data sources
5. Design outcome-oriented questions for analytics projects
2. Learn the data ecosystem and the benefits of BigQuery
4. Many tactical tips for daily use
68. Thanks for Your Time.
Looking Forward To Questions!
Chris Gutknecht | Teamlead A&O | Hiring a PPC!
70. Data Warehouse vs Data Lake
Structured Data
Table Schemas
Transactions
Sharded Files
Unstructured Data
Lakehouse
71. Why Focus On Google Cloud & Big Query?
Market Leader in Data Analytics*
Free Google Data Connectors
Seamless low-tech scaling
source:: Forrester Research 2021
72. The Best Cloud Data Warehouse? It Depends
Source: https://medium.com/pocket-gems/a-comparative-analysis-between-bigquery-redshift-and-snowflake-8d194fdf5693
Google Data Sources BigQuery = Google Cloud
73. How often do we Ingest Data?
Real-Time
Stream Processing
Batch Processing
or