SlideShare une entreprise Scribd logo
1  sur  23
Télécharger pour lire hors ligne
Reelbid
Real time ad bidding DSP
( “programmatic”)
MUDIT UPPAL
INSIGHT DATA ENGINEERING, SV
What is Real Time Bidding(RTB)?
Image source:
Demand side platform
- ‘Reelbid’
Mashable.com
(publisher with
inventory)
Sell Side Platform
OpenRTB
server
Auction
service
Ad
exchange
OpenRTB
client
Bidder
service
Data management
platform
Data broker
Some
advertiser
Motivation
● Really interesting & new* data engineering challenges
Current State of the art
Google’s open-bidder still in alpha(not for public)
“In 2014, according to Business Insider Intelligence, ad revenue topped $15 billion. Real-time bidding, and in
particular, mobile and video real-time bidding, lead the way for that growth. Business Insider Intelligence estimates
say RTB revenue will pass $26 billion by the end of 2020, which far surpasses the $11.7 billion from this year.”
( *Disclaimer: NO background in Ad tech )
Latency ( 100 ms )
Queries Per Second (~2
- 3 million bids/second)
+ Infrequent bids
Some Sample
AdExchange
Reelbid
1. Hey do you want
to bid on this user’s
impression?:
2. Yes! Here’s my bid:
100 ms
"id" : "32a69c6ba388f110487f9d1e63f77b22d86e916b",
"banner": {
"h": 250,
"w": 300,
"name": "mashable.com",
"domain": "http://www.example.com",
…..
"page": "http://easy.example.com/easy?cu=13824;
cre=mu;target=_blank",
"ref" : "http://refer+url",
"publisher": {
"id": "qqwer1234xgfd",
"name": "site_name",
"domain": "my.site.com"
"device": {
"ua": "Mozilla/5.0(KHTML,
like Gecko) Version/5.1.7 Safari/534.
57.2",
"ip": "192.168.5.5",
"geo": {
"lat": 37.789,
"lon": -122.394,
"country": "USA",
"city": "San Francisco",
"region": "CA",
"zip" : "94105",
"type": 2
}
"user": {
"buyeruid" :
"89776897686798fwe87rtryt8976fsd7869678",
"id":
"55816b39711f9b5acf3b90e313ed29e51665623f",
"gender": "M",
"yob": 1975,
"customdata": "Data-asdfdwerewr",
"data": [{
"id": "pub-demographics",
"name": "data_name",
"segment": [{
"id" : "345qw245wfrtgwertrt56765wert",
"name" : "segment_name",
"value": "segment_value
}
{
"id": "eb85349d-03c3-44f4-a77b-
824f7221d116",
"seatbid": [{
"bid": [{
"id": "bid1",
"impid": "eb85349d-03c3-44f4-a77b-
824f7221d116",
"price": 0.1,
"adm": "<div>Ad Creative</div>",
"adomain": [
"http://www.example.com/clickthrough"
Goals
- Create a more efficient bidding framework using sampling and hierarchical
techniques (as described in the paper)
- Ways to Scale without adding nodes and save costs
Demo
A different approach to RTB
● A paper published by in IEEE for Data Mining--
(source of truth)
● Instead of comparing bid requests with user
profile<U,P> we look at bid requests only.
● Compensate computational resources with price
(and not accept and bid on everything)
● ROI with Gaussian/Gamma models are much better
Architecture 1.0
Architecture 2.0
11 2
3
How do we handle 2-3 million QPS?
(efficient scaling)
Sampling!
Ziggurat Algorithm for sampling random values within milli seconds (also most
memory efficient )
How do we decide the bid price?
Y < f(x)
Cost(Budget) Resources
1. Looking at overall compute resources
2. User input filtering
Algorithm
Request selection Based on Utilization Selection policy based on reward models
Filter 1
Filter 2
Filter 3
Collecting server system metrics
- Graphite
- Carbon | StatsD | Collectd
- Redis
Cost + ROI
Testing
- Event-sim
- Parallec io (ebay) [8K async HTTP requests per second]
- Smaato AdX
Challenges
- To process that file in 80 ms
- Throttling issues
- Networking challenges
- Programming/debugging/testing in a distributed environment is a very time-
consuming task
About me
Surf, Football and micro-controllers [ Building products ]
Data Scientist, Planetary/Fusion Network Ltd (NY)
MS Comp Science, Media, Business Mgmt - University of California, Santa Barbara
MUDIT UPPAL
Learning at Insight: Learning by doing
Before Insight: Batch processing, Data Science Analytics
After Insight: Stream processing [ Spark streaming, Storm, Kafka, Cassandra ] +
learning from feedback, the right tool for the right job, More debugging/testing in a
distributed computing with polyglot approach + Thinking asynchronously

Contenu connexe

Similaire à Reel bid insightd-eproject

June 2014 HUG: Interactive analytics over hadoop
June 2014 HUG: Interactive analytics over hadoopJune 2014 HUG: Interactive analytics over hadoop
June 2014 HUG: Interactive analytics over hadoop
Yahoo Developer Network
 
The Yin and Yang of Software
The Yin and Yang of SoftwareThe Yin and Yang of Software
The Yin and Yang of Software
elliando dias
 
Interactive Analytics in Human Time
Interactive Analytics in Human TimeInteractive Analytics in Human Time
Interactive Analytics in Human Time
DataWorks Summit
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick Gorski
Hakka Labs
 
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
Jun Wang
 

Similaire à Reel bid insightd-eproject (20)

If you are not paying for it, you are the product: How much do advertisers p...
 If you are not paying for it, you are the product: How much do advertisers p... If you are not paying for it, you are the product: How much do advertisers p...
If you are not paying for it, you are the product: How much do advertisers p...
 
June 2014 HUG: Interactive analytics over hadoop
June 2014 HUG: Interactive analytics over hadoopJune 2014 HUG: Interactive analytics over hadoop
June 2014 HUG: Interactive analytics over hadoop
 
IBM Z for the Digital Enterprise 2018 - Z Keynote
IBM Z for the Digital Enterprise 2018 - Z KeynoteIBM Z for the Digital Enterprise 2018 - Z Keynote
IBM Z for the Digital Enterprise 2018 - Z Keynote
 
Machine Learning for Computational Advertising
Machine Learning for Computational AdvertisingMachine Learning for Computational Advertising
Machine Learning for Computational Advertising
 
The Yin and Yang of Software
The Yin and Yang of SoftwareThe Yin and Yang of Software
The Yin and Yang of Software
 
Interactive Analytics in Human Time
Interactive Analytics in Human TimeInteractive Analytics in Human Time
Interactive Analytics in Human Time
 
Criteo TektosData Meetup
Criteo TektosData MeetupCriteo TektosData Meetup
Criteo TektosData Meetup
 
Building Value - Understanding the TCO and ROI of Apache Kafka & Confluent
Building Value  - Understanding the TCO and ROI of Apache Kafka & ConfluentBuilding Value  - Understanding the TCO and ROI of Apache Kafka & Confluent
Building Value - Understanding the TCO and ROI of Apache Kafka & Confluent
 
RTBkit Introduction & Best Practices
RTBkit Introduction & Best PracticesRTBkit Introduction & Best Practices
RTBkit Introduction & Best Practices
 
Citihub IDC Event 2009 Cloud Mark Ellis
Citihub IDC Event 2009 Cloud Mark EllisCitihub IDC Event 2009 Cloud Mark Ellis
Citihub IDC Event 2009 Cloud Mark Ellis
 
Global C4IR-1 Masterclass Adryan - Zuehlke Engineering 2017
Global C4IR-1 Masterclass Adryan - Zuehlke Engineering 2017Global C4IR-1 Masterclass Adryan - Zuehlke Engineering 2017
Global C4IR-1 Masterclass Adryan - Zuehlke Engineering 2017
 
Maximize Big Data ROI via Best of Breed Patterns and Practices
Maximize Big Data ROI via Best of Breed Patterns and PracticesMaximize Big Data ROI via Best of Breed Patterns and Practices
Maximize Big Data ROI via Best of Breed Patterns and Practices
 
Fast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick GorskiFast Data Driving Personalization - Nick Gorski
Fast Data Driving Personalization - Nick Gorski
 
OEM to ISF Transformation (Final)
OEM to ISF Transformation (Final)OEM to ISF Transformation (Final)
OEM to ISF Transformation (Final)
 
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
A Brief Introduction of Real-time Bidding Display Advertising and Evaluation ...
 
Customer Experience Management for CSPs
Customer Experience Management for CSPsCustomer Experience Management for CSPs
Customer Experience Management for CSPs
 
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
From 6 hours to 1 minute... in 2 days! How we managed to stream our (long) Ha...
 
Big Data Berlin - Criteo
Big Data Berlin - CriteoBig Data Berlin - Criteo
Big Data Berlin - Criteo
 
The Lyft data platform: Now and in the future
The Lyft data platform: Now and in the futureThe Lyft data platform: Now and in the future
The Lyft data platform: Now and in the future
 
Lyft data Platform - 2019 slides
Lyft data Platform - 2019 slidesLyft data Platform - 2019 slides
Lyft data Platform - 2019 slides
 

Dernier

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Reel bid insightd-eproject

  • 1. Reelbid Real time ad bidding DSP ( “programmatic”) MUDIT UPPAL INSIGHT DATA ENGINEERING, SV
  • 2. What is Real Time Bidding(RTB)? Image source:
  • 3.
  • 4. Demand side platform - ‘Reelbid’ Mashable.com (publisher with inventory) Sell Side Platform OpenRTB server Auction service Ad exchange OpenRTB client Bidder service Data management platform Data broker Some advertiser
  • 5. Motivation ● Really interesting & new* data engineering challenges Current State of the art Google’s open-bidder still in alpha(not for public) “In 2014, according to Business Insider Intelligence, ad revenue topped $15 billion. Real-time bidding, and in particular, mobile and video real-time bidding, lead the way for that growth. Business Insider Intelligence estimates say RTB revenue will pass $26 billion by the end of 2020, which far surpasses the $11.7 billion from this year.” ( *Disclaimer: NO background in Ad tech ) Latency ( 100 ms ) Queries Per Second (~2 - 3 million bids/second) + Infrequent bids
  • 6. Some Sample AdExchange Reelbid 1. Hey do you want to bid on this user’s impression?: 2. Yes! Here’s my bid: 100 ms "id" : "32a69c6ba388f110487f9d1e63f77b22d86e916b", "banner": { "h": 250, "w": 300, "name": "mashable.com", "domain": "http://www.example.com", ….. "page": "http://easy.example.com/easy?cu=13824; cre=mu;target=_blank", "ref" : "http://refer+url", "publisher": { "id": "qqwer1234xgfd", "name": "site_name", "domain": "my.site.com" "device": { "ua": "Mozilla/5.0(KHTML, like Gecko) Version/5.1.7 Safari/534. 57.2", "ip": "192.168.5.5", "geo": { "lat": 37.789, "lon": -122.394, "country": "USA", "city": "San Francisco", "region": "CA", "zip" : "94105", "type": 2 } "user": { "buyeruid" : "89776897686798fwe87rtryt8976fsd7869678", "id": "55816b39711f9b5acf3b90e313ed29e51665623f", "gender": "M", "yob": 1975, "customdata": "Data-asdfdwerewr", "data": [{ "id": "pub-demographics", "name": "data_name", "segment": [{ "id" : "345qw245wfrtgwertrt56765wert", "name" : "segment_name", "value": "segment_value } { "id": "eb85349d-03c3-44f4-a77b- 824f7221d116", "seatbid": [{ "bid": [{ "id": "bid1", "impid": "eb85349d-03c3-44f4-a77b- 824f7221d116", "price": 0.1, "adm": "<div>Ad Creative</div>", "adomain": [ "http://www.example.com/clickthrough"
  • 7. Goals - Create a more efficient bidding framework using sampling and hierarchical techniques (as described in the paper) - Ways to Scale without adding nodes and save costs
  • 9.
  • 10. A different approach to RTB ● A paper published by in IEEE for Data Mining-- (source of truth) ● Instead of comparing bid requests with user profile<U,P> we look at bid requests only. ● Compensate computational resources with price (and not accept and bid on everything) ● ROI with Gaussian/Gamma models are much better
  • 13. How do we handle 2-3 million QPS? (efficient scaling)
  • 14. Sampling! Ziggurat Algorithm for sampling random values within milli seconds (also most memory efficient )
  • 15. How do we decide the bid price?
  • 16. Y < f(x) Cost(Budget) Resources 1. Looking at overall compute resources 2. User input filtering
  • 17. Algorithm Request selection Based on Utilization Selection policy based on reward models Filter 1 Filter 2 Filter 3
  • 18. Collecting server system metrics - Graphite - Carbon | StatsD | Collectd - Redis
  • 20. Testing - Event-sim - Parallec io (ebay) [8K async HTTP requests per second] - Smaato AdX
  • 21. Challenges - To process that file in 80 ms - Throttling issues - Networking challenges - Programming/debugging/testing in a distributed environment is a very time- consuming task
  • 22. About me Surf, Football and micro-controllers [ Building products ] Data Scientist, Planetary/Fusion Network Ltd (NY) MS Comp Science, Media, Business Mgmt - University of California, Santa Barbara MUDIT UPPAL
  • 23. Learning at Insight: Learning by doing Before Insight: Batch processing, Data Science Analytics After Insight: Stream processing [ Spark streaming, Storm, Kafka, Cassandra ] + learning from feedback, the right tool for the right job, More debugging/testing in a distributed computing with polyglot approach + Thinking asynchronously