SlideShare une entreprise Scribd logo
1  sur  18
 BIG	
  DATA	
  ECOSYSTEM	
  AT	
  
INMOBI	
  

Sharad	
  Agarwal	
  

Sharad Agarwal

Nasscom ATC 2013
Technology and Product have led to InMobi being recognized by
MIT as one of the Top 50 Disruptive Companies for 2013

2	
  
InMobi Global Reach And Scale

3	
  
Data	
  Sciences	
  
Decision Making by
Machines

Infrastructure	
  Scaling	
  
Decision Making By
Humans
Reports

Agile Reports & Analytics
Increasing Value

Data Driven Business
Decisions

Leveraging Data

Data Driven Systems

4	
  
Optimization

§ 
§ 

Campaign Delivery
Marketplace Health

Business Metrics

§ 
§ 
§ 

Adoption Metrics
Product Performance Metrics and Debugging
Planning and Strategy – Demand, Supply and others

Exploration of new
opportunities

§ 

New Product / Feature Ideas

Data Driven Decision Making
Prediction
Prediction

§ 
§ 
§ 

Prediction of Click through Rates and Conversion Rates
Forecasting and Planning – Inventory / Burn
Risk Mitigation and Management – Overburn / Fraud

Recommendation
Recommendation

§ 
§ 
§ 

App Recommendation Engine
Dynamic Personalization of Creatives
Bid Budget Recommendation

Targeting

§ 
§ 
§ 
§ 

Audience Segment based Targeting
Geo and Hyper local Targeting
Contextual Targeting
Look Alike Modelling

Pricing

§ 
§ 
§ 

Conversion Based Pricing
Engagement based Pricing
Determining the value of Supply

Data Sciences Driven Systems

6
1

Access	
  to	
  Data	
  

2

Ability	
  to	
  Process	
  

3

Ability	
  to	
  U@lize	
  

7	
  
Curate
Reporting &
Analytics

Ingest
Data
Ingestion

Normalize

Data Systems

Analyze

Store

Data Flow

Data
Consumption

Feedback ->
To power
products

8
Commoditize Data Access And Processing

By Providing Rich Abstractions

Design: Data Platform Goal

9
APLICATIONS	
  

DASHBOARD	
  
SDK	
  

DATA	
  INGESTION	
  
	
  
CONDUIT	
  +	
  PINTAIL	
  
	
  

DATA	
  MGMT	
  
	
  
FALCON	
  
	
  
	
  

	
  

ANALYTICS	
  
	
  
GRILL	
  

Signals	
  

Ac3onable	
  
Insights	
  

InMobi	
  Big	
  Data	
  Pla=orms	
  
STORM	
  
Hosted/On-­‐Premise	
  	
  Cloud(Public/Private)	
  

DATA	
  
INFRASTRUCTURE	
  
Server	
  
Infrastructure	
  
Collect signals – streaming, batch, multi-site
At Scale In Real Time

Conduit + PinTail

1
1	
  
DC1	
  Producers	
  
A_part1	
  

B_part1	
  

DC2	
  Producers	
  
A_part2	
  

DC3	
  Producers	
  
B_part3	
  

Control	
  Flow	
  

A	
  

DC1	
  Consumers	
  

B	
  

DC2	
  Consumers	
  

A	
  

B	
  

Data	
  Flow	
  

DC3	
  Consumers	
  
InMobi Incubated Its Hadoop Data
Management Project in Apache

Apache Falcon

1
3	
  
Apache Falcon
Adhoc Reporting on Logical Cube Abstraction
Across Heterogeneous Storages

GRILL

1
5	
  
GRILL: Query on Cube using HQL

1
6	
  
8 Bn

240 TB

Hbase Read-Write throughputs
per day

Amount of data read / written
by systems in a day

1+ PB Storage
10 Bn
Hadoop cluster

175 K

Raw events per day

Hadoop Jobs per day

InMobi and Big Data – Metrics

17
 
sharad@apache.org	
  
@sharad_ag	
  
	
  
Bangalore	
  Hadoop	
  
Meetup	
  

Thank You

18	
  

Contenu connexe

Tendances

Graph Gurus Episode 37: Modeling for Kaggle COVID-19 Dataset
Graph Gurus Episode 37: Modeling for Kaggle COVID-19 DatasetGraph Gurus Episode 37: Modeling for Kaggle COVID-19 Dataset
Graph Gurus Episode 37: Modeling for Kaggle COVID-19 DatasetTigerGraph
 
Getting More Value from your Cloud Data Assets
Getting More Value from your Cloud Data AssetsGetting More Value from your Cloud Data Assets
Getting More Value from your Cloud Data AssetsEliot Arnold
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big dataRaul Chong
 
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise WeAreEsynergy
 
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike GualtieriSpark Summit
 
The Interesting IoT: Digitizing Operations
The Interesting IoT: Digitizing OperationsThe Interesting IoT: Digitizing Operations
The Interesting IoT: Digitizing OperationsGordon Haff
 
Graph + AI World Opening Keynote
Graph + AI World Opening KeynoteGraph + AI World Opening Keynote
Graph + AI World Opening KeynoteTigerGraph
 
Business of iot_mongodb_spark
Business of iot_mongodb_sparkBusiness of iot_mongodb_spark
Business of iot_mongodb_sparkMat Keep
 
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...Databricks
 
UNIFi and HavasMedia Case Study - Creating New Customer Value with Data
UNIFi and HavasMedia Case Study - Creating New Customer Value with DataUNIFi and HavasMedia Case Study - Creating New Customer Value with Data
UNIFi and HavasMedia Case Study - Creating New Customer Value with DataUNIFI Software
 
Building an accurate understanding of consumers based on real-world signals
Building an accurate understanding of consumers based on real-world signalsBuilding an accurate understanding of consumers based on real-world signals
Building an accurate understanding of consumers based on real-world signalsTigerGraph
 
Graph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at ScaleGraph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at ScaleTigerGraph
 
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...TigerGraph
 
Graph+AI for Fin. Services
Graph+AI for Fin. ServicesGraph+AI for Fin. Services
Graph+AI for Fin. ServicesTigerGraph
 
Ensuring compliance of patient data with big data
Ensuring compliance of patient data with big dataEnsuring compliance of patient data with big data
Ensuring compliance of patient data with big dataAyad Shammout
 
Using Kafka in Your Organization with Real-Time User Insights for a Customer ...
Using Kafka in Your Organization with Real-Time User Insights for a Customer ...Using Kafka in Your Organization with Real-Time User Insights for a Customer ...
Using Kafka in Your Organization with Real-Time User Insights for a Customer ...confluent
 
Fraud prevention is better with TigerGraph inside
Fraud prevention is better with  TigerGraph insideFraud prevention is better with  TigerGraph inside
Fraud prevention is better with TigerGraph insideTigerGraph
 
Big data competitive landscape overview
Big data competitive landscape overviewBig data competitive landscape overview
Big data competitive landscape overviewBisakha Praharaj
 
Datahive 360 - Felipe Wesbonk
Datahive 360 - Felipe WesbonkDatahive 360 - Felipe Wesbonk
Datahive 360 - Felipe WesbonkImmelda Oord
 

Tendances (20)

Graph Gurus Episode 37: Modeling for Kaggle COVID-19 Dataset
Graph Gurus Episode 37: Modeling for Kaggle COVID-19 DatasetGraph Gurus Episode 37: Modeling for Kaggle COVID-19 Dataset
Graph Gurus Episode 37: Modeling for Kaggle COVID-19 Dataset
 
Getting More Value from your Cloud Data Assets
Getting More Value from your Cloud Data AssetsGetting More Value from your Cloud Data Assets
Getting More Value from your Cloud Data Assets
 
02 a holistic approach to big data
02 a holistic approach to big data02 a holistic approach to big data
02 a holistic approach to big data
 
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise Steve Jenkins - Business Opportunities for Big Data in the Enterprise
Steve Jenkins - Business Opportunities for Big Data in the Enterprise
 
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri 5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
5 Reasons Enterprise Adoption of Spark is Unstoppable by Mike Gualtieri
 
The Interesting IoT: Digitizing Operations
The Interesting IoT: Digitizing OperationsThe Interesting IoT: Digitizing Operations
The Interesting IoT: Digitizing Operations
 
Graph + AI World Opening Keynote
Graph + AI World Opening KeynoteGraph + AI World Opening Keynote
Graph + AI World Opening Keynote
 
Business of iot_mongodb_spark
Business of iot_mongodb_sparkBusiness of iot_mongodb_spark
Business of iot_mongodb_spark
 
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
How a Media Data Platform Drives Real-time Insights & Analytics using Apache ...
 
UNIFi and HavasMedia Case Study - Creating New Customer Value with Data
UNIFi and HavasMedia Case Study - Creating New Customer Value with DataUNIFi and HavasMedia Case Study - Creating New Customer Value with Data
UNIFi and HavasMedia Case Study - Creating New Customer Value with Data
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Building an accurate understanding of consumers based on real-world signals
Building an accurate understanding of consumers based on real-world signalsBuilding an accurate understanding of consumers based on real-world signals
Building an accurate understanding of consumers based on real-world signals
 
Graph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at ScaleGraph-Based Identity Resolution at Scale
Graph-Based Identity Resolution at Scale
 
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
How to Build An AI Based Customer Data Platform: Learn the design patterns fo...
 
Graph+AI for Fin. Services
Graph+AI for Fin. ServicesGraph+AI for Fin. Services
Graph+AI for Fin. Services
 
Ensuring compliance of patient data with big data
Ensuring compliance of patient data with big dataEnsuring compliance of patient data with big data
Ensuring compliance of patient data with big data
 
Using Kafka in Your Organization with Real-Time User Insights for a Customer ...
Using Kafka in Your Organization with Real-Time User Insights for a Customer ...Using Kafka in Your Organization with Real-Time User Insights for a Customer ...
Using Kafka in Your Organization with Real-Time User Insights for a Customer ...
 
Fraud prevention is better with TigerGraph inside
Fraud prevention is better with  TigerGraph insideFraud prevention is better with  TigerGraph inside
Fraud prevention is better with TigerGraph inside
 
Big data competitive landscape overview
Big data competitive landscape overviewBig data competitive landscape overview
Big data competitive landscape overview
 
Datahive 360 - Felipe Wesbonk
Datahive 360 - Felipe WesbonkDatahive 360 - Felipe Wesbonk
Datahive 360 - Felipe Wesbonk
 

En vedette

Apache Lens at Hadoop meetup
Apache Lens at Hadoop meetupApache Lens at Hadoop meetup
Apache Lens at Hadoop meetupamarsri
 
Amber and sians planning
Amber and sians planningAmber and sians planning
Amber and sians planningAmber_
 
Mckenzie Lawrence
Mckenzie LawrenceMckenzie Lawrence
Mckenzie Lawrenceadubose
 
Will Baughn
Will BaughnWill Baughn
Will Baughnadubose
 
Get the most out of your accessibility expert
Get the most out of your accessibility expertGet the most out of your accessibility expert
Get the most out of your accessibility expertOlivier Nourry
 
Trabajo practico nº14
Trabajo practico nº14Trabajo practico nº14
Trabajo practico nº14sscida
 
脉轮能量书ⅰ
脉轮能量书ⅰ脉轮能量书ⅰ
脉轮能量书ⅰguxianbang
 
Train Your Brain Dr. Farhana Shaheen
Train Your Brain Dr. Farhana ShaheenTrain Your Brain Dr. Farhana Shaheen
Train Your Brain Dr. Farhana ShaheenFarhana Shaheen
 
Callie Hodge
Callie HodgeCallie Hodge
Callie Hodgeadubose
 
Caleb Dean
Caleb DeanCaleb Dean
Caleb Deanadubose
 
John Perry
John PerryJohn Perry
John Perryadubose
 
Tyler Butler
Tyler ButlerTyler Butler
Tyler Butleradubose
 
Derivatives in graphing-dfs
Derivatives in graphing-dfsDerivatives in graphing-dfs
Derivatives in graphing-dfsFarhana Shaheen
 
Mya Wilson
Mya WilsonMya Wilson
Mya Wilsonadubose
 
Avery Powers
Avery PowersAvery Powers
Avery Powersadubose
 
How to sell 3 million widgets, guaranteed!
How to sell 3 million widgets, guaranteed!How to sell 3 million widgets, guaranteed!
How to sell 3 million widgets, guaranteed!Jaimes Nel
 
Savannah Richardson
Savannah RichardsonSavannah Richardson
Savannah Richardsonadubose
 

En vedette (20)

Apache Lens at Hadoop meetup
Apache Lens at Hadoop meetupApache Lens at Hadoop meetup
Apache Lens at Hadoop meetup
 
Opening gCS
Opening gCSOpening gCS
Opening gCS
 
Amber and sians planning
Amber and sians planningAmber and sians planning
Amber and sians planning
 
Mckenzie Lawrence
Mckenzie LawrenceMckenzie Lawrence
Mckenzie Lawrence
 
Will Baughn
Will BaughnWill Baughn
Will Baughn
 
Get the most out of your accessibility expert
Get the most out of your accessibility expertGet the most out of your accessibility expert
Get the most out of your accessibility expert
 
Trabajo practico nº14
Trabajo practico nº14Trabajo practico nº14
Trabajo practico nº14
 
脉轮能量书ⅰ
脉轮能量书ⅰ脉轮能量书ⅰ
脉轮能量书ⅰ
 
Train Your Brain Dr. Farhana Shaheen
Train Your Brain Dr. Farhana ShaheenTrain Your Brain Dr. Farhana Shaheen
Train Your Brain Dr. Farhana Shaheen
 
Beauty of numbers
Beauty of numbersBeauty of numbers
Beauty of numbers
 
Callie Hodge
Callie HodgeCallie Hodge
Callie Hodge
 
Caleb Dean
Caleb DeanCaleb Dean
Caleb Dean
 
John Perry
John PerryJohn Perry
John Perry
 
Tyler Butler
Tyler ButlerTyler Butler
Tyler Butler
 
Derivatives in graphing-dfs
Derivatives in graphing-dfsDerivatives in graphing-dfs
Derivatives in graphing-dfs
 
Mya Wilson
Mya WilsonMya Wilson
Mya Wilson
 
Avery Powers
Avery PowersAvery Powers
Avery Powers
 
How to sell 3 million widgets, guaranteed!
How to sell 3 million widgets, guaranteed!How to sell 3 million widgets, guaranteed!
How to sell 3 million widgets, guaranteed!
 
Mean median mode_range
Mean median mode_rangeMean median mode_range
Mean median mode_range
 
Savannah Richardson
Savannah RichardsonSavannah Richardson
Savannah Richardson
 

Similaire à Big Data Ecosystem at InMobi, Nasscom ATC 2013 Noida

Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixNicolas Morales
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Pactera_US
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise DataWorks Summit
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Jeffrey T. Pollock
 
Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreTrendwise Analytics
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data BSP Media Group
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Yellowfin
 
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014MapR Technologies
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Denodo
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesJames Serra
 
BIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsBIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsSkillspeed
 
Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsRick Perret
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo
 
Key Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShareKey Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShareMapR Technologies
 

Similaire à Big Data Ecosystem at InMobi, Nasscom ATC 2013 Noida (20)

Big Data and Analytics
Big Data and AnalyticsBig Data and Analytics
Big Data and Analytics
 
Big Data and Analytics
Big Data and AnalyticsBig Data and Analytics
Big Data and Analytics
 
Getting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with BluemixGetting started with Hadoop on the Cloud with Bluemix
Getting started with Hadoop on the Cloud with Bluemix
 
Hybrid Cloud Strategy for Big Data and Analytics
Hybrid Cloud Strategy for Big Data and Analytics Hybrid Cloud Strategy for Big Data and Analytics
Hybrid Cloud Strategy for Big Data and Analytics
 
Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data Using Visualization to Succeed with Big Data
Using Visualization to Succeed with Big Data
 
Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise
 
Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!Klarna Tech Talk - Mind the Data!
Klarna Tech Talk - Mind the Data!
 
Hadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and MoreHadoop,Big Data Analytics and More
Hadoop,Big Data Analytics and More
 
Capturing big value in big data
Capturing big value in big data Capturing big value in big data
Capturing big value in big data
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
 
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
 
Overview - IBM Big Data Platform
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Big Data: It’s all about the Use Cases
Big Data: It’s all about the Use CasesBig Data: It’s all about the Use Cases
Big Data: It’s all about the Use Cases
 
BIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in LogisticsBIG Data & Hadoop Applications in Logistics
BIG Data & Hadoop Applications in Logistics
 
Why Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & Analytics
 
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEnWCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
 
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
Denodo Partner Connect: A Review of the Top 5 Differentiated Use Cases for th...
 
Key Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShareKey Considerations for Putting Hadoop in Production SlideShare
Key Considerations for Putting Hadoop in Production SlideShare
 

Dernier

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 

Dernier (20)

SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 

Big Data Ecosystem at InMobi, Nasscom ATC 2013 Noida

  • 1.  BIG  DATA  ECOSYSTEM  AT   INMOBI   Sharad  Agarwal   Sharad Agarwal Nasscom ATC 2013
  • 2. Technology and Product have led to InMobi being recognized by MIT as one of the Top 50 Disruptive Companies for 2013 2  
  • 3. InMobi Global Reach And Scale 3  
  • 4. Data  Sciences   Decision Making by Machines Infrastructure  Scaling   Decision Making By Humans Reports Agile Reports & Analytics Increasing Value Data Driven Business Decisions Leveraging Data Data Driven Systems 4  
  • 5. Optimization §  §  Campaign Delivery Marketplace Health Business Metrics §  §  §  Adoption Metrics Product Performance Metrics and Debugging Planning and Strategy – Demand, Supply and others Exploration of new opportunities §  New Product / Feature Ideas Data Driven Decision Making
  • 6. Prediction Prediction §  §  §  Prediction of Click through Rates and Conversion Rates Forecasting and Planning – Inventory / Burn Risk Mitigation and Management – Overburn / Fraud Recommendation Recommendation §  §  §  App Recommendation Engine Dynamic Personalization of Creatives Bid Budget Recommendation Targeting §  §  §  §  Audience Segment based Targeting Geo and Hyper local Targeting Contextual Targeting Look Alike Modelling Pricing §  §  §  Conversion Based Pricing Engagement based Pricing Determining the value of Supply Data Sciences Driven Systems 6
  • 7. 1 Access  to  Data   2 Ability  to  Process   3 Ability  to  U@lize   7  
  • 9. Commoditize Data Access And Processing By Providing Rich Abstractions Design: Data Platform Goal 9
  • 10. APLICATIONS   DASHBOARD   SDK   DATA  INGESTION     CONDUIT  +  PINTAIL     DATA  MGMT     FALCON         ANALYTICS     GRILL   Signals   Ac3onable   Insights   InMobi  Big  Data  Pla=orms   STORM   Hosted/On-­‐Premise    Cloud(Public/Private)   DATA   INFRASTRUCTURE   Server   Infrastructure  
  • 11. Collect signals – streaming, batch, multi-site At Scale In Real Time Conduit + PinTail 1 1  
  • 12. DC1  Producers   A_part1   B_part1   DC2  Producers   A_part2   DC3  Producers   B_part3   Control  Flow   A   DC1  Consumers   B   DC2  Consumers   A   B   Data  Flow   DC3  Consumers  
  • 13. InMobi Incubated Its Hadoop Data Management Project in Apache Apache Falcon 1 3  
  • 15. Adhoc Reporting on Logical Cube Abstraction Across Heterogeneous Storages GRILL 1 5  
  • 16. GRILL: Query on Cube using HQL 1 6  
  • 17. 8 Bn 240 TB Hbase Read-Write throughputs per day Amount of data read / written by systems in a day 1+ PB Storage 10 Bn Hadoop cluster 175 K Raw events per day Hadoop Jobs per day InMobi and Big Data – Metrics 17
  • 18.   sharad@apache.org   @sharad_ag     Bangalore  Hadoop   Meetup   Thank You 18