SlideShare une entreprise Scribd logo
1  sur  77
Apache Phoenix:
Transforming HBase into a
SQL database
James Taylor
@JamesPlusPlus
http://phoenix.apache.org
About me
Completed
o Architect at Salesforce.com in Big Data group
o Started Phoenix as internal project ~3 years ago
o Open-source on Github ~1.5 years ago
o Apache incubator for ~5 months
o Graduated as Top Level Project in May 2014
o Engineer at BEA Systems
o XQuery-based federated query engine
o SQL-based complex event processing engine
Agenda
Completed
o What is Apache Phoenix?
o Why is it so fast?
o How does it help HBase scale?
o Roadmap
o Q&A
What is Apache Phoenix?
Completed
What is Apache Phoenix?
Completed
1. Turns HBase into a SQL database
o Query Engine
o MetaData Repository
o Embedded JDBC driver
o Only for HBase data
What is Apache Phoenix?
Completed
2. Fastest way to access HBase data
o HBase-specific push down
o Compiles queries into native
HBase calls (no map-reduce)
o Executes scans in parallel
Completed
SELECT * FROM t WHERE k IN (?,?,?)
Phoenix Stinger (Hive 0.11)
0.04 sec 280 sec
* 110M row table
7,000x faster
What is Apache Phoenix?
Completed
3. Lightweight
o No additional servers required
o 100% Java
o Included in HDP 2.1 distribution
o Available in Amazon EMR
o Otherwise copy Phoenix jar into
HBase lib directory on each RS
HBase Cluster Architecture
HBase Cluster Architecture
Phoenix
HBase Cluster Architecture
Phoenix
Phoenix
What is Apache Phoenix?
Completed
4. Integration-friendly
o Map to existing HBase table
o Integrate with Apache Pig
o Integrate with Apache Flume
o Integrate with Apache Sqoop (wip)
What is Apache Phoenix?
Completed
1. Turns HBase into a SQL database
2. Fastest way to access HBase data
3. Lightweight
4. Integration-friendly
Why is Phoenix so fast?
Completed
Why is Phoenix so fast?
Completed
1. HBase
o Fast, but “dumb” (on purpose)
2. Data model
o Support for composite primary key
o Binary data sorts naturally
3. Client-side parallelization
4. Push down
o Custom filters and coprocessors
Phoenix Data Model
HBase Table
Phoenix maps HBase data model to the relational world
Phoenix Data Model
HBase Table
Column Family A Column Family B
Phoenix maps HBase data model to the relational world
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Phoenix maps HBase data model to the relational world
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Phoenix maps HBase data model to the relational world
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Phoenix maps HBase data model to the relational world
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Row Key 3 KeyValue
Phoenix maps HBase data model to the relational world
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 Value
Row Key 2 Value Value
Row Key 3 Value
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Row Key 3 KeyValue
Phoenix maps HBase data model to the relational world
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 Value
Row Key 2 Value Value
Row Key 3 Value
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 Value
Row Key 2 Value Value
Row Key 3 Value
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Row Key 3 KeyValue
Phoenix maps HBase data model to the relational world
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 Value
Row Key 2 Value Value
Row Key 3 Value
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 Value
Row Key 2 Value Value
Row Key 3 Value
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Row Key 3 KeyValue
Phoenix maps HBase data model to the relational world
Multiple Versions
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Row Key 3 KeyValue
Phoenix maps HBase data model to the relational world
Phoenix Table
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Row Key 3 KeyValue
Phoenix maps HBase data model to the relational world
Phoenix Table
Key Value Columns
Phoenix Data Model
HBase Table
Column Family A Column Family B
Qualifier 1 Qualifier 2 Qualifier 3
Row Key 1 KeyValue
Row Key 2 KeyValue KeyValue
Row Key 3 KeyValue
Phoenix maps HBase data model to the relational world
Phoenix Table
Key Value ColumnsPrimary Key Constraint
Example
Row Key
SERVER METRICS
HOST VARCHAR
DATE DATE
RESPONSE_TIME INTEGER
GC_TIME INTEGER
CPU_TIME INTEGER
IO_TIME INTEGER
Over metrics data for servers with a schema like this:
Example
Over metrics data for servers with a schema like this:
Key Values
SERVER METRICS
HOST VARCHAR
DATE DATE
RESPONSE_TIME INTEGER
GC_TIME INTEGER
CPU_TIME INTEGER
IO_TIME INTEGER
Example
CREATE TABLE SERVER_METRICS (
HOST VARCHAR,
DATE DATE,
RESPONSE_TIME INTEGER,
GC_TIME INTEGER,
CPU_TIME INTEGER,
IO_TIME INTEGER,
CONSTRAINT pk PRIMARY KEY (HOST, DATE))
DDL command looks like this:
With data that looks like this:
SERVER METRICS
HOST + DATE RESPONSE_TIME GC_TIME
SF1 1396743589 1234
SF1 1396743589 8012
…
SF3 1396002345 2345
SF3 1396002345 2340
SF7 1396552341 5002 1234
…
Example
Row Key
With data that looks like this:
SERVER METRICS
HOST + DATE RESPONSE_TIME GC_TIME
SF1 1396743589 1234
SF1 1396743589 8012
…
SF3 1396002345 2345
SF3 1396002345 2340
SF7 1396552341 5002 1234
…
Example
Key Values
Phoenix Push Down: Example
Completed
SELECT host, avg(response_time)
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
GROUP BY host
Phoenix Push Down: Example
Completed
SELECT host, avg(response_time)
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
GROUP BY host
Phoenix Push Down: Example
Completed
SELECT host, avg(response_time)
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
GROUP BY host
Phoenix Push Down: Example
Completed
SELECT host, avg(response_time)
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
GROUP BY host
Phoenix Push Down: Example
Completed
SELECT host, avg(response_time)
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
GROUP BY host
Phoenix Push Down
1. Skip scan filter
2. Aggregation
3. TopN
4. Hash Join
Phoenix Push Down: Skip scan
SELECT host, avg(response_time)
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
GROUP BY host
Phoenix Push Down: Skip scan
Completed
R1
R2
R3
R4
Phoenix Push Down: Skip scan
Client-side parallel scans
Completed
R1
R2
R3
R4
scan1
scan3
scan2
Phoenix Push Down: Skip scan
Server-side filter
Completed
SKIP
Phoenix Push Down: Skip scan
Server-side filter
Completed
INCLUDE
Phoenix Push Down: Skip scan
Server-side filter
Completed
SKIP
Phoenix Push Down: Skip scan
Server-side filter
Completed
INCLUDE
Phoenix Push Down: Skip scan
Server-side filter
SKIP
Phoenix Push Down: Skip scan
Server-side filter
INCLUDE
Phoenix Push Down: Skip scan
Server-side filter
INCLUDE
INCLUDE
INCLUDE
Phoenix Push Down: Aggregation
SELECT host, avg(response_time)
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
GROUP BY host
SERVER METRICS
HOST DATE KV1 KV2 KV3
SF1 Jun 2 10:10:10.234 239 234 674
SF1 Jun 3 23:05:44.975 23 234
SF1 Jun 9 08:10:32.147 256 314 341
SF1 Jun 9 08:10:32.147 235 256
SF1 Jun 1 11:18:28.456 235 23
SF1 Jun 3 22:03:22.142 234 314
SF1 Jun 3 22:03:22.142 432 234 256
SF2 Jun 1 10:29:58.950 23 432
SF2 Jun 2 14:55:34.104 314 876 23
SF2 Jun 3 12:46:19.123 256 234 314
SF2 Jun 3 12:46:19.123 432
SF2 Jun 8 08:23:23.456 876 876 235
SF2 Jun 1 10:31:10.234 234 234 876
SF3 Jun 1 10:31:10.234 432 432 234
SF3 Jun 3 10:31:10.234 890
SF3 Jun 8 10:31:10.234 314 314 235
SF3 Jun 1 10:31:10.234 256 256 876
SF3 Jun 1 10:31:10.234 235 234
SF3 Jun 8 10:31:10.234 876 876 432
SF3 Jun 9 10:31:10.234 234 234
SF3 Jun 3 10:31:10.234 432 276
… … … … …
Phoenix Push Down: Aggregation
Aggregate on server-side
SERVER METRICS
HOST AGGREGATE VALUES
SF1 3421
SF2 2145
SF3 9823
Phoenix Push Down: TopN
Completed
SELECT host, date, gc_time
FROM server_metrics
WHERE date > CURRENT_DATE() – 7
AND host LIKE ‘SF%’
ORDER BY gc_time DESC
LIMIT 5
Phoenix Push Down: TopN
Client-side parallel scans
Completed
R1
R2
R3
R4
scan1
scan3
scan2
Phoenix Push Down: TopN
Each region holds N rows
Completed
R1
R2
R3
R4
scan1
Phoenix Push Down: TopN
Each region holds N rows
Completed
R1
R2
R3
R4
scan2
Phoenix Push Down: TopN
Each region holds N rows
Completed
R1
R2
R3
R4
scan3
SERVER METRICS
HOST DATE GC_TIME
SF3 Jun 2 10:10:10.234 22123
SF5 Jun 3 23:05:44.975 19876
SF2 Jun 9 08:10:32.147 11345
SF2 Jun 1 11:18:28.456 10234
SF1 Jun 3 22:03:22.142 10111
Phoenix Push Down: TopN
Client-side final merge sort
Scan1
Scan2
Scan3
Phoenix Push Down: TopN
Secondary Index
Completed
CREATE INDEX gc_time_index
ON server_metrics (gc_time DESC, date DESC)
INCLUDE (response_time)
Row Key
GC_TIME_INDEX
GC_TIME INTEGER
DATE DATE
HOST VARCHAR
RESPONSE_TIME INTEGER
Phoenix Push Down: TopN
Secondary Index
Completed
CREATE INDEX gc_time_index
ON server_metrics (gc_time DESC, date DESC)
INCLUDE (response_time)
Key Value
GC_TIME_INDEX
GC_TIME INTEGER
DATE DATE
HOST VARCHAR
RESPONSE_TIME INTEGER
Phoenix Push Down: TopN
Secondary Index
Completed
o Original query doesn’t change
o Phoenix rewrites query to use index table
o All referenced columns must exist in index
table for it to be considered
o Stats coming soon!
Phoenix Push Down: Hash Join
Completed
SELECT m.*, i.location
FROM server_metrics m
JOIN host_info i ON m.host = i.host
WHERE m.date > CURRENT_DATE() – 7
AND i.location = ‘SF’
ORDER BY m.gc_time DESC
LIMIT 5
Phoenix Push Down: Hash Join
Separate LHS and RHS
Completed
SELECT m.*, i.location
FROM server_metrics m
JOIN host_info i ON m.host = i.host
WHERE m.date > CURRENT_DATE() – 7
AND i.location = ‘SF’
ORDER BY m.gc_time DESC
LIMIT 5
Phoenix Push Down: Hash Join
Separate LHS and RHS
Completed
SELECT m.*, i.location
FROM server_metrics m
JOIN host_info i ON m.host = i.host
WHERE m.date > CURRENT_DATE() – 7
AND i.location = ‘SF’
ORDER BY m.gc_time DESC
LIMIT 5
Phoenix Push Down: Hash Join
Separate LHS and RHS
Completed
LHS
SELECT *
FROM server_metrics
WHERE date >
CURRENT_DATE() – 7
ORDER BY gc_time DESC
LIMIT 5
RHS
SELECT location
FROM host_info
WHERE location = ‘SF’
Phoenix Push Down: Hash Join
Execute & broadcast RHS to each RS
Completed
RS1
RS2
RHS
Phoenix Push Down: TopN
Server-side map lookup during scan
Completed
R1
R2
R3
R4
RHS
RHS
LHS
scan1
scan2
scan3
scan4
How does Phoenix help
HBase scale?
How does Phoenix help
HBase scale?
1. Phoenix allows multiple tables to share
same physical HBase table
o Updateable VIEW
o Multi-tenant TABLE + tenant-specific VIEW
o Support for secondary indexes on VIEWs
How does Phoenix help
HBase scale?
2. HBase wants small # of big tables instead
of large # of small tables
o Each region for each column family of each table
consumes resources
Phoenix Shared Tables: VIEW
Completed
CREATE TABLE event (
type CHAR(1),
event_id BIGINT,
created_date DATE,
created_by VARCHAR,
CONSTRAINT pk PRIMARY KEY (type, event_id));
CREATE VIEW web_event (
referrer VARCHAR) AS
SELECT * FROM event
WHERE type=‘w’;
• Includes columns from TABLE
• Cannot define PK
• Updateable if only equality
expressions separated by AND
Phoenix Shared Tables: VIEW
Same physical HBase table
Completed
type = ‘c’
type = ‘m’
type = ‘p’
type = ‘w’
EVENT
CHAT_EVENT
MOBILE_EVENT
PHONE_EVENT
WEB_EVENT
Phoenix Shared Table: MULTI_TENANT
Completed
CREATE TABLE event (
tenant_id VARCHAR,
type CHAR(1),
event_id BIGINT,
created_date DATE,
created_by VARCHAR,
CONSTRAINT pk PRIMARY KEY (tenant_id, type, event_id))
MULTI_TENANT=true;
First PK column identifies tenant ID
Phoenix Shared Table: MULTI_TENANT
Completed
CREATE VIEW web_event (
referrer VARCHAR) AS
SELECT * FROM event
WHERE type=‘w’;
DriverManager.connect(“jdbc:phoenix:localhost;tenantId=me”);
CREATE VIEW my_web_event AS
SELECT * FROM web_event;
Tenant-specific view
Tenant-specific connection
Phoenix Shared Tables: MULTI_TENANT
Same physical HBase table
tenant_id = ‘aaa’
…
EVENT
tenant_id = ‘aab’
tenant_id = ‘zzz’
Phoenix Shared Tables: MULTI_TENANT
Same physical HBase table
type = ‘c’
type = ‘m’
type = ‘p’
type = ‘w’
EVENT
CHAT_EVENT
MOBILE_EVENT
PHONE_EVENT
WEB_EVENT
PER
tenant_id
Phoenix Shared Tables: MULTI_TENANT
o Tenant-specific connection may only see
and operate on their data
o MetaData APIs honor this
o Phoenix automatically manages scan ranges
o Primary key constraint of base table may not
be changed
o Indexes in separate shared table may be added to a VIEW
o DDL operations restricted
o No ALTER of base table
o No DROP of columns referenced in WHERE clause
Phoenix Roadmap
Completed
o Derived/nested tables (in 3.1/4.1)
o Local Indexes (in 4.1)
o Transactions
o More Join strategies
o Correlated sub-queries
o Cost-based query optimizer
o OLAP extensions
Thank you!
Questions/comments?

Contenu connexe

Tendances

Cost-based Query Optimization in Apache Phoenix using Apache Calcite
Cost-based Query Optimization in Apache Phoenix using Apache CalciteCost-based Query Optimization in Apache Phoenix using Apache Calcite
Cost-based Query Optimization in Apache Phoenix using Apache CalciteJulian Hyde
 
Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...
Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...
Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...HostedbyConfluent
 
Scaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with DatabricksScaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with DatabricksDatabricks
 
Making Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeDatabricks
 
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...Flink Forward
 
End-to-end Data Governance with Apache Avro and Atlas
End-to-end Data Governance with Apache Avro and AtlasEnd-to-end Data Governance with Apache Avro and Atlas
End-to-end Data Governance with Apache Avro and AtlasDataWorks Summit
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
 
Designing Structured Streaming Pipelines—How to Architect Things Right
Designing Structured Streaming Pipelines—How to Architect Things RightDesigning Structured Streaming Pipelines—How to Architect Things Right
Designing Structured Streaming Pipelines—How to Architect Things RightDatabricks
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemDatabricks
 
Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake
Simplify CDC Pipeline with Spark Streaming SQL and Delta LakeSimplify CDC Pipeline with Spark Streaming SQL and Delta Lake
Simplify CDC Pipeline with Spark Streaming SQL and Delta LakeDatabricks
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EngineDataWorks Summit
 
Delta from a Data Engineer's Perspective
Delta from a Data Engineer's PerspectiveDelta from a Data Engineer's Perspective
Delta from a Data Engineer's PerspectiveDatabricks
 
Hive Bucketing in Apache Spark with Tejas Patil
Hive Bucketing in Apache Spark with Tejas PatilHive Bucketing in Apache Spark with Tejas Patil
Hive Bucketing in Apache Spark with Tejas PatilDatabricks
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversScyllaDB
 
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...Andrew Lamb
 
Python and Oracle : allies for best of data management
Python and Oracle : allies for best of data managementPython and Oracle : allies for best of data management
Python and Oracle : allies for best of data managementLaurent Leturgez
 
Facebook Messages & HBase
Facebook Messages & HBaseFacebook Messages & HBase
Facebook Messages & HBase强 王
 
Performance Tuning RocksDB for Kafka Streams’ State Stores
Performance Tuning RocksDB for Kafka Streams’ State StoresPerformance Tuning RocksDB for Kafka Streams’ State Stores
Performance Tuning RocksDB for Kafka Streams’ State Storesconfluent
 

Tendances (20)

Cost-based Query Optimization in Apache Phoenix using Apache Calcite
Cost-based Query Optimization in Apache Phoenix using Apache CalciteCost-based Query Optimization in Apache Phoenix using Apache Calcite
Cost-based Query Optimization in Apache Phoenix using Apache Calcite
 
File Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & ParquetFile Format Benchmark - Avro, JSON, ORC & Parquet
File Format Benchmark - Avro, JSON, ORC & Parquet
 
Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...
Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...
Improving fault tolerance and scaling out in Kafka Streams with Bill Bejeck |...
 
Scaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with DatabricksScaling and Modernizing Data Platform with Databricks
Scaling and Modernizing Data Platform with Databricks
 
Making Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta Lake
 
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
Towards Flink 2.0: Unified Batch & Stream Processing - Aljoscha Krettek, Verv...
 
End-to-end Data Governance with Apache Avro and Atlas
End-to-end Data Governance with Apache Avro and AtlasEnd-to-end Data Governance with Apache Avro and Atlas
End-to-end Data Governance with Apache Avro and Atlas
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and Hudi
 
Designing Structured Streaming Pipelines—How to Architect Things Right
Designing Structured Streaming Pipelines—How to Architect Things RightDesigning Structured Streaming Pipelines—How to Architect Things Right
Designing Structured Streaming Pipelines—How to Architect Things Right
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format Ecosystem
 
Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake
Simplify CDC Pipeline with Spark Streaming SQL and Delta LakeSimplify CDC Pipeline with Spark Streaming SQL and Delta Lake
Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake
 
Presto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything EnginePresto: Optimizing Performance of SQL-on-Anything Engine
Presto: Optimizing Performance of SQL-on-Anything Engine
 
Delta from a Data Engineer's Perspective
Delta from a Data Engineer's PerspectiveDelta from a Data Engineer's Perspective
Delta from a Data Engineer's Perspective
 
Hive Bucketing in Apache Spark with Tejas Patil
Hive Bucketing in Apache Spark with Tejas PatilHive Bucketing in Apache Spark with Tejas Patil
Hive Bucketing in Apache Spark with Tejas Patil
 
Apache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the CoversApache Iceberg: An Architectural Look Under the Covers
Apache Iceberg: An Architectural Look Under the Covers
 
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
 
Rds data lake @ Robinhood
Rds data lake @ Robinhood Rds data lake @ Robinhood
Rds data lake @ Robinhood
 
Python and Oracle : allies for best of data management
Python and Oracle : allies for best of data managementPython and Oracle : allies for best of data management
Python and Oracle : allies for best of data management
 
Facebook Messages & HBase
Facebook Messages & HBaseFacebook Messages & HBase
Facebook Messages & HBase
 
Performance Tuning RocksDB for Kafka Streams’ State Stores
Performance Tuning RocksDB for Kafka Streams’ State StoresPerformance Tuning RocksDB for Kafka Streams’ State Stores
Performance Tuning RocksDB for Kafka Streams’ State Stores
 

Similaire à Apache Phoenix: Transforming HBase into a SQL Database

Taming HBase with Apache Phoenix and SQL
Taming HBase with Apache Phoenix and SQLTaming HBase with Apache Phoenix and SQL
Taming HBase with Apache Phoenix and SQLHBaseCon
 
Apache Phoenix: We put the SQL back in NoSQL
Apache Phoenix: We put the SQL back in NoSQLApache Phoenix: We put the SQL back in NoSQL
Apache Phoenix: We put the SQL back in NoSQLSalesforce Engineering
 
Phoenix: How (and why) we put the SQL back into the NoSQL
Phoenix: How (and why) we put the SQL back into the NoSQLPhoenix: How (and why) we put the SQL back into the NoSQL
Phoenix: How (and why) we put the SQL back into the NoSQLDataWorks Summit
 
HBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQL
HBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQLHBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQL
HBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQLCloudera, Inc.
 
Flex Tables Guide Software V. 7.0.x
Flex Tables Guide Software V. 7.0.xFlex Tables Guide Software V. 7.0.x
Flex Tables Guide Software V. 7.0.xAndrey Karpov
 
Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!
Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!
Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!confluent
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSAmazon Web Services
 
Stream Analytics with SQL on Apache Flink
 Stream Analytics with SQL on Apache Flink Stream Analytics with SQL on Apache Flink
Stream Analytics with SQL on Apache FlinkFabian Hueske
 
What's new in Confluent 3.2 and Apache Kafka 0.10.2
What's new in Confluent 3.2 and Apache Kafka 0.10.2 What's new in Confluent 3.2 and Apache Kafka 0.10.2
What's new in Confluent 3.2 and Apache Kafka 0.10.2 confluent
 
Apache Big Data EU 2015 - Phoenix
Apache Big Data EU 2015 - PhoenixApache Big Data EU 2015 - Phoenix
Apache Big Data EU 2015 - PhoenixNick Dimiduk
 
How Rackspace Cloud Monitoring uses Cassandra
How Rackspace Cloud Monitoring uses CassandraHow Rackspace Cloud Monitoring uses Cassandra
How Rackspace Cloud Monitoring uses Cassandragdusbabek
 
Hyperspace for Delta Lake
Hyperspace for Delta LakeHyperspace for Delta Lake
Hyperspace for Delta LakeDatabricks
 
Patterns of Streaming Applications
Patterns of Streaming ApplicationsPatterns of Streaming Applications
Patterns of Streaming ApplicationsC4Media
 
Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...
Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...
Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...Ververica
 
Hadoop and Hive
Hadoop and HiveHadoop and Hive
Hadoop and HiveZheng Shao
 
HBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and Spark
HBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and SparkHBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and Spark
HBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and SparkMichael Stack
 
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...What is in All of Those SSTable Files Not Just the Data One but All the Rest ...
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...DataStax
 
Architecting a Next Generation Data Platform
Architecting a Next Generation Data PlatformArchitecting a Next Generation Data Platform
Architecting a Next Generation Data Platformhadooparchbook
 

Similaire à Apache Phoenix: Transforming HBase into a SQL Database (20)

Taming HBase with Apache Phoenix and SQL
Taming HBase with Apache Phoenix and SQLTaming HBase with Apache Phoenix and SQL
Taming HBase with Apache Phoenix and SQL
 
Apache Phoenix: We put the SQL back in NoSQL
Apache Phoenix: We put the SQL back in NoSQLApache Phoenix: We put the SQL back in NoSQL
Apache Phoenix: We put the SQL back in NoSQL
 
Phoenix: How (and why) we put the SQL back into the NoSQL
Phoenix: How (and why) we put the SQL back into the NoSQLPhoenix: How (and why) we put the SQL back into the NoSQL
Phoenix: How (and why) we put the SQL back into the NoSQL
 
Apache HAWQ Architecture
Apache HAWQ ArchitectureApache HAWQ Architecture
Apache HAWQ Architecture
 
HBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQL
HBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQLHBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQL
HBaseCon 2013: How (and Why) Phoenix Puts the SQL Back into NoSQL
 
Flex Tables Guide Software V. 7.0.x
Flex Tables Guide Software V. 7.0.xFlex Tables Guide Software V. 7.0.x
Flex Tables Guide Software V. 7.0.x
 
Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!
Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!
Apache Kafka and KSQL in Action: Let's Build a Streaming Data Pipeline!
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWS
 
Stream Analytics with SQL on Apache Flink
 Stream Analytics with SQL on Apache Flink Stream Analytics with SQL on Apache Flink
Stream Analytics with SQL on Apache Flink
 
What's new in Confluent 3.2 and Apache Kafka 0.10.2
What's new in Confluent 3.2 and Apache Kafka 0.10.2 What's new in Confluent 3.2 and Apache Kafka 0.10.2
What's new in Confluent 3.2 and Apache Kafka 0.10.2
 
Apache Big Data EU 2015 - Phoenix
Apache Big Data EU 2015 - PhoenixApache Big Data EU 2015 - Phoenix
Apache Big Data EU 2015 - Phoenix
 
How Rackspace Cloud Monitoring uses Cassandra
How Rackspace Cloud Monitoring uses CassandraHow Rackspace Cloud Monitoring uses Cassandra
How Rackspace Cloud Monitoring uses Cassandra
 
Hyperspace for Delta Lake
Hyperspace for Delta LakeHyperspace for Delta Lake
Hyperspace for Delta Lake
 
Patterns of Streaming Applications
Patterns of Streaming ApplicationsPatterns of Streaming Applications
Patterns of Streaming Applications
 
Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...
Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...
Stephan Ewen - Stream Processing as a Foundational Paradigm and Apache Flink'...
 
Hadoop and Hive
Hadoop and HiveHadoop and Hive
Hadoop and Hive
 
2008 Ur Tech Talk Zshao
2008 Ur Tech Talk Zshao2008 Ur Tech Talk Zshao
2008 Ur Tech Talk Zshao
 
HBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and Spark
HBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and SparkHBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and Spark
HBaseConAsia2018 Track2-4: HTAP DB-System: AsparaDB HBase, Phoenix, and Spark
 
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...What is in All of Those SSTable Files Not Just the Data One but All the Rest ...
What is in All of Those SSTable Files Not Just the Data One but All the Rest ...
 
Architecting a Next Generation Data Platform
Architecting a Next Generation Data PlatformArchitecting a Next Generation Data Platform
Architecting a Next Generation Data Platform
 

Plus de DataWorks Summit

Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisDataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiDataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...DataWorks Summit
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal SystemDataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExampleDataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberDataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixDataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiDataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsDataWorks Summit
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureDataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudDataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiDataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerDataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...DataWorks Summit
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouDataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkDataWorks Summit
 
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...DataWorks Summit
 

Plus de DataWorks Summit (20)

Data Science Crash Course
Data Science Crash CourseData Science Crash Course
Data Science Crash Course
 
Floating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache RatisFloating on a RAFT: HBase Durability with Apache Ratis
Floating on a RAFT: HBase Durability with Apache Ratis
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFiTracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...HBase Tales From the Trenches - Short stories about most common HBase operati...
HBase Tales From the Trenches - Short stories about most common HBase operati...
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
 
Managing the Dewey Decimal System
Managing the Dewey Decimal SystemManaging the Dewey Decimal System
Managing the Dewey Decimal System
 
Practical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist ExamplePractical NoSQL: Accumulo's dirlist Example
Practical NoSQL: Accumulo's dirlist Example
 
HBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at UberHBase Global Indexing to support large-scale data ingestion at Uber
HBase Global Indexing to support large-scale data ingestion at Uber
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and PhoenixScaling Cloud-Scale Translytics Workloads with Omid and Phoenix
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFiBuilding the High Speed Cybersecurity Data Pipeline Using Apache NiFi
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability ImprovementsSupporting Apache HBase : Troubleshooting and Supportability Improvements
Supporting Apache HBase : Troubleshooting and Supportability Improvements
 
Security Framework for Multitenant Architecture
Security Framework for Multitenant ArchitectureSecurity Framework for Multitenant Architecture
Security Framework for Multitenant Architecture
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Extending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google CloudExtending Twitter's Data Platform to Google Cloud
Extending Twitter's Data Platform to Google Cloud
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFiEvent-Driven Messaging and Actions using Apache Flink and Apache NiFi
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache RangerSecuring Data in Hybrid on-premise and Cloud Environments using Apache Ranger
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
 
Computer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near YouComputer Vision: Coming to a Store Near You
Computer Vision: Coming to a Store Near You
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache SparkBig Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
 
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
 

Dernier

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdfChristopherTHyatt
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Dernier (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Apache Phoenix: Transforming HBase into a SQL Database

  • 1. Apache Phoenix: Transforming HBase into a SQL database James Taylor @JamesPlusPlus http://phoenix.apache.org
  • 2. About me Completed o Architect at Salesforce.com in Big Data group o Started Phoenix as internal project ~3 years ago o Open-source on Github ~1.5 years ago o Apache incubator for ~5 months o Graduated as Top Level Project in May 2014 o Engineer at BEA Systems o XQuery-based federated query engine o SQL-based complex event processing engine
  • 3. Agenda Completed o What is Apache Phoenix? o Why is it so fast? o How does it help HBase scale? o Roadmap o Q&A
  • 4. What is Apache Phoenix? Completed
  • 5. What is Apache Phoenix? Completed 1. Turns HBase into a SQL database o Query Engine o MetaData Repository o Embedded JDBC driver o Only for HBase data
  • 6. What is Apache Phoenix? Completed 2. Fastest way to access HBase data o HBase-specific push down o Compiles queries into native HBase calls (no map-reduce) o Executes scans in parallel
  • 7. Completed SELECT * FROM t WHERE k IN (?,?,?) Phoenix Stinger (Hive 0.11) 0.04 sec 280 sec * 110M row table 7,000x faster
  • 8. What is Apache Phoenix? Completed 3. Lightweight o No additional servers required o 100% Java o Included in HDP 2.1 distribution o Available in Amazon EMR o Otherwise copy Phoenix jar into HBase lib directory on each RS
  • 12. What is Apache Phoenix? Completed 4. Integration-friendly o Map to existing HBase table o Integrate with Apache Pig o Integrate with Apache Flume o Integrate with Apache Sqoop (wip)
  • 13. What is Apache Phoenix? Completed 1. Turns HBase into a SQL database 2. Fastest way to access HBase data 3. Lightweight 4. Integration-friendly
  • 14. Why is Phoenix so fast? Completed
  • 15. Why is Phoenix so fast? Completed 1. HBase o Fast, but “dumb” (on purpose) 2. Data model o Support for composite primary key o Binary data sorts naturally 3. Client-side parallelization 4. Push down o Custom filters and coprocessors
  • 16. Phoenix Data Model HBase Table Phoenix maps HBase data model to the relational world
  • 17. Phoenix Data Model HBase Table Column Family A Column Family B Phoenix maps HBase data model to the relational world
  • 18. Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Phoenix maps HBase data model to the relational world
  • 19. Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Phoenix maps HBase data model to the relational world
  • 20. Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Phoenix maps HBase data model to the relational world
  • 21. Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Row Key 3 KeyValue Phoenix maps HBase data model to the relational world
  • 22. HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 Value Row Key 2 Value Value Row Key 3 Value Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Row Key 3 KeyValue Phoenix maps HBase data model to the relational world
  • 23. HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 Value Row Key 2 Value Value Row Key 3 Value HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 Value Row Key 2 Value Value Row Key 3 Value Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Row Key 3 KeyValue Phoenix maps HBase data model to the relational world
  • 24. HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 Value Row Key 2 Value Value Row Key 3 Value HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 Value Row Key 2 Value Value Row Key 3 Value Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Row Key 3 KeyValue Phoenix maps HBase data model to the relational world Multiple Versions
  • 25. Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Row Key 3 KeyValue Phoenix maps HBase data model to the relational world Phoenix Table
  • 26. Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Row Key 3 KeyValue Phoenix maps HBase data model to the relational world Phoenix Table Key Value Columns
  • 27. Phoenix Data Model HBase Table Column Family A Column Family B Qualifier 1 Qualifier 2 Qualifier 3 Row Key 1 KeyValue Row Key 2 KeyValue KeyValue Row Key 3 KeyValue Phoenix maps HBase data model to the relational world Phoenix Table Key Value ColumnsPrimary Key Constraint
  • 28. Example Row Key SERVER METRICS HOST VARCHAR DATE DATE RESPONSE_TIME INTEGER GC_TIME INTEGER CPU_TIME INTEGER IO_TIME INTEGER Over metrics data for servers with a schema like this:
  • 29. Example Over metrics data for servers with a schema like this: Key Values SERVER METRICS HOST VARCHAR DATE DATE RESPONSE_TIME INTEGER GC_TIME INTEGER CPU_TIME INTEGER IO_TIME INTEGER
  • 30. Example CREATE TABLE SERVER_METRICS ( HOST VARCHAR, DATE DATE, RESPONSE_TIME INTEGER, GC_TIME INTEGER, CPU_TIME INTEGER, IO_TIME INTEGER, CONSTRAINT pk PRIMARY KEY (HOST, DATE)) DDL command looks like this:
  • 31. With data that looks like this: SERVER METRICS HOST + DATE RESPONSE_TIME GC_TIME SF1 1396743589 1234 SF1 1396743589 8012 … SF3 1396002345 2345 SF3 1396002345 2340 SF7 1396552341 5002 1234 … Example Row Key
  • 32. With data that looks like this: SERVER METRICS HOST + DATE RESPONSE_TIME GC_TIME SF1 1396743589 1234 SF1 1396743589 8012 … SF3 1396002345 2345 SF3 1396002345 2340 SF7 1396552341 5002 1234 … Example Key Values
  • 33. Phoenix Push Down: Example Completed SELECT host, avg(response_time) FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ GROUP BY host
  • 34. Phoenix Push Down: Example Completed SELECT host, avg(response_time) FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ GROUP BY host
  • 35. Phoenix Push Down: Example Completed SELECT host, avg(response_time) FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ GROUP BY host
  • 36. Phoenix Push Down: Example Completed SELECT host, avg(response_time) FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ GROUP BY host
  • 37. Phoenix Push Down: Example Completed SELECT host, avg(response_time) FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ GROUP BY host
  • 38. Phoenix Push Down 1. Skip scan filter 2. Aggregation 3. TopN 4. Hash Join
  • 39. Phoenix Push Down: Skip scan SELECT host, avg(response_time) FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ GROUP BY host
  • 40. Phoenix Push Down: Skip scan Completed R1 R2 R3 R4
  • 41. Phoenix Push Down: Skip scan Client-side parallel scans Completed R1 R2 R3 R4 scan1 scan3 scan2
  • 42. Phoenix Push Down: Skip scan Server-side filter Completed SKIP
  • 43. Phoenix Push Down: Skip scan Server-side filter Completed INCLUDE
  • 44. Phoenix Push Down: Skip scan Server-side filter Completed SKIP
  • 45. Phoenix Push Down: Skip scan Server-side filter Completed INCLUDE
  • 46. Phoenix Push Down: Skip scan Server-side filter SKIP
  • 47. Phoenix Push Down: Skip scan Server-side filter INCLUDE
  • 48. Phoenix Push Down: Skip scan Server-side filter INCLUDE INCLUDE INCLUDE
  • 49. Phoenix Push Down: Aggregation SELECT host, avg(response_time) FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ GROUP BY host
  • 50. SERVER METRICS HOST DATE KV1 KV2 KV3 SF1 Jun 2 10:10:10.234 239 234 674 SF1 Jun 3 23:05:44.975 23 234 SF1 Jun 9 08:10:32.147 256 314 341 SF1 Jun 9 08:10:32.147 235 256 SF1 Jun 1 11:18:28.456 235 23 SF1 Jun 3 22:03:22.142 234 314 SF1 Jun 3 22:03:22.142 432 234 256 SF2 Jun 1 10:29:58.950 23 432 SF2 Jun 2 14:55:34.104 314 876 23 SF2 Jun 3 12:46:19.123 256 234 314 SF2 Jun 3 12:46:19.123 432 SF2 Jun 8 08:23:23.456 876 876 235 SF2 Jun 1 10:31:10.234 234 234 876 SF3 Jun 1 10:31:10.234 432 432 234 SF3 Jun 3 10:31:10.234 890 SF3 Jun 8 10:31:10.234 314 314 235 SF3 Jun 1 10:31:10.234 256 256 876 SF3 Jun 1 10:31:10.234 235 234 SF3 Jun 8 10:31:10.234 876 876 432 SF3 Jun 9 10:31:10.234 234 234 SF3 Jun 3 10:31:10.234 432 276 … … … … … Phoenix Push Down: Aggregation Aggregate on server-side SERVER METRICS HOST AGGREGATE VALUES SF1 3421 SF2 2145 SF3 9823
  • 51. Phoenix Push Down: TopN Completed SELECT host, date, gc_time FROM server_metrics WHERE date > CURRENT_DATE() – 7 AND host LIKE ‘SF%’ ORDER BY gc_time DESC LIMIT 5
  • 52. Phoenix Push Down: TopN Client-side parallel scans Completed R1 R2 R3 R4 scan1 scan3 scan2
  • 53. Phoenix Push Down: TopN Each region holds N rows Completed R1 R2 R3 R4 scan1
  • 54. Phoenix Push Down: TopN Each region holds N rows Completed R1 R2 R3 R4 scan2
  • 55. Phoenix Push Down: TopN Each region holds N rows Completed R1 R2 R3 R4 scan3
  • 56. SERVER METRICS HOST DATE GC_TIME SF3 Jun 2 10:10:10.234 22123 SF5 Jun 3 23:05:44.975 19876 SF2 Jun 9 08:10:32.147 11345 SF2 Jun 1 11:18:28.456 10234 SF1 Jun 3 22:03:22.142 10111 Phoenix Push Down: TopN Client-side final merge sort Scan1 Scan2 Scan3
  • 57. Phoenix Push Down: TopN Secondary Index Completed CREATE INDEX gc_time_index ON server_metrics (gc_time DESC, date DESC) INCLUDE (response_time) Row Key GC_TIME_INDEX GC_TIME INTEGER DATE DATE HOST VARCHAR RESPONSE_TIME INTEGER
  • 58. Phoenix Push Down: TopN Secondary Index Completed CREATE INDEX gc_time_index ON server_metrics (gc_time DESC, date DESC) INCLUDE (response_time) Key Value GC_TIME_INDEX GC_TIME INTEGER DATE DATE HOST VARCHAR RESPONSE_TIME INTEGER
  • 59. Phoenix Push Down: TopN Secondary Index Completed o Original query doesn’t change o Phoenix rewrites query to use index table o All referenced columns must exist in index table for it to be considered o Stats coming soon!
  • 60. Phoenix Push Down: Hash Join Completed SELECT m.*, i.location FROM server_metrics m JOIN host_info i ON m.host = i.host WHERE m.date > CURRENT_DATE() – 7 AND i.location = ‘SF’ ORDER BY m.gc_time DESC LIMIT 5
  • 61. Phoenix Push Down: Hash Join Separate LHS and RHS Completed SELECT m.*, i.location FROM server_metrics m JOIN host_info i ON m.host = i.host WHERE m.date > CURRENT_DATE() – 7 AND i.location = ‘SF’ ORDER BY m.gc_time DESC LIMIT 5
  • 62. Phoenix Push Down: Hash Join Separate LHS and RHS Completed SELECT m.*, i.location FROM server_metrics m JOIN host_info i ON m.host = i.host WHERE m.date > CURRENT_DATE() – 7 AND i.location = ‘SF’ ORDER BY m.gc_time DESC LIMIT 5
  • 63. Phoenix Push Down: Hash Join Separate LHS and RHS Completed LHS SELECT * FROM server_metrics WHERE date > CURRENT_DATE() – 7 ORDER BY gc_time DESC LIMIT 5 RHS SELECT location FROM host_info WHERE location = ‘SF’
  • 64. Phoenix Push Down: Hash Join Execute & broadcast RHS to each RS Completed RS1 RS2 RHS
  • 65. Phoenix Push Down: TopN Server-side map lookup during scan Completed R1 R2 R3 R4 RHS RHS LHS scan1 scan2 scan3 scan4
  • 66. How does Phoenix help HBase scale?
  • 67. How does Phoenix help HBase scale? 1. Phoenix allows multiple tables to share same physical HBase table o Updateable VIEW o Multi-tenant TABLE + tenant-specific VIEW o Support for secondary indexes on VIEWs
  • 68. How does Phoenix help HBase scale? 2. HBase wants small # of big tables instead of large # of small tables o Each region for each column family of each table consumes resources
  • 69. Phoenix Shared Tables: VIEW Completed CREATE TABLE event ( type CHAR(1), event_id BIGINT, created_date DATE, created_by VARCHAR, CONSTRAINT pk PRIMARY KEY (type, event_id)); CREATE VIEW web_event ( referrer VARCHAR) AS SELECT * FROM event WHERE type=‘w’; • Includes columns from TABLE • Cannot define PK • Updateable if only equality expressions separated by AND
  • 70. Phoenix Shared Tables: VIEW Same physical HBase table Completed type = ‘c’ type = ‘m’ type = ‘p’ type = ‘w’ EVENT CHAT_EVENT MOBILE_EVENT PHONE_EVENT WEB_EVENT
  • 71. Phoenix Shared Table: MULTI_TENANT Completed CREATE TABLE event ( tenant_id VARCHAR, type CHAR(1), event_id BIGINT, created_date DATE, created_by VARCHAR, CONSTRAINT pk PRIMARY KEY (tenant_id, type, event_id)) MULTI_TENANT=true; First PK column identifies tenant ID
  • 72. Phoenix Shared Table: MULTI_TENANT Completed CREATE VIEW web_event ( referrer VARCHAR) AS SELECT * FROM event WHERE type=‘w’; DriverManager.connect(“jdbc:phoenix:localhost;tenantId=me”); CREATE VIEW my_web_event AS SELECT * FROM web_event; Tenant-specific view Tenant-specific connection
  • 73. Phoenix Shared Tables: MULTI_TENANT Same physical HBase table tenant_id = ‘aaa’ … EVENT tenant_id = ‘aab’ tenant_id = ‘zzz’
  • 74. Phoenix Shared Tables: MULTI_TENANT Same physical HBase table type = ‘c’ type = ‘m’ type = ‘p’ type = ‘w’ EVENT CHAT_EVENT MOBILE_EVENT PHONE_EVENT WEB_EVENT PER tenant_id
  • 75. Phoenix Shared Tables: MULTI_TENANT o Tenant-specific connection may only see and operate on their data o MetaData APIs honor this o Phoenix automatically manages scan ranges o Primary key constraint of base table may not be changed o Indexes in separate shared table may be added to a VIEW o DDL operations restricted o No ALTER of base table o No DROP of columns referenced in WHERE clause
  • 76. Phoenix Roadmap Completed o Derived/nested tables (in 3.1/4.1) o Local Indexes (in 4.1) o Transactions o More Join strategies o Correlated sub-queries o Cost-based query optimizer o OLAP extensions