SlideShare une entreprise Scribd logo
1  sur  59
1 © Hortonworks Inc. 2011–2018. All rights reserved
Meet HBase 2.0 and Phoenix 5.0
Ankit Singhal and Rajeshbabu Chintaguntla
HBase & Phoenix Engineers in Hortonworks
2 © Hortonworks Inc. 2011–2018. All rights reserved
About Us
Ankit Singhal
- Phoenix PMC and HBase contributor
- HBase dev @Hortonworks
Rajeshbabu Chintaguntla
- Phoenix PMC and HBase Committer
- HBase dev @Hortonworks
3 © Hortonworks Inc. 2011–2018. All rights reserved
Outline
Recent releases
Versioning / Compatibility
Changes in behavior
Major Features in HBase-2.0
Phoenix- 5.0
4 © Hortonworks Inc. 2011–2018. All rights reserved
Outline
Recent releases
Versioning / Compatibility
Changes in behavior
Major Features in HBase-2.0
Phoenix-5.0
Picture taken from: https://www.ostraining.com/blog/joomla/content-versioning-a-sneak-peek-at-joomla-s-newest-proposed-feature/
5 © Hortonworks Inc. 2011–2018. All rights reserved
2018 (repo and releases)
(master) 3.0.0-SNAPSHOT
(branch-1) 1.5.0-SNAPSHOT
(branch-1.4)
1.4.3
1.3.21.3.1
(branch-1.3)
1.2.0 1.2.6
1.1.0 1.1.13
(branch-1.2)
(branch-1.1)
(branch-2) 2.1.0-SNAPSHOT
1.4.4
2.0.-beta-1-2 2.0.0 2.1.0
(expected)
3.0.0
1.4.5
6 © Hortonworks Inc. 2011–2018. All rights reserved
How to choose a release
• If you are on 0.98.x or earlier, time to upgrade!
• 1.0 is EOL’ed. Use 1.3.x at least
• Both 1.3.2 and 1.4.4 are pretty stable
• Moving between minor versions is easy for 1.x
• Starting from scratch, use 1.4.4 or 2.0.1(RC)/2.1.0(planned)
Image is taken from:http://meritcd.com/blogs/improve-your-decision-making-improve-your-leadership-2/
7 © Hortonworks Inc. 2011–2018. All rights reserved
Outline
Recent releases
Versioning / Compatibility
Changes in behavior
Major Features in HBase-2.0
Phoenix-5.0
8 © Hortonworks Inc. 2011–2018. All rights reserved
Semantic Versioning
Starting with the 1.0 release, HBase works toward Semantic Versioning
MAJOR.MINOR.PATCH[-identifiers]
PATCH: only BC bug fixes.
MINOR: BC new features
MAJOR: Incompatible changes
9 © Hortonworks Inc. 2011–2018. All rights reserved
To be, or not to be (Compatible)
10 © Hortonworks Inc. 2011–2018. All rights reserved
To be, or not to be (Compatible)
• Compatibility is NOT a simple yes or no
• Many dimensions
• source, binary, wire, command line, dependencies etc
• Read https://hbase.apache.org/book.html#upgrading
11 © Hortonworks Inc. 2011–2018. All rights reserved
Major Minor Patch
Client-Server Wire Compatibility ✗ ✓ ✓
Server-Server Compatibility ✗ ✓ ✓
File Format Compatibility ✗* ✓ ✓
Client API Compatibility ✗ ✓ ✓
Client Binary Compatibility ✗ ✗ ✓
Server Side Limited API Compatibility ✗ ✗*/✓* ✓
Dependency Compatibility ✗ ✓ ✓
Operation Compatibility ✗ ✗ ✓
Compatibilities
12 © Hortonworks Inc. 2011–2018. All rights reserved
HBase API surface
• Client API (For Application Developer)
• Explicitly marked with InterfaceAudience.Public
• Get/Put/Table/Connection, etc
• LimitedPrivate API (For Advanced user)
• Explicitly marked with InterfaceAudience.LimitedPrivate
• Coprocessors, replication APIs
• Private API (For HBase Internal use)
• Explicitly marked with InterfaceAudience.Private
• All other classes not marked
• Also InterfaceAudience.{Stable,Evolving,Unstable}
13 © Hortonworks Inc. 2011–2018. All rights reserved
Outline
Recent releases
Versioning / Compatibility
Changes in behavior
Major Features in HBase-2.0
Phoenix-5.0
14 © Hortonworks Inc. 2011–2018. All rights reserved
Changes in behavior – HBase-2.0
• JDK-8+ only
• Hadoop-2.7+ (Hadoop-3 support)
• Filter and Coprocessor changes
• Managed connections are gone.
• “Rolling upgradable not supported” from 1.x
• Some DDL / Admin operations may not work with the old client
• Updated dependencies
• Deprecated client interfaces are gone
https://docs.google.com/document/d/1WCsVlnHjJeKUcl7wHwqb4z9iEu_ktczrlKHK8N4SZzs/edit#heading=h.v21r9nz8g01j
15 © Hortonworks Inc. 2011–2018. All rights reserved
HBase 0.98 Name(s) HBase 1.x + 2.x name(s)
HConnectionManager,
ConnectionManager
ConnectionFactory
HConnection, ClusterConnection Connection
HBaseAdmin Admin
HTable(HTableInterface) Table
RegionLocator
HTableDescriptor, HColumnDescriptor,
HRegionInfo
TableDescriptor,
ColumnFamilyDescriptor,
RegionInfo
API Changes
16 © Hortonworks Inc. 2011–2018. All rights reserved
Outline
Recent releases
Versioning / Compatibility
Changes in behavior
Major Features in HBase-2.0
Phoenix-5.0
17 © Hortonworks Inc. 2011–2018. All rights reserved
Offheaping Read Path
• Goals
• Make use of all available RAM
• Reduce temporary garbage for reading
• Make bucket cache as fast as LRU cache
• HBase block cache can be configured as L1 / L2
• “Bucket cache” can be backed by onheap, off-heap or SSD
• Different sized buckets in 4MB slices
• HFile blocks placed in different buckets
• Earlier Cell comparison only used to deal with byte[] and copy the whole block to heap
18 © Hortonworks Inc. 2011–2018. All rights reserved
https://www.slideshare.net/HBaseCon/offheaping-the-apache-hbase-read-path
19 © Hortonworks Inc. 2011–2018. All rights reserved
https://www.slideshare.net/HBaseCon/offheaping-the-apache-hbase-read-path
20 © Hortonworks Inc. 2011–2018. All rights reserved
Offheaping Read Path
https://blogs.apache.org/hbase/entry/offheaping_the_read_path_in1
21 © Hortonworks Inc. 2011–2018. All rights reserved
Offheaping Read Path
Offheap Read-Path in Production - The Alibaba story (https://blogs.apache.org/hbase/)
22 © Hortonworks Inc. 2011–2018. All rights reserved
Offheaping Write Path
• Reduce GC, improve throughput predictability
• Use off-heap buffer pools to read RPC requests
• Remove garbage created in Codec parsing, MSLAB copying, etc
• MSLAB is used for heap fragmentation
• Now MSLAB can allocate offheap pooled buffers [2MB]
23 © Hortonworks Inc. 2011–2018. All rights reserved
Offheaping Write Path
• Allows bigger total memstore space
• Cell data goes off-heap
• Cell metadata (and CSLM#Entry, etc) is on-heap (~100 byte overhead)
• Async WAL is used to avoid copying Cells into heap
• Works with new “Compacting Memstore”
24 © Hortonworks Inc. 2011–2018. All rights reserved
Compacting Memstore
• In memory flushes
• Reduce index overhead per cell
• Gains proportional to cell size
• In memory compaction
• Eliminate duplicates
• Gains are proportional to duplicates
• Reduce flushes to disk
• Reduce file creation
• Reduce write amplification
25 © Hortonworks Inc. 2011–2018. All rights reserved
Compacting Memstore
https://www.slideshare.net/HBaseCon/apache-hbase-accelerated-inmemory-flush-and-compaction
26 © Hortonworks Inc. 2011–2018. All rights reserved
26
27 © Hortonworks Inc. 2011–2018. All rights reserved
https://www.slideshare.net/HBaseCon/apache-hbase-accelerated-inmemory-flush-and-compaction
28 © Hortonworks Inc. 2011–2018. All rights reserved
Async Client
• A complete new client maintained in HBase
• Different than OpenTSDB/asynchbase
• Fully async end to end
• Based on Netty and JDK-8 CompletableFutures
• Includes most critical functionality including Admin / DDL
• Some advanced functionality (read replicas, etc) are not done yet
• Have both sync and async client.
29 © Hortonworks Inc. 2011–2018. All rights reserved
Async Client
30 © Hortonworks Inc. 2011–2018. All rights reserved
Region Assignment
• Region assignment is one of the pain points
• Complete overhaul of region assignment
• Based on the internal “Procedure V2” framework
• Series of operations are broken down into states in State Machine
• State’s are serialized to durable storage
• Uses hbase:meta to store the state of region assignments
• States of DDLs are stored in WAL on HDFS
• Backup master can recover the state from where left off
31 © Hortonworks Inc. 2011–2018. All rights reserved
Backup / Restore
• Need a reliable native Backup mechanism
32 © Hortonworks Inc. 2011–2018. All rights reserved
Backup / Restore
• Full backups and Incremental backups
• Backup destination to a remote FS
• HDFS, S3, ADLS, WASB, and any Hadoop-compatible FS
• Full backup based on “snapshots”
• Incremental backup ships Write-Ahead-Logs
• Backup + Replication are complimentary (for a full DR solution)
33 © Hortonworks Inc. 2011–2018. All rights reserved
Backup / Restore
• Flexible restore options
34 © Hortonworks Inc. 2011–2018. All rights reserved
FileSystem Quotas
• HBase supports quotas
• Request count or size per sec,min,hour,day (per server)
• Num tables / num regions in namespace
• New work enables quotas for disk space usage
• Master periodically gathers info about disk space usage
• Regionservers enforce limit when notified from master
• Limits per table or per namespace (not per user)
• Violation policy: {DISABLE, NO_WRITES_COMPACTIONS, NO_WRITES, NO_INSERTS}
35 © Hortonworks Inc. 2011–2018. All rights reserved
FileSystem Quotas
• Much more complex when snapshots are in the picture (similar to quotas with hard
links in FS)
• Snapshot usage is counted against the originating table
• A File is only counted once (even original table, or snapshot or restored table refers to
it)
• More info: HBASE-16961 => ‘prod_website’, LIMIT => ‘7125GB’
hbase> set_quota TYPE => SIZE, VIOLATION_POLICY
=>DELETES_WITH_COMPACTIONS, NAMESPACE =>
‘prod_website’, LIMIT => ‘7125GB’
36 © Hortonworks Inc. 2011–2018. All rights reserved
Will be released for the first time
• These will be released for the first time in Apache (although other backports exist)
• Medium Object Support
• Region Server groups
• Favored Nodes enhancements
• HBase-Spark module
• WAL’s and data files in separate FileSystems
37 © Hortonworks Inc. 2011–2018. All rights reserved
Outline
Recent releases
Versioning / Compatibility
Changes in behavior
Major Features in HBase-2.0
Phoenix- 5.0
38 © Hortonworks Inc. 2011–2018. All rights reserved
Phoenix 5.0 - Outline
• Phoenix releases
• Index Improvements.
• Join Improvements.
• Major Features
39 © Hortonworks Inc. 2011–2018. All rights reserved
2018 (repo and releases)
(master) 4.15.0-SNAPSHOT
(5.x-HBase-2.0) 5.0.0-SNAPSHOT
5.0.0
4.15.04.14.0
5.0.0-alpha-1
4.13.0
5.x will become master after 5.0 release
40 © Hortonworks Inc. 2011–2018. All rights reserved
Join Improvements
41 © Hortonworks Inc. 2011–2018. All rights reserved
Cost based Optimizations for Joins
• Phoenix support two Join algorithms
• Hash Join
• Sort Merge Join.
• Prior to 4.14.0 user need to explicitly Hint compiler to use Sort Merge Join when
required.
• From 4.14.0 onwards best join algorithm selection happen automatically based on
query join predicate and ordering
42 © Hortonworks Inc. 2011–2018. All rights reserved
• Sort merge join wins over Hash Join when both the children are ordered.
• Hash-join wins over sort-merge-join w/ smaller side ordered.
• Hash-join wins over sort-merge-join w/o any side ordered.
• Hash-join wins over sort-merge-join w/ both sides ordered in an order-by query – client
need to sort merge the results.
• Multi-table join: a mix of join strategies chosen based on cost.
43 © Hortonworks Inc. 2011–2018. All rights reserved
Index Improvements
44 © Hortonworks Inc. 2011–2018. All rights reserved
Index Write Failure Handling
• Phoenix supports many strategies for handling index failures.
• Blocking writes to data table until index restored to consistent state.
• Disable index with timestamp at which failure occurred and rebuild index in the background until
consistent state reached.
• Disable index and manual index rebuilding from MR tool to avoid overhead at server.
• Kill the RS so that the index updates replayed or recovered to consistent state.
45 © Hortonworks Inc. 2011–2018. All rights reserved
Index Write Failure Handling (Cont’d..)
Client Retries On Index Failure
• Prior to Phoenix 5.0.0 the retry happen at server on index write failures and disable
index even if the write not succeeded after multiple retries.
• This adds unnecessary overhead to server and block the handlers.
• Now the client handles the retries without any blocking.
46 © Hortonworks Inc. 2011–2018. All rights reserved
Incremental Index Rebuilding
• When data accumulated to rebuild is more(hours or days of data) after index failures
then it’s difficult to rebuild the indexes in single pass.
• So in latest version the index rebuilding happen in batches with check pointing by
timestamp so that any failures happen in the middle rebuilding starts from last
successful checkpoint.
• Automatic rebuilding stops when user initiates manual index rebuilding.
47 © Hortonworks Inc. 2011–2018. All rights reserved
MR based Index Rebuilding
• Alter index with REBUILD ASYNC option.
• Use IndexTool to rebuild the index.
48 © Hortonworks Inc. 2011–2018. All rights reserved
Global Index performance Improvement
• Presplit index table before asynchronous index building.
• Until unless we know about the indexed column value data set it’s difficult to pre define the
boundaries of index table.
• Automatic process has implemented in 4.14.0 version where tool identifies the approximate split
keys by collecting the index key values from data table sample(or full table) where the sampling
rate can be changed from 0 to 100%.
• The tool builds Equi-depth histogram to identify the approximate index region boundaries.
49 © Hortonworks Inc. 2011–2018. All rights reserved
Local Index performance Improvements
• As of now when we scan through local index we need to touch base all the regions
irrespective of filters.
• In the current version we can filter the regions to scan for local indexes based on data
table leading primary key filter conditions.
50 © Hortonworks Inc. 2011–2018. All rights reserved
Kafka Integration
51 © Hortonworks Inc. 2011–2018. All rights reserved
Apache Kafka Integration
• Apache Kafka plugin helps to reliably and efficiently stream
large amounts of data onto HBase using Phoenix API. As part
of this plugin we are providing PhoenixConsumer to receive
messages from KafkaProducer.
52 © Hortonworks Inc. 2011–2018. All rights reserved
Major Features
53 © Hortonworks Inc. 2011–2018. All rights reserved
Support for GRANT and REVOKE commands
• Internally uses HBase ACLs only, possible permissions are R - Read, W - Write, X -
Execute, C - Create and A - Admin.
• Access needs to be given on data table which automatically get propagated to all its
indexes and views
• No need to have WRITE access on SYSTEM.CATALOG table now for DDLs
• New Indexes will be given same permission as data table.
54 © Hortonworks Inc. 2011–2018. All rights reserved
Query log
• Run time information of the query is logged in the Phoenix table(SYSTEM.LOG)
• There are around 40+ columns of information to help in analzing the query at any point
of time.
• Logging is done Asynchronously and can be controlled through log
level(TRACE,DEBUG,INFO,OFF)
55 © Hortonworks Inc. 2011–2018. All rights reserved
Python driver for Phoenix Query Server
• It implements the Python DB 2.0 API to access Phoenix via the Phoenix Query Server
• Example
56 © Hortonworks Inc. 2011–2018. All rights reserved
Persistent Subquery Results Cache - WIP
• Persisting expensive subqueries data beyond life of query helps in significant
performance gain when the same subquery repeated in multiple queries.
• Make use of coprocessor based server cache to avoid caching at client.
• WIP at https://issues.apache.org/jira/browse/PHOENIX-4666
57 © Hortonworks Inc. 2011–2018. All rights reserved
Upgrades/New
• Apache Omid Integration for Transactions Support( along with Tephra)
• Hive 3.0.0
• Hadoop 3.0.0
• Spark 2.3.0
• Tephra 0.14.0
• Avatica 0.11.0
58 © Hortonworks Inc. 2011–2018. All rights reserved
Questions?
59 © Hortonworks Inc. 2011–2018. All rights reserved
Thank you

Contenu connexe

Tendances

Upgrading to Cognos Analytics Release 4: What You Need to Know
Upgrading to Cognos Analytics Release 4: What You Need to KnowUpgrading to Cognos Analytics Release 4: What You Need to Know
Upgrading to Cognos Analytics Release 4: What You Need to KnowSenturus
 
What's new in SharePoint 2010 for Backup and Recovery - SP Saturday Copenhagen
What's new in SharePoint 2010 for Backup and Recovery - SP Saturday CopenhagenWhat's new in SharePoint 2010 for Backup and Recovery - SP Saturday Copenhagen
What's new in SharePoint 2010 for Backup and Recovery - SP Saturday CopenhagenIlia Sotnikov
 
SharePoint 2010 Upgrade Best Practices Croatia
SharePoint 2010 Upgrade Best Practices CroatiaSharePoint 2010 Upgrade Best Practices Croatia
SharePoint 2010 Upgrade Best Practices CroatiaJoel Oleson
 
Large Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint DeploymentsLarge Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint DeploymentsJoel Oleson
 
COUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_FeaturesCOUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_FeaturesAlfredo Abate
 
Cognos 11 installation step by step and notes
Cognos 11 installation step by step and notesCognos 11 installation step by step and notes
Cognos 11 installation step by step and notesCarlos Castro Rodríguez
 
SharePoint Microsoft IT - Swiss Presentation with Joel and Andre
SharePoint Microsoft IT - Swiss Presentation with Joel and AndreSharePoint Microsoft IT - Swiss Presentation with Joel and Andre
SharePoint Microsoft IT - Swiss Presentation with Joel and AndreJoel Oleson
 
Upgrading to SharePoint 2010 - Teched Middle East
Upgrading to SharePoint 2010 - Teched Middle EastUpgrading to SharePoint 2010 - Teched Middle East
Upgrading to SharePoint 2010 - Teched Middle EastJoel Oleson
 
SharePoint 2010 Upgrade Drill Down
SharePoint 2010 Upgrade Drill DownSharePoint 2010 Upgrade Drill Down
SharePoint 2010 Upgrade Drill DownJoel Oleson
 
Oracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and Administer
Oracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and AdministerOracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and Administer
Oracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and AdministerAndrejs Karpovs
 
Anatomy of Autoconfig in Oracle E-Business Suite
Anatomy of Autoconfig in Oracle E-Business SuiteAnatomy of Autoconfig in Oracle E-Business Suite
Anatomy of Autoconfig in Oracle E-Business Suitevasuballa
 
SAP HANA SPS08 Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS08 Scale-Out, High Availability and Disaster RecoverySAP HANA SPS08 Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS08 Scale-Out, High Availability and Disaster Recovery SAP Technology
 
Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2
Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2
Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2pasalapudi
 
SharePoint and Large Scale SQL Deployments - NZSPC
SharePoint and Large Scale SQL Deployments - NZSPCSharePoint and Large Scale SQL Deployments - NZSPC
SharePoint and Large Scale SQL Deployments - NZSPCguest7c2e070
 
MOSS 2007 Deployment Fundamentals -Part1
MOSS 2007 Deployment Fundamentals -Part1MOSS 2007 Deployment Fundamentals -Part1
MOSS 2007 Deployment Fundamentals -Part1Information Technology
 
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike WatsonSharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike WatsonJoel Oleson
 
Oracle EM12c Release 4 New Features!
Oracle EM12c Release 4 New Features!Oracle EM12c Release 4 New Features!
Oracle EM12c Release 4 New Features!Kellyn Pot'Vin-Gorman
 

Tendances (20)

websphere - Understanding web sphere performance
websphere - Understanding web sphere performancewebsphere - Understanding web sphere performance
websphere - Understanding web sphere performance
 
Upgrading to Cognos Analytics Release 4: What You Need to Know
Upgrading to Cognos Analytics Release 4: What You Need to KnowUpgrading to Cognos Analytics Release 4: What You Need to Know
Upgrading to Cognos Analytics Release 4: What You Need to Know
 
What's new in SharePoint 2010 for Backup and Recovery - SP Saturday Copenhagen
What's new in SharePoint 2010 for Backup and Recovery - SP Saturday CopenhagenWhat's new in SharePoint 2010 for Backup and Recovery - SP Saturday Copenhagen
What's new in SharePoint 2010 for Backup and Recovery - SP Saturday Copenhagen
 
SharePoint 2010 Upgrade Best Practices Croatia
SharePoint 2010 Upgrade Best Practices CroatiaSharePoint 2010 Upgrade Best Practices Croatia
SharePoint 2010 Upgrade Best Practices Croatia
 
Large Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint DeploymentsLarge Scale SQL Considerations for SharePoint Deployments
Large Scale SQL Considerations for SharePoint Deployments
 
COUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_FeaturesCOUG_AAbate_Oracle_Database_12c_New_Features
COUG_AAbate_Oracle_Database_12c_New_Features
 
Cognos 11 installation step by step and notes
Cognos 11 installation step by step and notesCognos 11 installation step by step and notes
Cognos 11 installation step by step and notes
 
SharePoint Microsoft IT - Swiss Presentation with Joel and Andre
SharePoint Microsoft IT - Swiss Presentation with Joel and AndreSharePoint Microsoft IT - Swiss Presentation with Joel and Andre
SharePoint Microsoft IT - Swiss Presentation with Joel and Andre
 
Upgrading to SharePoint 2010 - Teched Middle East
Upgrading to SharePoint 2010 - Teched Middle EastUpgrading to SharePoint 2010 - Teched Middle East
Upgrading to SharePoint 2010 - Teched Middle East
 
HBase Snapshots
HBase SnapshotsHBase Snapshots
HBase Snapshots
 
SharePoint 2010 Upgrade Drill Down
SharePoint 2010 Upgrade Drill DownSharePoint 2010 Upgrade Drill Down
SharePoint 2010 Upgrade Drill Down
 
Oracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and Administer
Oracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and AdministerOracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and Administer
Oracle E-Business Suite R12.2.5 on Database 12c: Install, Patch and Administer
 
Anatomy of Autoconfig in Oracle E-Business Suite
Anatomy of Autoconfig in Oracle E-Business SuiteAnatomy of Autoconfig in Oracle E-Business Suite
Anatomy of Autoconfig in Oracle E-Business Suite
 
SAP HANA SPS08 Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS08 Scale-Out, High Availability and Disaster RecoverySAP HANA SPS08 Scale-Out, High Availability and Disaster Recovery
SAP HANA SPS08 Scale-Out, High Availability and Disaster Recovery
 
Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2
Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2
Aioug2017 deploying-ebs-on-prem-and-on-oracle-cloud v2
 
SharePoint and Large Scale SQL Deployments - NZSPC
SharePoint and Large Scale SQL Deployments - NZSPCSharePoint and Large Scale SQL Deployments - NZSPC
SharePoint and Large Scale SQL Deployments - NZSPC
 
MOSS 2007 Deployment Fundamentals -Part1
MOSS 2007 Deployment Fundamentals -Part1MOSS 2007 Deployment Fundamentals -Part1
MOSS 2007 Deployment Fundamentals -Part1
 
Column encoding
Column encodingColumn encoding
Column encoding
 
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike WatsonSharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
SharePoint Advanced Administration with Joel Oleson, Shane Young and Mike Watson
 
Oracle EM12c Release 4 New Features!
Oracle EM12c Release 4 New Features!Oracle EM12c Release 4 New Features!
Oracle EM12c Release 4 New Features!
 

Similaire à Meet HBase 2.0 and Phoenix 5.0

Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0DataWorks Summit
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0DataWorks Summit
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive DataWorks Summit
 
Meet hbase 2.0
Meet hbase 2.0Meet hbase 2.0
Meet hbase 2.0enissoz
 
Meet HBase 2.0
Meet HBase 2.0Meet HBase 2.0
Meet HBase 2.0enissoz
 
Apache phoenix: Past, Present and Future of SQL over HBAse
Apache phoenix: Past, Present and Future of SQL over HBAseApache phoenix: Past, Present and Future of SQL over HBAse
Apache phoenix: Past, Present and Future of SQL over HBAseenissoz
 
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBaseApache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBaseDataWorks Summit/Hadoop Summit
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFTDataWorks Summit
 
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBaseApache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBaseDataWorks Summit/Hadoop Summit
 
What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?DataWorks Summit
 
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?DataWorks Summit
 
Hive 3 New Horizons DataWorks Summit Melbourne February 2019
Hive 3 New Horizons DataWorks Summit Melbourne February 2019Hive 3 New Horizons DataWorks Summit Melbourne February 2019
Hive 3 New Horizons DataWorks Summit Melbourne February 2019alanfgates
 
Apache HBase Internals you hoped you Never Needed to Understand
Apache HBase Internals you hoped you Never Needed to UnderstandApache HBase Internals you hoped you Never Needed to Understand
Apache HBase Internals you hoped you Never Needed to UnderstandJosh Elser
 
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3DataWorks Summit
 
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...huguk
 
Hadoop & cloud storage object store integration in production (final)
Hadoop & cloud storage  object store integration in production (final)Hadoop & cloud storage  object store integration in production (final)
Hadoop & cloud storage object store integration in production (final)Chris Nauroth
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionDataWorks Summit/Hadoop Summit
 
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingApache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingDataWorks Summit/Hadoop Summit
 
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBaseHBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBaseCloudera, Inc.
 

Similaire à Meet HBase 2.0 and Phoenix 5.0 (20)

Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0Meet HBase 2.0 and Phoenix-5.0
Meet HBase 2.0 and Phoenix-5.0
 
Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0Meet HBase 2.0 and Phoenix 5.0
Meet HBase 2.0 and Phoenix 5.0
 
What's new in apache hive
What's new in apache hive What's new in apache hive
What's new in apache hive
 
Meet Apache HBase - 2.0
Meet Apache HBase - 2.0Meet Apache HBase - 2.0
Meet Apache HBase - 2.0
 
Meet hbase 2.0
Meet hbase 2.0Meet hbase 2.0
Meet hbase 2.0
 
Meet HBase 2.0
Meet HBase 2.0Meet HBase 2.0
Meet HBase 2.0
 
Apache phoenix: Past, Present and Future of SQL over HBAse
Apache phoenix: Past, Present and Future of SQL over HBAseApache phoenix: Past, Present and Future of SQL over HBAse
Apache phoenix: Past, Present and Future of SQL over HBAse
 
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBaseApache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
 
High throughput data replication over RAFT
High throughput data replication over RAFTHigh throughput data replication over RAFT
High throughput data replication over RAFT
 
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBaseApache Phoenix and HBase: Past, Present and Future of SQL over HBase
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
 
What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?What is new in Apache Hive 3.0?
What is new in Apache Hive 3.0?
 
What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?What is New in Apache Hive 3.0?
What is New in Apache Hive 3.0?
 
Hive 3 New Horizons DataWorks Summit Melbourne February 2019
Hive 3 New Horizons DataWorks Summit Melbourne February 2019Hive 3 New Horizons DataWorks Summit Melbourne February 2019
Hive 3 New Horizons DataWorks Summit Melbourne February 2019
 
Apache HBase Internals you hoped you Never Needed to Understand
Apache HBase Internals you hoped you Never Needed to UnderstandApache HBase Internals you hoped you Never Needed to Understand
Apache HBase Internals you hoped you Never Needed to Understand
 
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
Migrating your clusters and workloads from Hadoop 2 to Hadoop 3
 
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
Apache Argus - How do I secure my entire Hadoop cluster? Olivier Renault @ Ho...
 
Hadoop & cloud storage object store integration in production (final)
Hadoop & cloud storage  object store integration in production (final)Hadoop & cloud storage  object store integration in production (final)
Hadoop & cloud storage object store integration in production (final)
 
Hadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in ProductionHadoop & Cloud Storage: Object Store Integration in Production
Hadoop & Cloud Storage: Object Store Integration in Production
 
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and TroubleshootingApache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
Apache Ambari - HDP Cluster Upgrades Operational Deep Dive and Troubleshooting
 
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBaseHBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
HBaseCon 2013: Apache HBase and HDFS - Understanding Filesystem Usage in HBase
 

Dernier

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Dernier (20)

Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Meet HBase 2.0 and Phoenix 5.0

  • 1. 1 © Hortonworks Inc. 2011–2018. All rights reserved Meet HBase 2.0 and Phoenix 5.0 Ankit Singhal and Rajeshbabu Chintaguntla HBase & Phoenix Engineers in Hortonworks
  • 2. 2 © Hortonworks Inc. 2011–2018. All rights reserved About Us Ankit Singhal - Phoenix PMC and HBase contributor - HBase dev @Hortonworks Rajeshbabu Chintaguntla - Phoenix PMC and HBase Committer - HBase dev @Hortonworks
  • 3. 3 © Hortonworks Inc. 2011–2018. All rights reserved Outline Recent releases Versioning / Compatibility Changes in behavior Major Features in HBase-2.0 Phoenix- 5.0
  • 4. 4 © Hortonworks Inc. 2011–2018. All rights reserved Outline Recent releases Versioning / Compatibility Changes in behavior Major Features in HBase-2.0 Phoenix-5.0 Picture taken from: https://www.ostraining.com/blog/joomla/content-versioning-a-sneak-peek-at-joomla-s-newest-proposed-feature/
  • 5. 5 © Hortonworks Inc. 2011–2018. All rights reserved 2018 (repo and releases) (master) 3.0.0-SNAPSHOT (branch-1) 1.5.0-SNAPSHOT (branch-1.4) 1.4.3 1.3.21.3.1 (branch-1.3) 1.2.0 1.2.6 1.1.0 1.1.13 (branch-1.2) (branch-1.1) (branch-2) 2.1.0-SNAPSHOT 1.4.4 2.0.-beta-1-2 2.0.0 2.1.0 (expected) 3.0.0 1.4.5
  • 6. 6 © Hortonworks Inc. 2011–2018. All rights reserved How to choose a release • If you are on 0.98.x or earlier, time to upgrade! • 1.0 is EOL’ed. Use 1.3.x at least • Both 1.3.2 and 1.4.4 are pretty stable • Moving between minor versions is easy for 1.x • Starting from scratch, use 1.4.4 or 2.0.1(RC)/2.1.0(planned) Image is taken from:http://meritcd.com/blogs/improve-your-decision-making-improve-your-leadership-2/
  • 7. 7 © Hortonworks Inc. 2011–2018. All rights reserved Outline Recent releases Versioning / Compatibility Changes in behavior Major Features in HBase-2.0 Phoenix-5.0
  • 8. 8 © Hortonworks Inc. 2011–2018. All rights reserved Semantic Versioning Starting with the 1.0 release, HBase works toward Semantic Versioning MAJOR.MINOR.PATCH[-identifiers] PATCH: only BC bug fixes. MINOR: BC new features MAJOR: Incompatible changes
  • 9. 9 © Hortonworks Inc. 2011–2018. All rights reserved To be, or not to be (Compatible)
  • 10. 10 © Hortonworks Inc. 2011–2018. All rights reserved To be, or not to be (Compatible) • Compatibility is NOT a simple yes or no • Many dimensions • source, binary, wire, command line, dependencies etc • Read https://hbase.apache.org/book.html#upgrading
  • 11. 11 © Hortonworks Inc. 2011–2018. All rights reserved Major Minor Patch Client-Server Wire Compatibility ✗ ✓ ✓ Server-Server Compatibility ✗ ✓ ✓ File Format Compatibility ✗* ✓ ✓ Client API Compatibility ✗ ✓ ✓ Client Binary Compatibility ✗ ✗ ✓ Server Side Limited API Compatibility ✗ ✗*/✓* ✓ Dependency Compatibility ✗ ✓ ✓ Operation Compatibility ✗ ✗ ✓ Compatibilities
  • 12. 12 © Hortonworks Inc. 2011–2018. All rights reserved HBase API surface • Client API (For Application Developer) • Explicitly marked with InterfaceAudience.Public • Get/Put/Table/Connection, etc • LimitedPrivate API (For Advanced user) • Explicitly marked with InterfaceAudience.LimitedPrivate • Coprocessors, replication APIs • Private API (For HBase Internal use) • Explicitly marked with InterfaceAudience.Private • All other classes not marked • Also InterfaceAudience.{Stable,Evolving,Unstable}
  • 13. 13 © Hortonworks Inc. 2011–2018. All rights reserved Outline Recent releases Versioning / Compatibility Changes in behavior Major Features in HBase-2.0 Phoenix-5.0
  • 14. 14 © Hortonworks Inc. 2011–2018. All rights reserved Changes in behavior – HBase-2.0 • JDK-8+ only • Hadoop-2.7+ (Hadoop-3 support) • Filter and Coprocessor changes • Managed connections are gone. • “Rolling upgradable not supported” from 1.x • Some DDL / Admin operations may not work with the old client • Updated dependencies • Deprecated client interfaces are gone https://docs.google.com/document/d/1WCsVlnHjJeKUcl7wHwqb4z9iEu_ktczrlKHK8N4SZzs/edit#heading=h.v21r9nz8g01j
  • 15. 15 © Hortonworks Inc. 2011–2018. All rights reserved HBase 0.98 Name(s) HBase 1.x + 2.x name(s) HConnectionManager, ConnectionManager ConnectionFactory HConnection, ClusterConnection Connection HBaseAdmin Admin HTable(HTableInterface) Table RegionLocator HTableDescriptor, HColumnDescriptor, HRegionInfo TableDescriptor, ColumnFamilyDescriptor, RegionInfo API Changes
  • 16. 16 © Hortonworks Inc. 2011–2018. All rights reserved Outline Recent releases Versioning / Compatibility Changes in behavior Major Features in HBase-2.0 Phoenix-5.0
  • 17. 17 © Hortonworks Inc. 2011–2018. All rights reserved Offheaping Read Path • Goals • Make use of all available RAM • Reduce temporary garbage for reading • Make bucket cache as fast as LRU cache • HBase block cache can be configured as L1 / L2 • “Bucket cache” can be backed by onheap, off-heap or SSD • Different sized buckets in 4MB slices • HFile blocks placed in different buckets • Earlier Cell comparison only used to deal with byte[] and copy the whole block to heap
  • 18. 18 © Hortonworks Inc. 2011–2018. All rights reserved https://www.slideshare.net/HBaseCon/offheaping-the-apache-hbase-read-path
  • 19. 19 © Hortonworks Inc. 2011–2018. All rights reserved https://www.slideshare.net/HBaseCon/offheaping-the-apache-hbase-read-path
  • 20. 20 © Hortonworks Inc. 2011–2018. All rights reserved Offheaping Read Path https://blogs.apache.org/hbase/entry/offheaping_the_read_path_in1
  • 21. 21 © Hortonworks Inc. 2011–2018. All rights reserved Offheaping Read Path Offheap Read-Path in Production - The Alibaba story (https://blogs.apache.org/hbase/)
  • 22. 22 © Hortonworks Inc. 2011–2018. All rights reserved Offheaping Write Path • Reduce GC, improve throughput predictability • Use off-heap buffer pools to read RPC requests • Remove garbage created in Codec parsing, MSLAB copying, etc • MSLAB is used for heap fragmentation • Now MSLAB can allocate offheap pooled buffers [2MB]
  • 23. 23 © Hortonworks Inc. 2011–2018. All rights reserved Offheaping Write Path • Allows bigger total memstore space • Cell data goes off-heap • Cell metadata (and CSLM#Entry, etc) is on-heap (~100 byte overhead) • Async WAL is used to avoid copying Cells into heap • Works with new “Compacting Memstore”
  • 24. 24 © Hortonworks Inc. 2011–2018. All rights reserved Compacting Memstore • In memory flushes • Reduce index overhead per cell • Gains proportional to cell size • In memory compaction • Eliminate duplicates • Gains are proportional to duplicates • Reduce flushes to disk • Reduce file creation • Reduce write amplification
  • 25. 25 © Hortonworks Inc. 2011–2018. All rights reserved Compacting Memstore https://www.slideshare.net/HBaseCon/apache-hbase-accelerated-inmemory-flush-and-compaction
  • 26. 26 © Hortonworks Inc. 2011–2018. All rights reserved 26
  • 27. 27 © Hortonworks Inc. 2011–2018. All rights reserved https://www.slideshare.net/HBaseCon/apache-hbase-accelerated-inmemory-flush-and-compaction
  • 28. 28 © Hortonworks Inc. 2011–2018. All rights reserved Async Client • A complete new client maintained in HBase • Different than OpenTSDB/asynchbase • Fully async end to end • Based on Netty and JDK-8 CompletableFutures • Includes most critical functionality including Admin / DDL • Some advanced functionality (read replicas, etc) are not done yet • Have both sync and async client.
  • 29. 29 © Hortonworks Inc. 2011–2018. All rights reserved Async Client
  • 30. 30 © Hortonworks Inc. 2011–2018. All rights reserved Region Assignment • Region assignment is one of the pain points • Complete overhaul of region assignment • Based on the internal “Procedure V2” framework • Series of operations are broken down into states in State Machine • State’s are serialized to durable storage • Uses hbase:meta to store the state of region assignments • States of DDLs are stored in WAL on HDFS • Backup master can recover the state from where left off
  • 31. 31 © Hortonworks Inc. 2011–2018. All rights reserved Backup / Restore • Need a reliable native Backup mechanism
  • 32. 32 © Hortonworks Inc. 2011–2018. All rights reserved Backup / Restore • Full backups and Incremental backups • Backup destination to a remote FS • HDFS, S3, ADLS, WASB, and any Hadoop-compatible FS • Full backup based on “snapshots” • Incremental backup ships Write-Ahead-Logs • Backup + Replication are complimentary (for a full DR solution)
  • 33. 33 © Hortonworks Inc. 2011–2018. All rights reserved Backup / Restore • Flexible restore options
  • 34. 34 © Hortonworks Inc. 2011–2018. All rights reserved FileSystem Quotas • HBase supports quotas • Request count or size per sec,min,hour,day (per server) • Num tables / num regions in namespace • New work enables quotas for disk space usage • Master periodically gathers info about disk space usage • Regionservers enforce limit when notified from master • Limits per table or per namespace (not per user) • Violation policy: {DISABLE, NO_WRITES_COMPACTIONS, NO_WRITES, NO_INSERTS}
  • 35. 35 © Hortonworks Inc. 2011–2018. All rights reserved FileSystem Quotas • Much more complex when snapshots are in the picture (similar to quotas with hard links in FS) • Snapshot usage is counted against the originating table • A File is only counted once (even original table, or snapshot or restored table refers to it) • More info: HBASE-16961 => ‘prod_website’, LIMIT => ‘7125GB’ hbase> set_quota TYPE => SIZE, VIOLATION_POLICY =>DELETES_WITH_COMPACTIONS, NAMESPACE => ‘prod_website’, LIMIT => ‘7125GB’
  • 36. 36 © Hortonworks Inc. 2011–2018. All rights reserved Will be released for the first time • These will be released for the first time in Apache (although other backports exist) • Medium Object Support • Region Server groups • Favored Nodes enhancements • HBase-Spark module • WAL’s and data files in separate FileSystems
  • 37. 37 © Hortonworks Inc. 2011–2018. All rights reserved Outline Recent releases Versioning / Compatibility Changes in behavior Major Features in HBase-2.0 Phoenix- 5.0
  • 38. 38 © Hortonworks Inc. 2011–2018. All rights reserved Phoenix 5.0 - Outline • Phoenix releases • Index Improvements. • Join Improvements. • Major Features
  • 39. 39 © Hortonworks Inc. 2011–2018. All rights reserved 2018 (repo and releases) (master) 4.15.0-SNAPSHOT (5.x-HBase-2.0) 5.0.0-SNAPSHOT 5.0.0 4.15.04.14.0 5.0.0-alpha-1 4.13.0 5.x will become master after 5.0 release
  • 40. 40 © Hortonworks Inc. 2011–2018. All rights reserved Join Improvements
  • 41. 41 © Hortonworks Inc. 2011–2018. All rights reserved Cost based Optimizations for Joins • Phoenix support two Join algorithms • Hash Join • Sort Merge Join. • Prior to 4.14.0 user need to explicitly Hint compiler to use Sort Merge Join when required. • From 4.14.0 onwards best join algorithm selection happen automatically based on query join predicate and ordering
  • 42. 42 © Hortonworks Inc. 2011–2018. All rights reserved • Sort merge join wins over Hash Join when both the children are ordered. • Hash-join wins over sort-merge-join w/ smaller side ordered. • Hash-join wins over sort-merge-join w/o any side ordered. • Hash-join wins over sort-merge-join w/ both sides ordered in an order-by query – client need to sort merge the results. • Multi-table join: a mix of join strategies chosen based on cost.
  • 43. 43 © Hortonworks Inc. 2011–2018. All rights reserved Index Improvements
  • 44. 44 © Hortonworks Inc. 2011–2018. All rights reserved Index Write Failure Handling • Phoenix supports many strategies for handling index failures. • Blocking writes to data table until index restored to consistent state. • Disable index with timestamp at which failure occurred and rebuild index in the background until consistent state reached. • Disable index and manual index rebuilding from MR tool to avoid overhead at server. • Kill the RS so that the index updates replayed or recovered to consistent state.
  • 45. 45 © Hortonworks Inc. 2011–2018. All rights reserved Index Write Failure Handling (Cont’d..) Client Retries On Index Failure • Prior to Phoenix 5.0.0 the retry happen at server on index write failures and disable index even if the write not succeeded after multiple retries. • This adds unnecessary overhead to server and block the handlers. • Now the client handles the retries without any blocking.
  • 46. 46 © Hortonworks Inc. 2011–2018. All rights reserved Incremental Index Rebuilding • When data accumulated to rebuild is more(hours or days of data) after index failures then it’s difficult to rebuild the indexes in single pass. • So in latest version the index rebuilding happen in batches with check pointing by timestamp so that any failures happen in the middle rebuilding starts from last successful checkpoint. • Automatic rebuilding stops when user initiates manual index rebuilding.
  • 47. 47 © Hortonworks Inc. 2011–2018. All rights reserved MR based Index Rebuilding • Alter index with REBUILD ASYNC option. • Use IndexTool to rebuild the index.
  • 48. 48 © Hortonworks Inc. 2011–2018. All rights reserved Global Index performance Improvement • Presplit index table before asynchronous index building. • Until unless we know about the indexed column value data set it’s difficult to pre define the boundaries of index table. • Automatic process has implemented in 4.14.0 version where tool identifies the approximate split keys by collecting the index key values from data table sample(or full table) where the sampling rate can be changed from 0 to 100%. • The tool builds Equi-depth histogram to identify the approximate index region boundaries.
  • 49. 49 © Hortonworks Inc. 2011–2018. All rights reserved Local Index performance Improvements • As of now when we scan through local index we need to touch base all the regions irrespective of filters. • In the current version we can filter the regions to scan for local indexes based on data table leading primary key filter conditions.
  • 50. 50 © Hortonworks Inc. 2011–2018. All rights reserved Kafka Integration
  • 51. 51 © Hortonworks Inc. 2011–2018. All rights reserved Apache Kafka Integration • Apache Kafka plugin helps to reliably and efficiently stream large amounts of data onto HBase using Phoenix API. As part of this plugin we are providing PhoenixConsumer to receive messages from KafkaProducer.
  • 52. 52 © Hortonworks Inc. 2011–2018. All rights reserved Major Features
  • 53. 53 © Hortonworks Inc. 2011–2018. All rights reserved Support for GRANT and REVOKE commands • Internally uses HBase ACLs only, possible permissions are R - Read, W - Write, X - Execute, C - Create and A - Admin. • Access needs to be given on data table which automatically get propagated to all its indexes and views • No need to have WRITE access on SYSTEM.CATALOG table now for DDLs • New Indexes will be given same permission as data table.
  • 54. 54 © Hortonworks Inc. 2011–2018. All rights reserved Query log • Run time information of the query is logged in the Phoenix table(SYSTEM.LOG) • There are around 40+ columns of information to help in analzing the query at any point of time. • Logging is done Asynchronously and can be controlled through log level(TRACE,DEBUG,INFO,OFF)
  • 55. 55 © Hortonworks Inc. 2011–2018. All rights reserved Python driver for Phoenix Query Server • It implements the Python DB 2.0 API to access Phoenix via the Phoenix Query Server • Example
  • 56. 56 © Hortonworks Inc. 2011–2018. All rights reserved Persistent Subquery Results Cache - WIP • Persisting expensive subqueries data beyond life of query helps in significant performance gain when the same subquery repeated in multiple queries. • Make use of coprocessor based server cache to avoid caching at client. • WIP at https://issues.apache.org/jira/browse/PHOENIX-4666
  • 57. 57 © Hortonworks Inc. 2011–2018. All rights reserved Upgrades/New • Apache Omid Integration for Transactions Support( along with Tephra) • Hive 3.0.0 • Hadoop 3.0.0 • Spark 2.3.0 • Tephra 0.14.0 • Avatica 0.11.0
  • 58. 58 © Hortonworks Inc. 2011–2018. All rights reserved Questions?
  • 59. 59 © Hortonworks Inc. 2011–2018. All rights reserved Thank you

Notes de l'éditeur

  1. it may be possible that you will use less physical resources on latest releases than the old ones because of performance benefits.
  2. from personal opinion, always better to use last release with a patch no., as it is expected to be more stable, if you are not in hurry of getting new features
  3. Dependency Compatibility: An upgrade of HBase will not require an incompatible upgrade of a dependent project, including the Java runtime. Operational Compatibility: Metric changes, Behavioral changes of services, JMX APIs exposed via the /jmx/ endpoint Client Binary compatibility: Client code written using APIs doesnot recompilation
  4. Rolling upgrade you can’t run DDL with old client
  5. Each thread does 100 row multigets 100k, 10M rows Offheal size : 105 GB Single node cluster Througput 41k gets/second 25micro second per get
  6. ByteBuffer classes are extended to optimize the storage of offset and length NIO direct byte buffers are performed better than netty byte buffer in JMH comparison Cell class is extended with new methods for accessing ByteBuffer from offheap directly without copying , these methods are kept private and has no change from public interface for the cell. Some decision like using abstract class instead of interface for ByteBufferCell was decided on performance benchmarks because of Java L2 compiler inlining which was happening with interface but happening with abstract class. Graph was bad because of GC pauses
  7. To optimize java socket implementation during reading RPC requests where the data is temp copy from onheap to offheap buffer before copying to destination on heap buffer, with offheap buffer pools we can skip this temp copy and read RPC requests directly in offheap buffer. MSLAB pooling allocates fixed size byte buffer similar to traditional MSLAB pooling of onheap byte array to avoid fragmentation of memory. We create a pooll of fixed size buffers , and cells are stored in pipeline of these buffers ,
  8. MS CSLM is concurrentSkipListMap (it is a memstore data structure). To avoid heap usage by CSLM metadata, New Data structure is used to keep the cells in flattened array.
  9. Purpose of inmemory flushes is to have data stay more in memory for faster retrievel. Data structure is optimized to reduce index overhead of CSLM
  10. ComplimentsTephra ACID properties are Hard to scale when databases have to deal with very large amounts of data and thousands of concurrent users because the data must be partitioned, distributed and replicated. OMID provides lock-free transactional support on top of Hbase with snapshot isolation guarantees.