Introduction to HBase
Anil Gupta
@bigdatanoob
Outline
What is NoSQL?
RDBMS vs NoSQL
HBase
HBase Components
Architecture
HBase Cluster
HBase Data Model
Key -> Value
Region
NoSQL is commonly expanded as "Not Only SQL". These databases are
non-relational. The term was coined in 1998.
They do not use SQL as their primary query language.
NoSQL is not a replacement for relational databases.
NoSQL is designed for distributed data stores.
NoSQL was designed to store semi-structured and sparse data.
Feature                  | NoSQL                                               | RDBMS
Hardware                 | Farm of commodity machines (up to several thousand) | 1-3 high-end or proprietary (costly) machines
Data Type                | Semi-structured and sparse                          | Structured and dense
Data Size                | Petabytes (10^15 bytes)                             | Terabytes (10^12 bytes)
Auto-Sharding            | Yes                                                 | No
Flexible Schema          | Yes                                                 | No
Referential Integrity    | No                                                  | Yes
Support for Joins        | No                                                  | Yes
Support for Aggregations | Basic                                               | Advanced
HBase is an open-source, distributed, versioned,
key-value database modeled after Google's
Bigtable.
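[Image residue: "... is optional for ..."; the logos on either side were not recovered]
HBase offers real-time reads and writes (in milliseconds)
HBase is highly fault tolerant (HA) and scalable
[Figure: (logo not recovered) + Random Read/Write access = (logo not recovered) + Apache ZooKeeper]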
Selling Points of HBase
Highly Scalable
Auto-sharding
Strongly Consistent
Out of the box support for Historical Data
Very high read throughput
Readily compatible with Hadoop
Highly Fault-tolerant (HA)
HBase Components
1. HBase Master (HMaster): HMaster is the Master Server.
• HMaster is responsible for monitoring all RegionServers
• Performs load balancing of regions across RegionServers
• Assigns regions to RegionServers
• All metadata changes go through the Master
• Periodically checks and cleans up the .META. table
• Multiple HMasters can run in a cluster, but only one HMaster is active at any time.
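
To make "assigns regions to RegionServers" concrete, here is a minimal sketch in plain Python. It is an assumption-laden toy, not HBase's actual balancer: the function name assign_regions, the region names, and the server names are all hypothetical, and the only rule modeled is "give the next region to the server that currently hosts the fewest".

```python
import heapq

def assign_regions(regions, servers):
    """Toy assignment: always hand the next region to the least-loaded server.

    Illustration only; this is NOT the algorithm HBase's master uses.
    """
    # Min-heap of (number of regions hosted, server name).
    heap = [(0, server) for server in servers]
    heapq.heapify(heap)
    assignment = {server: [] for server in servers}
    for region in regions:
        load, server = heapq.heappop(heap)
        assignment[server].append(region)
        heapq.heappush(heap, (load + 1, server))
    return assignment

# Hypothetical region and server names.
assignment = assign_regions(["r1", "r2", "r3", "r4", "r5"], ["rs1", "rs2"])
counts = [len(hosted) for hosted in assignment.values()]
```

With five regions and two servers, the per-server counts in this toy differ by at most one, which is the intuition behind load balancing.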
HBase Components (cont.)
2. RegionServer (HRegionServer):
HRegionServer is the implementation of the worker module.
• Runs as a Java service on worker nodes.
• A machine running a RegionServer is considered a worker node.
• Serves get/put/scan requests
• Responsible for splitting and compacting regions
• Runs on a DataNode machine
• Multiple RegionServers run in a cluster
ZooKeeper in HBase
ZooKeeper lets distributed processes coordinate with each other through a
shared hierarchical namespace. It is a distributed and highly reliable service.
In HBase it is responsible for the following:
• Providing the availability status of RegionServers
• Ensuring a single active HMaster in the cluster
• Providing the location of the “-ROOT-” table
• Selecting a new HMaster if the active HMaster fails
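
The "single active HMaster" and failover responsibilities can be pictured with a small in-memory toy. This is a sketch under stated assumptions: ToyCoordinator and its methods are hypothetical names invented for illustration, and nothing here is the ZooKeeper client API or HBase's real election code. It only models the behaviour described on the slide: the first master to register becomes active, later ones wait as standbys, and a standby is promoted when the active one fails.

```python
class ToyCoordinator:
    """In-memory stand-in for the coordination role described above.

    Hypothetical class for illustration; not the ZooKeeper API.
    """

    def __init__(self):
        self.active = None    # at most one active master at any time
        self.standbys = []    # masters waiting to take over, in arrival order

    def register(self, master):
        if self.active is None:
            self.active = master
        else:
            self.standbys.append(master)

    def report_failure(self, master):
        if master == self.active:
            # Promote the longest-waiting standby, if there is one.
            self.active = self.standbys.pop(0) if self.standbys else None
        elif master in self.standbys:
            self.standbys.remove(master)

coordinator = ToyCoordinator()
coordinator.register("master-a")   # hypothetical master names
coordinator.register("master-b")
active_before = coordinator.active
coordinator.report_failure("master-a")
active_after = coordinator.active
```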
HBase Architecture
HBase Cluster
[Diagram labels: Worker Node (DataNode, TaskTracker, HRegionServer / RegionServer), NameNode, HMaster, ZooKeeper]
Column Family and Column Qualifier
Column Family: Column qualifiers in HBase are grouped
into column families.
The colon character (:) separates the column family
from the column qualifier.
The combination <Column Family>:<Column Qualifier> is
equivalent to a column name.
Physically, all column qualifiers of a column family are stored
together on the file system.
• Column qualifiers within a family are sorted lexicographically and
stored together
Example: txn:amt. Here “txn” is the Column Family and “amt” is
the Column Qualifier.
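
The naming rule above can be sketched in a few lines of plain Python. The helper split_column is a hypothetical name used only for illustration (it is not part of any HBase client), and the extra qualifiers "ts" and "currency" are made up to show the sort order.

```python
from typing import Tuple

def split_column(name: bytes) -> Tuple[bytes, bytes]:
    """Split b'family:qualifier' at the first colon (hypothetical helper)."""
    family, sep, qualifier = name.partition(b":")
    if not sep:
        raise ValueError("expected <family>:<qualifier>")
    return family, qualifier

family, qualifier = split_column(b"txn:amt")

# Qualifiers within one family, in lexicographic (byte) order.
qualifiers = sorted([b"ts", b"amt", b"currency"])
```

Here family is b"txn" and qualifier is b"amt"; the sorted qualifiers come out as amt, currency, ts.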
HBase Data Model
• A table maintains data in lexicographic order by RowKey.
• Everything except the table name is stored as a byte array
• Only column families are defined when the table is created
  • Each family can have any number of columns (up to a
    few million)
  • Each row can have different columns in a column family
  • Each column can hold any number of versions
  • Columns only exist when inserted because HBase does
    not have NULL values
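
A small plain-Python sketch may help picture these properties. It is a toy model, not the HBase client: the dictionary layout and the put helper are assumptions made for illustration. It shows two consequences that often surprise newcomers: byte-wise row ordering puts "row10" before "row2", and a column that was never written is simply absent rather than NULL.

```python
rows = {}  # row key (bytes) -> {column name (bytes) -> value (bytes)}

def put(row, column, value):
    """Toy write: a column exists for a row only once it has been inserted."""
    rows.setdefault(row, {})[column] = value

put(b"row2", b"txn:amt", b"15")
put(b"row10", b"txn:amt", b"40")
put(b"row10", b"txn:currency", b"USD")
put(b"row1", b"txn:amt", b"7")

# Rows come back in lexicographic byte order, not numeric order.
scan_order = sorted(rows)

# row2 never stored txn:currency, so the column is absent (no NULL).
has_currency = b"txn:currency" in rows[b"row2"]

# Zero-padding numeric parts is one common way to make byte order
# match numeric order.
padded_order = sorted(f"row{n:04d}".encode() for n in (2, 10, 1))
```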
Key -> Value in HBase
(RowKey, Column Family:Column Qualifier,
Timestamp) is a “Key” in HBase.
A “Value” is stored for each “Key”.
The Timestamp supports storing historical
data (multiple versions of a value).
A table is always indexed on RowKey.
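
The key structure can be sketched as a dictionary keyed by (row key, column, timestamp). This is a toy model under stated assumptions: the names store, put, and get are hypothetical and do not mirror the HBase client API, and timestamps are plain integers chosen for the example.

```python
store = {}  # (row key, b"family:qualifier", timestamp) -> value

def put(row, column, timestamp, value):
    store[(row, column, timestamp)] = value

def get(row, column, max_versions=1):
    """Return up to max_versions (timestamp, value) pairs, newest first."""
    versions = [
        (ts, value)
        for (r, c, ts), value in store.items()
        if r == row and c == column
    ]
    versions.sort(reverse=True)
    return versions[:max_versions]

put(b"row1", b"txn:amt", 100, b"10")
put(b"row1", b"txn:amt", 200, b"25")   # newer version of the same cell

latest = get(b"row1", b"txn:amt")
history = get(b"row1", b"txn:amt", max_versions=2)
```

Writing the same row and column with a newer timestamp does not overwrite the older entry in this model; both remain, which is how the timestamp supports historical data.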
Region
Tables in HBase are divided into multiple Regions.
1 Region = 1 partition of a table
Regions are hosted by RegionServers
1 RegionServer can host hundreds of Regions
A RegionServer can host Regions from multiple
tables.
After a major compaction, every region has 1 HFile
for each column family.
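
Because rows are ordered by RowKey, a region can be thought of as a contiguous range of row keys. The sketch below assumes each region covers the keys from its start key up to (but not including) the next region's start key; the start keys and server names are hypothetical, and the lookup is an illustration of the idea rather than HBase's actual client-side region location logic.

```python
import bisect

# Hypothetical layout: region i covers [region_start_keys[i], region_start_keys[i + 1]).
region_start_keys = [b"", b"g", b"n", b"t"]
region_hosts = ["rs1", "rs2", "rs1", "rs3"]  # one RegionServer may host several regions

def locate(row_key):
    """Return (region index, hosting server) for a row key."""
    index = bisect.bisect_right(region_start_keys, row_key) - 1
    return index, region_hosts[index]

first = locate(b"apple")
boundary = locate(b"g")
last = locate(b"zebra")
```

A key equal to a region's start key belongs to that region, and keys past the last start key fall into the final region.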
Random Facts About HBase
Data in HBase is stored in the HFile format
Values are stored as byte arrays in HFiles
HLog is the file format used for the write-ahead
log (WAL) in HBase.
References
http://hbase.apache.org/
https://hadoop.apache.org/
http://www.larsgeorge.com/2009/10/hbase-architecture-101-storage.html
Questions?
