4. Motivation
§ Traditional DBMS:
• SQL ✓
• Consistent ✓
• Scalable ✗
§ NoSQL stores:
• SQL ✗
• Consistent ✗
• Scalable ✓
§ Some use cases require
• Scalability
• Consistent updates
5. Motivation
§ Lower latency
• MapReduce latencies measured in hours
• Low-latency systems usually don't provide strong consistency guarantees
§ Requested feature
• Support for transactions in HBase is a recurrent request
§ Percolator
• Google implemented transactions on BigTable
9. Omid
§ ‘Hope’ in Persian
§ Optimistic Concurrency Control system
• Lock free
• Aborts transactions at commit time
§ Implements Snapshot Isolation
§ Low overhead
§ https://github.com/yahoo/omid
10. Snapshot Isolation
§ Each transaction reads from its own snapshot
§ Snapshot:
• Immutable
• Identified by creation time
• Contains all values committed before creation time
§ Transactions conflict if two conditions apply:
A) Write to the same row
B) Overlap in time
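§ Stated compactly (a sketch, not from the slides; W(T) is T's write set, [s, c] its start-to-commit interval):

  \text{conflict}(T_1, T_2) \iff W(T_1) \cap W(T_2) \neq \emptyset \,\wedge\, [s_1, c_1] \cap [s_2, c_2] \neq \emptyset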
11. Overlapping transactions
[Figure: Transactions 1–4 on a time axis; Transaction 2 overlaps 1 and 3, Transaction 4 starts after the others end]
§ Transaction 2 could conflict with 1 or 3
• If they write to the same row
§ Transaction 4 doesn’t conflict
• No overlapping transactions
12. Simple API
§ Based on Java Transaction API and HBase API
§ TransactionManager:
Transaction begin();
void commit(Transaction t) throws RollbackException;
void rollback(Transaction t);
§ TTable:
Result get(Transaction t, Get g);
void put(Transaction t, Put p);
ResultScanner getScanner(Transaction t, Scan s);
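§ A minimal usage sketch of the API above (how the TransactionManager tm and the TTable table are obtained is deployment-specific and assumed here):

  Transaction t = tm.begin();
  try {
      Put p = new Put(Bytes.toBytes("row1"));
      p.add(Bytes.toBytes("cf"), Bytes.toBytes("col"), Bytes.toBytes("v"));
      table.put(t, p);              // write buffered under transaction t
      tm.commit(t);                 // throws RollbackException on conflict
  } catch (RollbackException e) {
      // aborted at commit time (optimistic concurrency); caller may retry
  }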
14. Architecture
1. Centralized server
2. Transactional metadata replicated to clients
3. Store transaction ids in HBase timestamps
[Diagram: HBase clients connect to the centralized Omid Status Oracle (SO) (1); transactional metadata is replicated from the SO to the clients (2); clients read and write the HBase Region Servers directly, with transaction ids stored in HBase timestamps (3)]
15. Omid Performance
§ Centralized server = bottleneck?
• Focus on good performance
• In our experiments, it never became the bottleneck
[Plot: latency in ms (0–30) vs. throughput in TPS (1K–100K) for transaction sizes of 2, 4, 8, 16, 32, 128, and 512 rows]
16. Metadata Replication
§ Transactional metadata is replicated to clients
§ Clients guarantee Snapshot Isolation
• Ignore data not in their snapshot
• Talk directly to HBase
• Conflicts resolved by the Status Oracle
§ Expensive, but scalable up to 1000 clients
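§ A sketch of the visibility rule each client applies (names are illustrative, not Omid's actual API):

  // A cell written at timestamp writeTs is in transaction r's snapshot
  // only if the writer committed before r started.
  boolean isVisible(long writeTs, Transaction r) {
      Long commitTs = commitMetadata.get(writeTs);  // replicated from the SO
      return commitTs != null && commitTs < r.getStartTimestamp();
  }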
17. Fault Tolerance
§ Omid uses BookKeeper for fault tolerance
• BookKeeper is a Distributed Write Ahead Log
§ Before answering a commit request
• Log it to BookKeeper asynchronously
• Wait for a reply
• Notify the client
§ If the Status Oracle crashes
• Recover the state from BookKeeper
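§ A sketch of the commit path using BookKeeper's asyncAddEntry (commitRecord, replyCommitted, and replyAborted are illustrative names):

  // Ack the client only after the commit record is durable in the log.
  lh.asyncAddEntry(commitRecord, (rc, handle, entryId, ctx) -> {
      if (rc == BKException.Code.OK) {
          replyCommitted(client);
      } else {
          replyAborted(client);     // log write failed; don't ack the commit
      }
  }, null);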
18. Fault Tolerance Overhead
§ Omid batches writes to BookKeeper (see the sketch below)
• Flush every 5 ms or when the batch exceeds 1 KB
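§ Flush-policy sketch (illustrative names; the real batching logic lives inside the Status Oracle):

  // Send the pending batch when it exceeds 1 KB or 5 ms have elapsed.
  if (pending.size() > 1024 || clock.millis() - lastFlushMillis >= 5) {
      flushToBookKeeper(pending);
      lastFlushMillis = clock.millis();
      pending.reset();
  }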
§ Recovery: reads log’s tail (bounded time)
[Plot: throughput in TPS over time in s; throughput drops during a Status Oracle failure and recovers, with downtime < 25 s]
20. TF-IDF
§ Term Frequency – Inverse Document Frequency
• How important a word is to a document
• Useful for search engines
§ Given a set of words (query)
• Return documents with highest TF-IDF
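§ For reference, the standard formulation (the slides don't fix a variant; N is the number of documents, df(t) the number containing term t):

  \mathrm{tfidf}(t, d) = \mathrm{tf}(t, d) \cdot \log \frac{N}{\mathrm{df}(t)}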
21. TF-IDF on Tweets
§ Given a set of words
• Return relevant HashTags
§ Document
• A collection of tweets with the same hashtag
§ Update the index incrementally
22. Implementation
§ Read the tweet stream, put tweets in a queue
§ Workers process each tweet in parallel
• One transaction per tweet
• Update frequencies consistently
• In case of abort, retry
§ Queries have a consistent view of the database
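§ Worker-loop sketch using the API from slide 12 (queue, updateFrequencies, and the unbounded retry policy are illustrative):

  while (running) {
      Tweet tweet = queue.take();          // blocking read from the queue
      for (;;) {
          Transaction t = tm.begin();
          try {
              updateFrequencies(t, tweet); // TTable puts on tf/df counters
              tm.commit(t);
              break;                       // committed; move to next tweet
          } catch (RollbackException e) {
              // write-write conflict with another worker; retry the tweet
          }
      }
  }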
23. Problems
§ Hard to distribute load to other machines
§ Complex processing causes big transactions
• More likely to conflict
• More expensive to retry
§ API is too low level for data processing
25. Future work
§ Framework for Incremental Processing
• Simpler API
• Trigger-based, easy to decompose operations
• Auto-scaling
§ Integrate Omid with other Data Stores
26. Contributors
Ben Reed
Flavio Junqueira
Francisco Pérez-Sorrosal
Ivan Kelly
Matthieu Morel
Maysam Yabandeh