12. Why?
You can scan through all data and use filters
for selecting specific data
But scanning with start & stop row speeds
things up (a lot)
13. HBase rowkey design
Do I need a fast user or a fast timespan
lookup?
User - clientid,ts<,connectionId>
Timespan - ts,clientid<,connectionId>
14. Inverse Timestamps
Data in HBase is stored lexicographicaly
sorted
Normal TS - scan would yield oldest results
first
Inverse TS - newer entries come first (and you
can cancel the scan if you have enough data)
21. Batch Processing
Calculate how many users are active on
platform A and also on B
Get Traffic of all Questions belonging to
Channel X sorted by Country
26. Recommendations @
GF.net
User emit signals on questions
view, like, gives answer, answer is voted best
Application sends signals through RabbitMQ
to recommendation servers