32.
import msgpack

class MemcacheList(object):
    def push(self, key, value):
        """ Add an element to the end of the list """
        packed = msgpack.packb(value)
        # memcached append adds the bytes after the existing value
        self.connection.append(key, packed)

    def _unpack(self, data):
        # '\x90' is the msgpack encoding of an empty array
        if data == '\x90':
            return [], 0
        _unpacker = msgpack.Unpacker()
        _unpacker.feed(data)
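The fragment above stores a list under one key by appending serialized elements and decoding them as a stream. A minimal, self-contained sketch of the same idea follows. It swaps msgpack for the standard-library `json` module so that it runs without extra packages, and `FakeConnection` and `JsonList` are hypothetical names standing in for a real memcache client and for `MemcacheList`; neither is part of the original slides.

```python
import json


class FakeConnection(object):
    """Hypothetical in-memory stand-in for a memcache client."""

    def __init__(self):
        self.store = {}

    def append(self, key, data):
        # Real memcached only appends to an existing key; this stand-in
        # creates the key on first use to keep the sketch short.
        self.store[key] = self.store.get(key, "") + data

    def get(self, key):
        return self.store.get(key, "")


class JsonList(object):
    """Append-only list kept as one concatenated string per key."""

    def __init__(self, connection):
        self.connection = connection

    def push(self, key, value):
        # Serialize one element and append it, newline-terminated.
        self.connection.append(key, json.dumps(value) + "\n")

    def items(self, key):
        # Decode the concatenated elements one at a time.
        data = self.connection.get(key)
        decoder = json.JSONDecoder()
        result, pos = [], 0
        while pos < len(data):
            value, pos = decoder.raw_decode(data, pos)
            result.append(value)
            pos += 1  # skip the newline written by push
        return result
```

The appeal of this layout is that a push is a single append call, with no read-modify-write cycle on the client; the cost is that reading the list means decoding the whole value.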
67. 1. How do I collect my data and how do I store it?
• Stream (access logs, standard error)
• Bulk (historical data, sales transactions, etc.)
• Secure and reliable storage!
68. [Diagram: Client / Server. Labels: Apache, App, RDBMS, Other data sources (csv, json), Treasure Data API Layer.]
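As an illustration of the bulk path with csv and json inputs, the sketch below turns CSV text with a header row into one JSON record per line. The function name `csv_to_json_lines` and the sample columns are assumptions made for this example; it does not call or represent the Treasure Data API.

```python
import csv
import io
import json


def csv_to_json_lines(csv_text):
    """Convert CSV text with a header row into one JSON object per row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # sort_keys keeps the output stable regardless of column order handling
    return [json.dumps(row, sort_keys=True) for row in reader]


if __name__ == "__main__":
    # Hypothetical sales-transaction rows, used only for demonstration.
    sample = "id,amount\n1,9.99\n2,5.00\n"
    for line in csv_to_json_lines(sample):
        print(line)
```

Note that `csv.DictReader` yields every field as a string, so any numeric typing would have to be applied separately.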
69. 2. How do I query my data?
Ad hoc queries
Scheduled queries
Data schema?
70. [Diagram. Labels: User; Cmdline, console; Apps (JDBC, ODBC, REST); Query API; Processing Layer: HIVE, PIG (to be supported); Cluster; MapReduce Jobs; Hadoop cluster; Amazon S3.]
72. 3. How do different users in my org access query results?
Different roles need to access results from different interfaces
• Analysts -> Excel
• Devs -> REST, MySQL
73. [Diagram: Treasure Data delivers results to Analysts via Google Spreadsheet and ODBC -> Excel (Coming Q1), and to Engineers via MySQL, Postgres, JDBC, REST API, and POST to web server.]
74. 4. How do I scale?
More data?
More queries?
76. [Chart: Number of records in TD (in billions); y-axis from 20 to 120; x-axis from Sep 2011 to Aug 2012.]
January 2013 – Now over 200 Billion!
77. Treasure Data High-Level Architecture
[Diagram: data sources (Log Data, Application Data, 3rd Party Data, Databases, Sensor Data, Web/Mobile Data) flow through td-agent into the Treasure Data Data Warehouse. Labels on the output side: Subscribe, SQL, Operational Analytics, Interface, JDBC, ODBC, CLI, Spread Sheets, BI Tools.]
78. Our Customers – Fortune Global 500 leaders and start-ups including:
79. Japan’s #1 recipe website
15 million users
1 million recipes
<<<NOTE>>> We cannot disclose some customers' names here, including some of the world's largest enterprises and one of the world's largest web companies.