4. Of every type
• Regular: Machines and sensors
• Irregular: Web and machine events
• Forward-looking: Logistics and forecasting
• Derived data: Inferences from AI/ML models
7. Existing databases don’t work for time series
• Relational databases: hard to scale
• NoSQL databases: every other time-series database today is NoSQL; they underperform on complex queries, are hard to use, and lead to data silos
12. Postgres 9.6.2 on Azure standard DS4 v2 (8 cores), SSD (premium LRS storage)
Each row has 12 columns (1 timestamp, 1 indexed host ID, 10 metrics)
Hard to scale
14.–17. B-tree Insert Pain
[Diagram sequence: batches are inserted into a B-tree (node values include 1, 5, 10, 13, 17, 20, 24, 25, 29) with a memory capacity of only 2 nodes. Each batch lands in a different part of the tree, so in-memory nodes must repeatedly be written to disk and re-read.]
18. Challenge in scaling up
• Indexes write to random parts of B-tree
• As table grows large
– Indexes no longer fit in memory
– Random writes cause swapping
[Diagram: two inserts at the same instant, Device A and Device Z at 01:01:01, land at opposite ends of a (Device, Time DESC) index.]
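The random-write problem above can be reproduced on vanilla Postgres with the composite index from the diagram; a minimal sketch (table and column names are illustrative):

```sql
-- Plain Postgres table with the composite index from the slide.
-- Once the index outgrows memory, simultaneous inserts for
-- devices A..Z land on widely separated B-tree pages,
-- turning sequential ingest into random I/O and swapping.
CREATE TABLE readings (
    device text,
    time   timestamptz,
    value  float
);
CREATE INDEX ON readings (device, time DESC);
```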
20. TimescaleDB: scalable time-series database, full SQL, packaged as a PostgreSQL extension

Scale & Performance
• Ingest millions of datapoints per second
• Scale to 100s of billions of rows
• Elastically scale up and out
• Faster than Influx, Cassandra, Mongo, vanilla Postgres

Proven & Enterprise Ready
• Inherits 20+ years of PostgreSQL reliability
• Streaming replication, HA, backup/recovery
• Data lifecycle: continuous rollups, retention, archiving
• Enterprise-grade security

SQL for Time Series
• Zero learning curve
• Zero friction: existing tools and connectors work
• Enrich understanding: JOIN against relational data
• Freedom for data model, no cardinality issues
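"Packaged as a PostgreSQL extension" is literal; loading it is standard Postgres (assuming timescaledb has been added to shared_preload_libraries):

```sql
-- TimescaleDB installs like any other PostgreSQL extension:
CREATE EXTENSION IF NOT EXISTS timescaledb;
```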
21. >20x faster than PostgreSQL on batch inserts: 1.11M metrics/s
TimescaleDB 0.5, Postgres 9.6.2 on Azure standard DS4 v2 (8 cores), SSD (LRS storage)
Each row has 12 columns (1 timestamp, 1 indexed host ID, 10 metrics)
22. TimescaleDB vs. PostgreSQL speedup
Table scans, simple column rollups    ~0-20%
GROUP BYs                             20-200%
Time-ordered GROUP BYs                400-10,000x
DELETEs                               2,000x
TimescaleDB 0.5, Postgres 9.6.2 on Azure standard DS4 v2 (8 cores), SSD (LRS storage)
Each row has 12 columns (1 timestamp, 1 indexed host ID, 10 metrics)
24. NoSQL champion: Log-Structured Merge Trees
• Key-value store with indexed key lookups at high write rates
• Compressed data storage
• Common approach for time series: use key <name, tags, field, time>
25. NoSQL + LSMTs Come at a Cost
• Significant memory overhead
• Lack of secondary indexes / tag lock-in
• Less powerful queries
• Weaker consistency (no ACID)
• No JOINs
• Loss of SQL ecosystem
39. A hypertable: many chunks, but treated like a single table
• Indexes
• Triggers
• Constraints
• Foreign keys
• UPSERTs
• Table mgmt
40. TimescaleDB: Easy to Get Started
CREATE TABLE conditions (
time timestamptz,
temp float,
humidity float,
device text
);
SELECT create_hypertable('conditions', 'time', 'device', 4,
  chunk_time_interval => interval '1 week');
INSERT INTO conditions
VALUES ('2017-10-03 10:23:54+01', 73.4, 40.7, 'sensor3');
SELECT * FROM conditions;
time | temp | humidity | device
------------------------+------+----------+---------
2017-10-03 11:23:54+02 | 73.4 | 40.7 | sensor3
41. Hypertables create partitions automatically at runtime, avoiding manual work like the following:
CREATE TABLE conditions (
time timestamptz,
temp float,
humidity float,
device text
);
CREATE TABLE conditions_p1 PARTITION OF conditions
FOR VALUES FROM (MINVALUE) TO ('g')
PARTITION BY RANGE (time);
CREATE TABLE conditions_p2 PARTITION OF conditions
FOR VALUES FROM ('g') TO ('n')
PARTITION BY RANGE (time);
CREATE TABLE conditions_p3 PARTITION OF conditions
FOR VALUES FROM ('n') TO ('t')
PARTITION BY RANGE (time);
CREATE TABLE conditions_p4 PARTITION OF conditions
FOR VALUES FROM ('t') TO (MAXVALUE)
PARTITION BY RANGE (time);
-- Create time partitions for the first week in each device partition
CREATE TABLE conditions_p1_y2017m10w01 PARTITION OF conditions_p1
FOR VALUES FROM ('2017-10-01') TO ('2017-10-07');
CREATE TABLE conditions_p2_y2017m10w01 PARTITION OF conditions_p2
FOR VALUES FROM ('2017-10-01') TO ('2017-10-07');
CREATE TABLE conditions_p3_y2017m10w01 PARTITION OF conditions_p3
FOR VALUES FROM ('2017-10-01') TO ('2017-10-07');
CREATE TABLE conditions_p4_y2017m10w01 PARTITION OF conditions_p4
FOR VALUES FROM ('2017-10-01') TO ('2017-10-07');
-- Create time index on each leaf partition
CREATE INDEX ON conditions_p1_y2017m10w01 (time);
CREATE INDEX ON conditions_p2_y2017m10w01 (time);
CREATE INDEX ON conditions_p3_y2017m10w01 (time);
CREATE INDEX ON conditions_p4_y2017m10w01 (time);
INSERT INTO conditions VALUES ('2017-10-03 10:23:54+01',
  73.4, 40.7, 'sensor3');
44. Single node: scaling up by adding disks
How: chunks spread across many disks (elastically!), either RAIDed or via distinct tablespaces
Benefit: faster inserts, parallelized queries
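The tablespace route can be sketched with TimescaleDB's attach_tablespace API (available in later releases than the 0.5 shown in the benchmarks; paths, tablespace names, and the 'conditions' hypertable are assumptions):

```sql
-- Each disk becomes a Postgres tablespace; attach them to the
-- hypertable and new chunks are spread across them elastically.
CREATE TABLESPACE disk1 LOCATION '/mnt/disk1/pgdata';
CREATE TABLESPACE disk2 LOCATION '/mnt/disk2/pgdata';
SELECT attach_tablespace('disk1', 'conditions');
SELECT attach_tablespace('disk2', 'conditions');
```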
46. Multi-node: Scaling out across sharded primaries
(Under development)
• Chunks spread across servers
• Insert/query to any server
• Distributed query optimizations
(push-down LIMITs and aggregates, etc.)
48. Avoid querying chunks via constraint exclusion
SELECT time, temp FROM data
WHERE time > now() - interval '7 days'
AND device_id = '12345'
49. Avoid querying chunks via constraint exclusion
SELECT time, device_id, temp FROM data
WHERE time > '2017-08-22 18:18:00+00'
50. Avoid querying chunks via constraint exclusion
SELECT time, device_id, temp FROM data
WHERE time > now() - interval '24 hours'
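Whether exclusion actually kicked in can be checked with EXPLAIN (a sketch; the exact plan output varies by version):

```sql
EXPLAIN SELECT time, device_id, temp FROM data
WHERE time > now() - interval '24 hours';
-- The plan should scan only the chunk(s) covering the last
-- 24 hours, not every chunk in the hypertable.
```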
51. Additional time-based query optimizations
CREATE INDEX ON readings(time);
SELECT date_trunc('minute', time) as bucket,
       avg(cpu)
FROM readings
GROUP BY bucket
ORDER BY bucket DESC
LIMIT 10;
PG doesn't know to use the index for this query; Timescale understands time.
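TimescaleDB's own bucketing function, time_bucket(), is the time-aware counterpart to date_trunc() for queries like this; a sketch of the same rollup:

```sql
SELECT time_bucket('1 minute', time) AS bucket,
       avg(cpu)
FROM readings
GROUP BY bucket
ORDER BY bucket DESC
LIMIT 10;
-- The planner can walk the time index backwards and stop
-- after 10 buckets instead of scanning the whole table.
```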
52. Global queries but local indexes
• Constraint exclusion selects chunks globally
• Local indexes speed up queries on chunks
– B-tree, Hash, GiST, SP-GiST, GIN and BRIN
– Secondary and composite columns, UNIQUE* constraints
53. Optimized for many chunks
• Faster chunk exclusion
– Avoid opening / gathering stats on all chunks during constraint exclusion:
decreased planning time on 4,000 chunks from 600ms to 36ms
• Better LIMITs across chunks
– Avoid requiring one+ tuple per chunk during MergeAppend / LIMIT
54. “ We've been using TimescaleDB for over a year to
store all kinds of sensor and telemetry data as part of
our Power Management database.
We've scaled to 500 billion rows and the performance
we're seeing is monstrous, almost 70% faster queries.”
- Sean Wallace, Software Engineer
500B rows · 400K rows/sec · 50K chunks · 5min intervals
55. Efficient retention policies
SELECT time, device_id, temp FROM data
WHERE time > now() - interval '24 hours'
Drop chunks, don't delete rows: avoids vacuuming
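The retention call itself, in the 0.x-era API used throughout these slides (newer releases use a different, named-argument signature):

```sql
-- Drop all chunks older than 24 hours in one metadata operation;
-- no row-by-row DELETE, so nothing for VACUUM to reclaim.
SELECT drop_chunks(interval '24 hours', 'conditions');
```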
60. Data Retention + Aggregations
Granularity:  raw      15 min    day
Retention:    1 week   1 month   forever
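Built-in continuous aggregates were not yet available at this point, so the 15-minute tier can be materialized by hand; a sketch assuming a second hypertable named conditions_15min:

```sql
-- Periodically roll raw rows up into a coarser hypertable,
-- then let chunk-based retention drop the raw data after a week.
INSERT INTO conditions_15min
SELECT time_bucket('15 minutes', time) AS bucket,
       device,
       avg(temp)     AS avg_temp,
       avg(humidity) AS avg_humidity
FROM conditions
WHERE time > now() - interval '1 day'
GROUP BY bucket, device;
```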
61. Unlock the richness of your monitoring data
Prometheus → Remote Storage Adapter + pg_prometheus → TimescaleDB + PostgreSQL → Grafana
62. pg_prometheus
Prometheus Data Model in TimescaleDB / PostgreSQL
CREATE TABLE metrics (sample prom_sample);
INSERT INTO metrics
VALUES ('cpu_usage{service="nginx",host="machine1"} 34.6 1494595898000');
• Scrape metrics with curl:
curl http://myservice/metrics | grep -v "^#" | psql -c "COPY metrics FROM STDIN"
• New data type prom_sample: <time, name, value, labels>
63. Automate normalized storage
SELECT create_prometheus_table('metrics');

Values table:
Time     | Name | Value | Label Id
---------+------+-------+---------
01:02:00 | CPU  |    90 | 1
01:03:00 | Mem  |  1024 | 1
01:04:00 | CPU  |    70 | 2
01:04:00 | Mem  |   900 | 2
01:04:00 | IO   |    70 | 5

Labels table:
Id | Label
---+-----------------
1  | {host: "h001"}
2  | {host: "h002"}
3  | {host: "1984"}
4  | {host: "super"}
5  | {host: "marshal"}

Labels stored in separate host metadata table
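A query joining the two tables back together might look like this (table and column names are assumptions about pg_prometheus's normalized schema, not taken from the slides):

```sql
SELECT v.time, v.value, l.labels
FROM metrics_values v
JOIN metrics_labels l ON v.labels_id = l.id
WHERE l.metric_name = 'cpu_usage'
  AND l.labels @> '{"host": "h001"}';
```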
64. Easily query auto-created view
SELECT sample
FROM metrics
WHERE time > NOW() - interval '10 min' AND
      name = 'cpu_usage' AND
      labels @> '{"service": "nginx"}';
Columns: | sample | time | name | value | labels |