OSMC 2022 | VictoriaMetrics: scaling to 100 million metrics per second by Aliaksandr Valialkin

© 2022 VictoriaMetrics
VictoriaMetrics:
scaling to 100 million metrics per second
Aliaksandr Valialkin, CTO @ VictoriaMetrics

Let’s meet
● I’m Aliaksandr Valialkin - core developer @ VictoriaMetrics
● I like writing programs in Go
● I like simple and clear code
doSimpleThing1()
doSimpleThing2()
● I hate over-engineered code, useless abstractions and bloated dependencies
abstractSingletonFabricProducerVisitorOperatorPrototype
● I like performance optimizations (fasthttp, fastjson, quicktemplate, fastcache)
● https://github.com/valyala/

What is VictoriaMetrics?
● Open source monitoring solution and time series database
● Supports popular data ingestion protocols - Prometheus, InfluxDB, Graphite, DataDog, OpenTSDB, CSV, JSON
● Can discover and scrape Prometheus targets (Kubernetes too)
● Easy to set up and operate
● Low resource usage
● High performance

VictoriaMetrics kinds
● Single-node - scales vertically
● Cluster - scales horizontally
● Single-node and cluster share the same core code

VictoriaMetrics single-node: scaling data ingestion
● Read incoming data in blocks
● Process blocks in parallel on multiple CPU cores
● Put the parsed data into independent buffers
● Periodically store buffers to disk as independent LSM parts
[Diagram: Client → data blocks → VictoriaMetrics reads data → blocks are parsed on CPU_1 … CPU_N → parsed data is buffered in in-memory buffers Buffer_1 … Buffer_M → buffers are compressed and stored as LSM parts Part_1 … Part_P]
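The four steps above can be sketched as a tiny pipeline. This is only an illustration of the structure, not VictoriaMetrics code: the line format, the function names and the round-robin buffer choice are all assumptions, and Python threads stand in for real multi-core parsing.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_blocks(lines, block_size):
    """Group incoming lines into fixed-size blocks."""
    return [lines[i:i + block_size] for i in range(0, len(lines), block_size)]


def parse_block(block):
    """Parse 'name value timestamp' lines into (name, timestamp, value) rows."""
    rows = []
    for line in block:
        name, value, timestamp = line.split()
        rows.append((name, int(timestamp), float(value)))
    return rows


def ingest(lines, block_size=2, workers=4, buffers_count=2):
    """Return a list of sorted 'parts' built from the incoming lines."""
    blocks = split_into_blocks(lines, block_size)
    # Parse blocks concurrently; pool.map keeps the input order.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parsed_blocks = list(pool.map(parse_block, blocks))
    # Spread parsed rows over independent in-memory buffers (round-robin is an assumption).
    buffers = [[] for _ in range(buffers_count)]
    for i, rows in enumerate(parsed_blocks):
        buffers[i % buffers_count].extend(rows)
    # "Flush": every non-empty buffer becomes its own sorted part.
    return [sorted(buf) for buf in buffers if buf]


if __name__ == "__main__":
    data = ["cpu 1 10", "mem 2 10", "cpu 3 20", "mem 4 20"]
    for part in ingest(data):
        print(part)
```

Because buffers and parts are independent, no stage needs a global lock, which is the property the slide relies on for scaling with CPU cores.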

VictoriaMetrics single-node: scaling querying path
● VictoriaMetrics stores data in compressed blocks
● Selected blocks are unpacked in parallel on available CPUs
● Selected time series are processed in parallel on available CPUs
[Diagram: each of series_1 … series_M consists of compressed blocks block_1 … block_N; the selected blocks are unpacked on CPU_1 … CPU_P, then the resulting series are processed on CPU_1 … CPU_P]
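A minimal sketch of the two parallel stages on the query path, under stated assumptions: zlib and JSON stand in for the real block encoding, `query_max` is a hypothetical query that returns the maximum sample per series, and a thread pool stands in for the available CPUs.

```python
import json
import zlib
from concurrent.futures import ThreadPoolExecutor


def pack_block(samples):
    """Compress a list of samples into one block (zlib is a stand-in codec)."""
    return zlib.compress(json.dumps(samples).encode("utf-8"))


def unpack_block(block):
    """Decompress one block back into a list of samples."""
    return json.loads(zlib.decompress(block).decode("utf-8"))


def query_max(series_blocks, workers=4):
    """series_blocks maps a series name to its list of compressed blocks."""
    names = list(series_blocks)
    tagged = [(name, block) for name in names for block in series_blocks[name]]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Stage 1: unpack every selected block concurrently.
        unpacked = list(pool.map(unpack_block, [block for _, block in tagged]))
        series = {name: [] for name in names}
        for (name, _), samples in zip(tagged, unpacked):
            series[name].extend(samples)
        # Stage 2: process every selected series concurrently.
        results = list(pool.map(lambda name: max(series[name]), names))
    return dict(zip(names, results))


if __name__ == "__main__":
    stored = {
        "a": [pack_block([1, 2]), pack_block([5, 3])],
        "b": [pack_block([7])],
    }
    print(query_max(stored))
```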

VictoriaMetrics single-node: scalability limits
● The performance is limited by a single host (CPU, RAM, disk)
● Benchmark numbers:
○ Data ingestion: 300k samples/sec per CPU
○ Active time series: 1 million per GB of RAM
○ Query path: 50 million samples/sec per CPU
● Production numbers:
○ Data ingestion: 2 million samples/sec
○ Active time series: 100 million
○ Query path: 1 billion samples/sec
○ Total samples: 15 trillion
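The per-CPU and per-GB benchmark figures can be turned into a rough sizing estimate. The sketch below assumes purely linear scaling, which the slides do not promise, and `estimate_single_node` is a hypothetical helper; treat the output as a lower bound rather than a hardware recommendation.

```python
# Benchmark figures quoted on the slide.
INGEST_SAMPLES_PER_CPU = 300_000
ACTIVE_SERIES_PER_GB = 1_000_000
QUERY_SAMPLES_PER_CPU = 50_000_000


def ceil_div(a, b):
    """Integer division rounded up."""
    return -(-a // b)


def estimate_single_node(ingest_rate, active_series, query_rate):
    """Estimate CPUs and RAM, assuming linear scaling (an assumption)."""
    return {
        "ingest_cpus": ceil_div(ingest_rate, INGEST_SAMPLES_PER_CPU),
        "ram_gb": ceil_div(active_series, ACTIVE_SERIES_PER_GB),
        "query_cpus": ceil_div(query_rate, QUERY_SAMPLES_PER_CPU),
    }


if __name__ == "__main__":
    # Plug in the production numbers from the same slide.
    print(estimate_single_node(2_000_000, 100_000_000, 1_000_000_000))
```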

Scaling VictoriaMetrics cluster
● VictoriaMetrics cluster consists of three components:
○ vminsert - accepts incoming data
○ vmselect - processes incoming queries
○ vmstorage - stores the data
● Each component can run on the most suitable hardware
● Each component can scale independently to any number of instances
[Diagram: incoming data → HTTP load balancer → vminsert_1 … vminsert_M → data → vmstorage_1 … vmstorage_N; incoming queries → HTTP load balancer → vmselect_1 … vmselect_P → queries → vmstorage_1 … vmstorage_N]

VictoriaMetrics cluster: scaling data ingestion
● An HTTP load balancer spreads incoming data among vminsert nodes
● Data ingestion performance scales with the number of vminsert nodes
● vminsert automatically shards incoming data among available vmstorage nodes via consistent hashing
● Each vmstorage node has its own subset of time series (ideally)
● Data ingestion performance scales with the number of vmstorage nodes
[Diagram: incoming data → HTTP load balancer → vminsert_1 … vminsert_N; each vminsert shards data among vmstorage_1 … vmstorage_M]
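To illustrate what sharding via consistent hashing buys, here is a minimal sketch using rendezvous (highest-random-weight) hashing, one simple form of consistent hashing. The slides do not say which algorithm vminsert uses, so the algorithm choice, the node names and `pick_node` are assumptions for illustration only.

```python
import hashlib


def _score(node, series_key):
    """Deterministic pseudo-random weight for a (node, series) pair."""
    digest = hashlib.sha256(f"{node}|{series_key}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def pick_node(series_key, nodes):
    """Pick the storage node with the highest weight for this series."""
    return max(nodes, key=lambda node: _score(node, series_key))


if __name__ == "__main__":
    nodes = ["vmstorage-1", "vmstorage-2", "vmstorage-3"]
    keys = [f"series_{i}" for i in range(1000)]
    before = {key: pick_node(key, nodes) for key in keys}
    # Drop the last node: only series that lived on it are reassigned.
    after = {key: pick_node(key, nodes[:2]) for key in keys}
    moved = sum(1 for key in keys if before[key] != after[key])
    print(f"series moved after removing one node: {moved} of {len(keys)}")
```

Every series always maps to the same node, so each node ends up with its own subset of time series, and changing the node list only moves the series that belonged to the removed node.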

VictoriaMetrics cluster: scaling querying path
● An HTTP load balancer spreads incoming queries among vmselect nodes
● QPS scales with the number of vmselect nodes
● vmselect fetches the needed data from every vmstorage node in parallel
● Querying performance scales with the number of vmstorage nodes
● vmselect unpacks the fetched data in parallel on available CPUs
● Querying performance scales with the number of vCPUs at a single vmselect node
[Diagram: incoming queries → HTTP load balancer → vmselect_1 … vmselect_P; each vmselect fetches compressed data from vmstorage_1 … vmstorage_N]
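The fetch-from-every-node step is a scatter-gather pattern. The sketch below models each storage node as an in-memory dict and uses a thread pool for the parallel fetch; `fetch_from_node` is a hypothetical stand-in for the real network call, and samples are assumed to be (timestamp, value) tuples.

```python
from concurrent.futures import ThreadPoolExecutor


def fetch_from_node(node, series_name):
    """Stand-in for a network call: return this node's samples for a series."""
    return node.get(series_name, [])


def fetch_series(nodes, series_name):
    """Query all storage nodes concurrently and merge the results by timestamp."""
    with ThreadPoolExecutor(max_workers=max(1, len(nodes))) as pool:
        chunks = list(pool.map(lambda node: fetch_from_node(node, series_name), nodes))
    merged = [sample for chunk in chunks for sample in chunk]
    return sorted(merged)


if __name__ == "__main__":
    storage_nodes = [
        {"cpu": [(20, 3.0)]},
        {"cpu": [(10, 1.0)], "mem": [(10, 2.0)]},
        {},
    ]
    print(fetch_series(storage_nodes, "cpu"))
```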

VictoriaMetrics cluster: scalability limits
● CPU? No - data ingestion and querying performance scales with CPUs
● RAM? No - cluster capacity scales with RAM
● Disk? No - cluster capacity scales with disk space and IO
● Network? Yes!

100M benchmark
● Can a VictoriaMetrics cluster accept 100 million samples per second in production?
● Can a VictoriaMetrics cluster handle a billion active time series?
● How many resources does it need?

Benchmarketing?
● Artificial data?
● Limited amounts of data?
● Limited benchmark duration?
● Special configs?
● Optimized hardware?

Prometheus-benchmark
● Helm chart for testing Prometheus-like systems
● Uses production-like workload for data ingestion and querying
● Pushes the real node-exporter metrics to the tested systems
● Allows using the real alerting rules for node-exporter metrics
● https://github.com/VictoriaMetrics/prometheus-benchmark
[Diagram: load generator (vmagent scrapes node_exporter) → remote_write → Prometheus-like system; vmalert evaluates alerting rules by sending read queries to the Prometheus-like system]

100M benchmark: requirements
● Stable ingestion rate: 100,000,000 samples/sec
● Active time series: 1,000,000,000 (1 billion)
● Duration: 24 hours
● Total samples: 100M samples/sec * 3600 sec * 24 hours = 8,640,000,000,000 (8.64 trillion)

100M benchmark: prometheus-benchmark configs
● 16 load generator pods (8 vCPU, 25GB RAM each)
● Scrape targets (node_exporter v1.4.0): 16*51,250=820,000
● Each scrape target exposes around 1220 metrics
● Total number of metrics (aka active series): 820K*1220=1 billion
● Scrape interval: 10 seconds
● Scrape rate: 1 billion / 10 seconds = 100M samples/sec
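The workload arithmetic on these slides can be checked in a few lines. All inputs are the figures quoted above; the exact products land slightly above the rounded values on the slides because 1220 metrics per target is itself an approximate figure.

```python
LOAD_GENERATOR_PODS = 16
TARGETS_PER_POD = 51_250
METRICS_PER_TARGET = 1_220      # "around 1220" on the slide
SCRAPE_INTERVAL_SECONDS = 10
DURATION_SECONDS = 24 * 3600    # 24 hours

targets = LOAD_GENERATOR_PODS * TARGETS_PER_POD
active_series = targets * METRICS_PER_TARGET
samples_per_second = active_series // SCRAPE_INTERVAL_SECONDS
total_samples = samples_per_second * DURATION_SECONDS

print(targets)             # 820000
print(active_series)       # 1000400000
print(samples_per_second)  # 100040000
print(total_samples)       # 8643456000000
```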

100M benchmark: VictoriaMetrics cluster configs
● Runs in Google Kubernetes Engine via the official VictoriaMetrics helm charts
● vmstorage: 46 x (16 vCPU, 55GB RAM, 2200GB HDD-based disk)
● vminsert: 18 x (16 vCPU, 55GB RAM)
● vmselect: none (wait for the next talk)

100M benchmark: allocated resources
● Prometheus-benchmark resources:
○ vCPU cores: 16*8=128
○ RAM: 16*25GB=400GB
● VictoriaMetrics cluster resources:
○ vCPU cores: (46 vmstorage + 18 vminsert)*16=1024
○ RAM: (46 vmstorage + 18 vminsert)*55GB=3520GB
○ Disk: 46*2200GB=101.2TB
● Kubernetes cluster:
○ 36 x e2-standard-32 nodes (32 vCPU, 128GB RAM each)
○ Total: 1152 vCPU, 4608GB RAM

100M benchmark: used resources
● vminsert: 206 vCPU, 26GB RAM
● vmstorage: 510 vCPU, 600GB RAM, 7.5TB disk (of 101.2TB allocated)
● Total: 716 vCPU (70%), 626GB RAM (18%), 7.5TB disk (7.5%)
● Network: 140Gbit/s (can be reduced to 20Gbit/s at the cost of 10% CPU)
● Disk IO: 3GB/s write, 450MB/s read
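The utilization percentages follow from dividing the used resources by the resources allocated to the VictoriaMetrics cluster (1024 vCPU, 3520GB RAM, 101.2TB disk). `utilization_percent` is a hypothetical helper; the disk figure comes out near 7.4%, close to the 7.5% on the slide.

```python
def utilization_percent(used, allocated):
    """Used share of an allocated resource, in percent, rounded to one decimal."""
    return round(100 * used / allocated, 1)


cpu_percent = utilization_percent(206 + 510, 1024)  # vminsert + vmstorage vCPUs
ram_percent = utilization_percent(26 + 600, 3520)   # vminsert + vmstorage RAM, GB
disk_percent = utilization_percent(7.5, 101.2)      # used vs allocated disk, TB

print(cpu_percent, ram_percent, disk_percent)
```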

100M benchmark: results
● Stable data ingestion at 100M samples/sec for 24 hours
● Active time series: 1 billion
● Total samples ingested: 8.77 trillion
● Average sample size: 0.85 bytes
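Multiplying the total number of ingested samples by the average sample size gives the on-disk footprint, which can be compared with the 7.5TB of used disk reported earlier. The sketch assumes decimal terabytes (10^12 bytes), which the slides do not specify.

```python
TOTAL_SAMPLES = 8.77e12         # 8.77 trillion samples
AVG_SAMPLE_SIZE_BYTES = 0.85    # average compressed sample size
BYTES_PER_TB = 1e12             # decimal terabyte (an assumption)

disk_bytes = TOTAL_SAMPLES * AVG_SAMPLE_SIZE_BYTES
disk_tb = disk_bytes / BYTES_PER_TB

print(f"{disk_tb:.2f} TB")
```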

100M benchmark: key takeaways
● VictoriaMetrics cluster performance and capacity scale linearly to 100 nodes and more
● A single VictoriaMetrics cluster can collect metrics from a million hosts
[Diagram: vmagent scrapes host_1 … host_1,000,000 (a million hosts, scrape_interval=10s) and sends the data to the VictoriaMetrics cluster via remote_write]
● Cluster stability improves with the number of nodes
● HDD-based disks are enough - there is no need for SSD-based disks (HDD: $40/TB/month vs SSD: $170/TB/month)
● VictoriaMetrics handles large workloads with default configs

Reproduce the 100M benchmark yourself!
● https://github.com/VictoriaMetrics/prometheus-benchmark/tree/bm-100
● Benchmark configs
● VictoriaMetrics cluster configs

What’s next?
● Benchmark querying performance (50M samples/sec per vCPU processing speed)?
● A billion samples/sec benchmark?
● 10 billion active time series?
● Kubernetes-like time series churn rate?
● A month-long benchmark (needs $$$)?
● Share your results!
