SlideShare une entreprise Scribd logo
1  sur  52
Télécharger pour lire hors ligne
| © Copyright 2023, InfluxData
Understanding the New
InfluxDB Storage Engine
Anais Dotis-Georgiou
January 2023
| © Copyright 2023, InfluxData
2
Anais Dotis-Georgiou
Developer Advocate
LinkedIn
| © Copyright 2023, InfluxData
| © Copyright 2023, InfluxData
Agenda
● What is the new InfluxDB Engine?
● What requirements does the new InfluxDB Engine meet?
○ Understanding the Apache Ecosystem
● Offerings and Release Timeline
● New InfluxDB Cloud Features
● SQL support
● Interoperability plans
● Survey
● Resources
| © Copyright 2023, InfluxData
The new storage engine that
powers InfluxDB Cloud
4
| © Copyright 2023, InfluxData
| © Copyright 2023, InfluxData
InfluxDB’s new storage engine is built on
● Rust
● Apache Arrow
● Apache Parquet
● Arrow Flight
● DataFusion
| © Copyright 2023, InfluxData
| © Copyright 2023, InfluxData
Release Details
● Release Jan 31st
● Available in InfluxDB Cloud on AWS in:
○ AWS us-east-1
○ AWS eu-central-1
6
| © Copyright 2023, InfluxData
Requirements for the new storage engine
Requirement/Feature Rust Arrow DataFusion Parquet
1. No limits on cardinality. Write any kind of event data and don't
worry about what a tag or field is.
X X X X
2. Best-in-class performance on analytics queries in addition to
our already well-served metrics queries.
X X X X
3. Separate compute from storage and tiered data storage. The
DB should use cheaper object storage as its long-term durable
store.
X X
4. Operator control over memory usage. The operator should be
able to define how much memory is used for each buffering,
caching, and query processing.
X
5. Bulk data export and import. X
6. Broader ecosystem compatibility. Where possible, we should
aim to use and embrace emerging standards in the data and
analytics ecosystem.
X X X X
7. Run at the edge and in the datacenter. Federated by design. X X
| © Copyright 2023, InfluxData
Rust and InfluxDB Requirements
8
| © Copyright 2023, InfluxData
Rust and Requirement 1.
Requirement: No limits on cardinality. Write any kind of event data
and don't worry about what a tag or field is.
Rust Contributions:
• InfluxDB’s new storage engine is built on the Rust
implementation of Apache Arrow which contributes heavily to
meeting this requirement.
• Handling unlimited cardinality use cases requires non-trivial
CPU during query processing. Rust supports optimizing
resources for increased performance.
| © Copyright 2023, InfluxData
Rust and Requirement 2.
Requirement: Best-in-class performance on analytics queries in
addition to our already well-served metrics queries.
Rust Contributions:
• Arrow and DataFusion are built on Rust.
| © Copyright 2023, InfluxData
Rust and Requirement 4.
Requirement: Operator control over memory usage. The operator
should be able to define how much memory is used for each
buffering, caching, and query processing.
Rust Contributions:
• Rust is used for memory control.
| © Copyright 2023, InfluxData
Rust and Requirement 6.
Requirement: Broader ecosystem compatibility. Where possible,
we should aim to use and embrace emerging standards in the
data and analytics ecosystem.
Rust Contributions:
• Rust helps support the implementation of Arrow, DataFusion,
and Parquet.
| © Copyright 2023, InfluxData
Rust and Requirement 7.
Requirement: Run at the edge and in the datacenter. Federated
by design.
Rust Contributions:
• Optimizing your memory usage with Rust, means that InfluxDB
Cloud will also contain these memory optimizations at the edge
or in the datacenter.
| © Copyright 2023, InfluxData
Rust and Requirement 7.
Requirement: Run at the edge and in the datacenter. Federated
by design.
Rust Contributions:
• Optimizing your memory usage with Rust, means that InfluxDB
Cloud will also contain these memory optimizations at the edge
or in the datacenter.
| © Copyright 2023, InfluxData
Arrow and InfluxDB Requirements
15
| © Copyright 2023, InfluxData
Arrow and Requirement 1.
Requirement: No limits on cardinality. Write any kind of event data
and don't worry about what a tag or field is.
Arrow Contributions:
• Apache Arrow overcomes memory challenges associated with
large-cardinality use cases by providing efficient columnar data
exchange.
| © Copyright 2023, InfluxData
Advantages of Columnar Data Storage
(sidebar)
17
| © Copyright 2023, InfluxData
| © Copyright 2023, InfluxData
Sidebar–Advantages of Columnar Data Storage
18
measurement1,tag1=tagvalue1 field1=1i timestamp1
measurement1,tag1=tagvalue2 field1=2i timestamp2
measurement1,tag2=tagvalue3 field1=3i timestamp3
measurement1,tag1=tagvalue1,tag2=tagvalue3 field1=4i,field2=true timestamp4
measurement1, field1=1i timestamp5
| © Copyright 2023, InfluxData
| © Copyright 2023, InfluxData
Sidebar–Advantages of Columnar Data Storage
19
Name: measurement1
field1 field2 tag1 tag2 tag3 time
1i null tagvalue1 null null timestamp1
2i null tagvalue2 null null timestamp2
3i null null tagvalue3 null timestamp3
4i true tagvalue1 tagvalue3 tagvalue4 timestamp4
1i null null null null timestamp5
| © Copyright 2023, InfluxData
| © Copyright 2023, InfluxData
Sidebar–Advantages of Columnar Data Storage
20
1i 2i 3i 4i 1i
null null null true null
tagvalue1 tagvalue2 null tagvalue1 null
null null tagvalue3 tagvalue3 null
null null null tagvalue4 null
timestamp1 timestamp2 timestamp3 timestamp4 timestamp5
1i, 2i, 3i, 4i, 1i;
null, null, null, true, null;
tagvalue1, tagvalue2, null, tagvalue1, null;
null, null, null, tagvalue3, tagvalue3, null;
null, null, null, tagvalue4, null;
timestamp1, timestamp2, timestamp3, timestamp4, timestamp5.
| © Copyright 2023, InfluxData
Arrow and Requirement 2.
Requirement: Best-in-class performance on analytics queries in
addition to our already well-served metrics queries.
Arrow Contributions:
• Arrow offers best in class performance on analytics through the
memory optimizations and efficient data exchange.
| © Copyright 2023, InfluxData
Arrow and Requirement 3.
Requirement: Separate compute from storage and tiered data
storage. The DB should use cheaper object storage as its
long-term durable store.
Arrow Contributions:
• Arrow provides the in-memory columnar storage while Parquet
will provide the column-oriented data file format on disk.
| © Copyright 2023, InfluxData
Arrow and Requirement 4.
Requirement: Operator control over memory usage. The operator
should be able to define how much memory is used for each
buffering, caching, and query processing
Arrow Contributions:
• The Rust implementation of Apache Arrow provides fine
grained memory control.
| © Copyright 2023, InfluxData
Arrow and Requirement 6.
Requirement: Broader ecosystem compatibility. Where possible,
we should aim to use and embrace emerging standards in the
data and analytics ecosystem.
Arrow Contributions:
• Leveraging Arrow is easier with the 12 libraries it supports for C,
C++, Java, JavaScript, Python, Ruby, and more.
| © Copyright 2023, InfluxData
DataFusion and InfluxDB Requirements
25
| © Copyright 2023, InfluxData
DataFusion and Requirement 1.
Requirement: No limits on cardinality. Write any kind of event data
and don't worry about what a tag or field is.
DataFusion Contributions:
• What use is unlimited cardinality data if you can’t query it?
DataFusion provides the query, processing, and transformation
of this data.
| © Copyright 2023, InfluxData
DataFusion and Requirement 2.
Requirement: Best-in-class performance on analytics queries in
addition to our already well-served metrics queries.
DataFusion Contributions:
• What use is unlimited cardinality data if you can’t query it?
DataFusion provides the query, processing, and transformation
of this data.
| © Copyright 2023, InfluxData
DataFusion and Requirement 3.
Requirement: Separate compute from storage and tiered data
storage. The DB should use cheaper object storage as its
long-term durable store.
DataFusion Contributions:
• DataFusion enables fast query against data stored on cheaper
object store and separate compute.
| © Copyright 2023, InfluxData
DataFusion and Requirement 6.
Requirement: Broader ecosystem compatibility. Where possible,
we should aim to use and embrace emerging standards in the
data and analytics ecosystem.
DataFusion Contributions:
• DataFusion supports both a postgres compatible SQL and
DataFrame API.
| © Copyright 2023, InfluxData
Parquet and InfluxDB Requirements
30
| © Copyright 2023, InfluxData
Parquet and Requirement 2.
Requirement: Best-in-class performance on analytics queries in
addition to our already well-served metrics queries.
Parquet Contributions:
• Efficient compression and interoperability with ML and analytics
tooling.
| © Copyright 2023, InfluxData
Parquet and Requirement 3.
Requirement: Separate compute from storage and tiered data
storage. The DB should use cheaper object storage as its
long-term durable store.
Parquet Contributions:
• Parquet files take up little disk space and are fast to scan.
| © Copyright 2023, InfluxData
Parquet and Requirement 5.
Requirement: Bulk data export and import.
Parquet Contributions:
• Parquet files enable bulk data export and import.
| © Copyright 2023, InfluxData
Parquet and Requirement 5.
Requirement: Broader ecosystem compatibility. Where possible,
we should aim to use and embrace emerging standards in the
data and analytics ecosystem.
Parquet Contributions:
• Parquet offers interoperability with modern ML and analytics
tools.
| © Copyright 2023, InfluxData
Parquet and Requirement 5.
Requirement: Run at the edge and in the datacenter. Federated
by design.
Parquet Contributions:
• Because Parquet files are so efficient, they will facilitate and
increase the capacity for data storage at the edge.
| © Copyright 2023, InfluxData
New Data Explorer
36
| © Copyright 2023, InfluxData
New Data Explorer
37
| © Copyright 2023, InfluxData
iox.from() vs from()
38
| © Copyright 2023, InfluxData
iox.from() vs from()
39
from(bucket: "anais-iox")
|> range(start: 2022-12-01T19:05:41.000Z, stop: now())
|> filter(fn: (r) => r._measurement == "airSensors")
|> filter(fn: (r) => r._field == "temperature")
|> filter(fn: (r) => r.sensor_id == "TLM0100")
| © Copyright 2023, InfluxData
iox.from() vs from()
40
import "experimental/iox"
data = iox.from(bucket: "anais-iox", measurement: "airSensors")
|> range(start: 2022-12-01T19:05:41.000Z, stop: now())
|> filter(fn: (r) => r.sensor_id == "TLM0100")
|> yield()
| © Copyright 2023, InfluxData
SQL Support
41
| © Copyright 2023, InfluxData
SQL Functions Supported by Jan 31
• Supported Statements:
SELECT, FROM, WHERE,
GROUP BY, ORDER BY, JOIN
(left and inner), WITH clause,
HAVING, UNION, LIMIT,
OVER
• Subqueries: EXISTS, NOT
EXISTS, IN, NOT IN
• Agg Functions: COUNT(),
AVG(), MEAN(), SUM(), MIN(),
MAX()
• Time Series Functions:
TIME_BUCKET_GAPFILL(),
DATEBIN(), NOW()
• Other: EXPLAIN, EXPLAIN
ANALYZE
42
| © Copyright 2023, InfluxData
Interoperability plans
• FlightSQL plugins (Timeline TBD):
• Apache Superset
• Tableau
• PowerBI
• Grafana
43
| © Copyright 2023, InfluxData
Survey
44
● Please take 5 minutes to fill out this anonymous survey. Your
feedback is extremely valuable to us.
● https://bit.ly/3imWP7Y
| © Copyright 2023, InfluxData
Get Started
45
influxdata.com/influxdb-engine-beta/
| © Copyright 2023, InfluxData
InfluxDB
Community
Slack workspace
Please join us in the InfluxDB
Community Slack at
www.influxdata.com/slack.
To participate in conversations,
join the #influxdb_iox channel.
| © Copyright 2023, InfluxData
47
influxdata.com https://github.com/InfluxCommunity
| © Copyright 2023, InfluxData
48
Try it yourself
https://www.influxdata.com https://github.com/InfluxCommunity
Get Started
| © Copyright 2023, InfluxData
Related Blogs
• Understanding InfluxDB IOx and the Commitment to Open Source
• Querying Data in InfluxDB using Flux and SQL
• Intro to InfluxDB IOx
• Welcome to InfluxDB IOx: InfluxData’s New Storage Engine
• The Journey of InfluxDB by Paul Dix
• InfluxData Deploys Next-Generation InfluxDB Time Series Engine with Unlimited Scale
• Announcing InfluxDB IOx - The Future Core of InfluxDB Built with Rust and Arrow
• Evolving InfluxDB into the Smart Data Platform for Time Series
• InfluxData is Building a Fast Implementation of Apache Arrow in Go Using c2goasm and SIMD
• On InfluxData’s New Storage Engine. Q&A with Andrew Lamb
• Apache Arrow, Parquet, Flight and Their Ecosystem are a Game Changer for OLAP
49
| © Copyright 2023, InfluxData
Get Help + Resources!
50
Forums: community.influxdata.com
Slack: influxcommunity.slack.com
GH: github.com/InfluxCommunity
Book: awesome.influxdata.com
Docs: docs.influxdata.com
Blogs: influxdata.com/blog
InfluxDB University: influxdata.com/university
| © Copyright 2023, InfluxData
Questions?
51
| © Copyright 2023, InfluxData
T H A N K Y O U

Contenu connexe

Tendances

Using LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache ArrowUsing LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache Arrow
DataWorks Summit
 
Efficient Data Storage for Analytics with Apache Parquet 2.0
Efficient Data Storage for Analytics with Apache Parquet 2.0Efficient Data Storage for Analytics with Apache Parquet 2.0
Efficient Data Storage for Analytics with Apache Parquet 2.0
Cloudera, Inc.
 
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
InfluxData
 

Tendances (20)

InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
 
Apache Arrow: Open Source Standard Becomes an Enterprise Necessity
Apache Arrow: Open Source Standard Becomes an Enterprise NecessityApache Arrow: Open Source Standard Becomes an Enterprise Necessity
Apache Arrow: Open Source Standard Becomes an Enterprise Necessity
 
Inside the InfluxDB storage engine
Inside the InfluxDB storage engineInside the InfluxDB storage engine
Inside the InfluxDB storage engine
 
Introducing InfluxDB’s New Time Series Database Storage Engine
Introducing InfluxDB’s New Time Series Database Storage EngineIntroducing InfluxDB’s New Time Series Database Storage Engine
Introducing InfluxDB’s New Time Series Database Storage Engine
 
The Apache Spark File Format Ecosystem
The Apache Spark File Format EcosystemThe Apache Spark File Format Ecosystem
The Apache Spark File Format Ecosystem
 
The Parquet Format and Performance Optimization Opportunities
The Parquet Format and Performance Optimization OpportunitiesThe Parquet Format and Performance Optimization Opportunities
The Parquet Format and Performance Optimization Opportunities
 
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
Improving SparkSQL Performance by 30%: How We Optimize Parquet Pushdown and P...
 
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
 
Scylla Summit 2022: Making Schema Changes Safe with Raft
Scylla Summit 2022: Making Schema Changes Safe with RaftScylla Summit 2022: Making Schema Changes Safe with Raft
Scylla Summit 2022: Making Schema Changes Safe with Raft
 
Using LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache ArrowUsing LLVM to accelerate processing of data in Apache Arrow
Using LLVM to accelerate processing of data in Apache Arrow
 
InfluxDB Roadmap: What’s New and What’s Coming
InfluxDB Roadmap: What’s New and What’s ComingInfluxDB Roadmap: What’s New and What’s Coming
InfluxDB Roadmap: What’s New and What’s Coming
 
Efficient Data Storage for Analytics with Apache Parquet 2.0
Efficient Data Storage for Analytics with Apache Parquet 2.0Efficient Data Storage for Analytics with Apache Parquet 2.0
Efficient Data Storage for Analytics with Apache Parquet 2.0
 
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
 
Sharding Methods for MongoDB
Sharding Methods for MongoDBSharding Methods for MongoDB
Sharding Methods for MongoDB
 
How to build a streaming Lakehouse with Flink, Kafka, and Hudi
How to build a streaming Lakehouse with Flink, Kafka, and HudiHow to build a streaming Lakehouse with Flink, Kafka, and Hudi
How to build a streaming Lakehouse with Flink, Kafka, and Hudi
 
RocksDB Performance and Reliability Practices
RocksDB Performance and Reliability PracticesRocksDB Performance and Reliability Practices
RocksDB Performance and Reliability Practices
 
Understanding and Improving Code Generation
Understanding and Improving Code GenerationUnderstanding and Improving Code Generation
Understanding and Improving Code Generation
 
InfluxDB 101 - Concepts and Architecture | Michael DeSa | InfluxData
InfluxDB 101 - Concepts and Architecture | Michael DeSa | InfluxDataInfluxDB 101 - Concepts and Architecture | Michael DeSa | InfluxData
InfluxDB 101 - Concepts and Architecture | Michael DeSa | InfluxData
 
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
Paul Dix [InfluxData] The Journey of InfluxDB | InfluxDays 2022
 
Inside Parquet Format
Inside Parquet FormatInside Parquet Format
Inside Parquet Format
 

Similaire à Understanding InfluxDB’s New Storage Engine

Discover Aura Workshop (12.5.23).pdf
Discover Aura Workshop (12.5.23).pdfDiscover Aura Workshop (12.5.23).pdf
Discover Aura Workshop (12.5.23).pdf
Neo4j
 
Archmage, Pinterest’s Real-time Analytics Platform on Druid
Archmage, Pinterest’s Real-time Analytics Platform on DruidArchmage, Pinterest’s Real-time Analytics Platform on Druid
Archmage, Pinterest’s Real-time Analytics Platform on Druid
Imply
 
Discover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptxDiscover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptx
Neo4j
 

Similaire à Understanding InfluxDB’s New Storage Engine (20)

Best Practices for Leveraging the Apache Arrow Ecosystem
Best Practices for Leveraging the Apache Arrow EcosystemBest Practices for Leveraging the Apache Arrow Ecosystem
Best Practices for Leveraging the Apache Arrow Ecosystem
 
Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre...
Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre...Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre...
Maximizing Real-Time Data Processing with Apache Kafka and InfluxDB: A Compre...
 
Introducing InfluxDB Cloud Dedicated
Introducing InfluxDB Cloud DedicatedIntroducing InfluxDB Cloud Dedicated
Introducing InfluxDB Cloud Dedicated
 
Gain Better Observability with OpenTelemetry and InfluxDB
Gain Better Observability with OpenTelemetry and InfluxDB Gain Better Observability with OpenTelemetry and InfluxDB
Gain Better Observability with OpenTelemetry and InfluxDB
 
Announcing InfluxDB Clustered
Announcing InfluxDB ClusteredAnnouncing InfluxDB Clustered
Announcing InfluxDB Clustered
 
How to Choose the Right Database for Your Workloads
How to Choose the Right Database for Your WorkloadsHow to Choose the Right Database for Your Workloads
How to Choose the Right Database for Your Workloads
 
Discover Aura Workshop (12.5.23).pdf
Discover Aura Workshop (12.5.23).pdfDiscover Aura Workshop (12.5.23).pdf
Discover Aura Workshop (12.5.23).pdf
 
Automating a PostgreSQL High Availability Architecture with Ansible
Automating a PostgreSQL High Availability Architecture with AnsibleAutomating a PostgreSQL High Availability Architecture with Ansible
Automating a PostgreSQL High Availability Architecture with Ansible
 
The ultimate Kubernetes Deployment Checklist - Infra to Microservices
The ultimate Kubernetes Deployment Checklist - Infra to MicroservicesThe ultimate Kubernetes Deployment Checklist - Infra to Microservices
The ultimate Kubernetes Deployment Checklist - Infra to Microservices
 
20230614 LinuxONE Distinguished_Recognition ISSIP_Award_Talk.pptx
20230614 LinuxONE Distinguished_Recognition ISSIP_Award_Talk.pptx20230614 LinuxONE Distinguished_Recognition ISSIP_Award_Talk.pptx
20230614 LinuxONE Distinguished_Recognition ISSIP_Award_Talk.pptx
 
Big data journey to the cloud 5.30.18 asher bartch
Big data journey to the cloud 5.30.18   asher bartchBig data journey to the cloud 5.30.18   asher bartch
Big data journey to the cloud 5.30.18 asher bartch
 
DCEU 18: Use Cases and Practical Solutions for Docker Container Storage on Sw...
DCEU 18: Use Cases and Practical Solutions for Docker Container Storage on Sw...DCEU 18: Use Cases and Practical Solutions for Docker Container Storage on Sw...
DCEU 18: Use Cases and Practical Solutions for Docker Container Storage on Sw...
 
Linux world consolidation of storage infrastructures 2006
Linux world   consolidation of storage infrastructures 2006Linux world   consolidation of storage infrastructures 2006
Linux world consolidation of storage infrastructures 2006
 
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
Discover Neo4j Aura_ The Future of Graph Database-as-a-Service Workshop_3.13.24
 
Virtual training intro to InfluxDB - June 2021
Virtual training  intro to InfluxDB  - June 2021Virtual training  intro to InfluxDB  - June 2021
Virtual training intro to InfluxDB - June 2021
 
Archmage, Pinterest’s Real-time Analytics Platform on Druid
Archmage, Pinterest’s Real-time Analytics Platform on DruidArchmage, Pinterest’s Real-time Analytics Platform on Druid
Archmage, Pinterest’s Real-time Analytics Platform on Druid
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
 
Crossplane and a story about scaling Kubernetes custom resources.pdf
Crossplane and a story about scaling Kubernetes custom resources.pdfCrossplane and a story about scaling Kubernetes custom resources.pdf
Crossplane and a story about scaling Kubernetes custom resources.pdf
 
Discover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptxDiscover Neo4j Aura Workshop (9.27.23).pptx
Discover Neo4j Aura Workshop (9.27.23).pptx
 
Se training storage grid webscale technical overview
Se training   storage grid webscale technical overviewSe training   storage grid webscale technical overview
Se training storage grid webscale technical overview
 

Plus de InfluxData

How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
InfluxData
 
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
InfluxData
 
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
InfluxData
 
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
InfluxData
 

Plus de InfluxData (20)

How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
 
Power Your Predictive Analytics with InfluxDB
Power Your Predictive Analytics with InfluxDBPower Your Predictive Analytics with InfluxDB
Power Your Predictive Analytics with InfluxDB
 
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
 
Build an Edge-to-Cloud Solution with the MING Stack
Build an Edge-to-Cloud Solution with the MING StackBuild an Edge-to-Cloud Solution with the MING Stack
Build an Edge-to-Cloud Solution with the MING Stack
 
Meet the Founders: An Open Discussion About Rewriting Using Rust
Meet the Founders: An Open Discussion About Rewriting Using RustMeet the Founders: An Open Discussion About Rewriting Using Rust
Meet the Founders: An Open Discussion About Rewriting Using Rust
 
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
 
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...How Delft University's Engineering Students Make Their EV Formula-Style Race ...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
 
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDBStreamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
 
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
 
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
 
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
 
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
 
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
 
Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I...
Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I...Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I...
Jay Clifford [InfluxData] | Tips & Tricks for Analyzing IIoT in Real-Time | I...
 
Brian Gilmore [InfluxData] | Use Case: IIoT Overview | InfluxDays 2022
Brian Gilmore [InfluxData] | Use Case: IIoT Overview | InfluxDays 2022Brian Gilmore [InfluxData] | Use Case: IIoT Overview | InfluxDays 2022
Brian Gilmore [InfluxData] | Use Case: IIoT Overview | InfluxDays 2022
 
Gilmore, Palani [InfluxData] | Use Case: Monitoring / Observability | InfluxD...
Gilmore, Palani [InfluxData] | Use Case: Monitoring / Observability | InfluxD...Gilmore, Palani [InfluxData] | Use Case: Monitoring / Observability | InfluxD...
Gilmore, Palani [InfluxData] | Use Case: Monitoring / Observability | InfluxD...
 
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
Gilmore, Palani [InfluxData] | Use Case: Crypto & Fintech | InfluxDays 2022
 
Charles Mahler [InfluxData] | Use Case: Networking Monitoring | InfluxDays 2022
Charles Mahler [InfluxData] | Use Case: Networking Monitoring | InfluxDays 2022Charles Mahler [InfluxData] | Use Case: Networking Monitoring | InfluxDays 2022
Charles Mahler [InfluxData] | Use Case: Networking Monitoring | InfluxDays 2022
 
Anais Dotis-Georgiou [InfluxData] | Becoming a Flux Pro | InfluxDays 2022
Anais Dotis-Georgiou [InfluxData] | Becoming a Flux Pro | InfluxDays 2022Anais Dotis-Georgiou [InfluxData] | Becoming a Flux Pro | InfluxDays 2022
Anais Dotis-Georgiou [InfluxData] | Becoming a Flux Pro | InfluxDays 2022
 
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
Michael Hall [InfluxData] | Become an InfluxDB Pro in 20 Minutes | InfluxDays...
 

Dernier

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..Understanding the FAA Part 107 License ..
Understanding the FAA Part 107 License ..
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
WSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering DevelopersWSO2's API Vision: Unifying Control, Empowering Developers
WSO2's API Vision: Unifying Control, Empowering Developers
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 

Understanding InfluxDB’s New Storage Engine

  • 1. | © Copyright 2023, InfluxData Understanding the New InfluxDB Storage Engine Anais Dotis-Georgiou January 2023
  • 2. | © Copyright 2023, InfluxData 2 Anais Dotis-Georgiou Developer Advocate LinkedIn
  • 3. | © Copyright 2023, InfluxData | © Copyright 2023, InfluxData Agenda ● What is the new InfluxDB Engine? ● What requirements does the new InfluxDB Engine meet? ○ Understanding the Apache Ecosystem ● Offerings and Release Timeline ● New InfluxDB Cloud Features ● SQL support ● Interoperability plans ● Survey ● Resources
  • 4. | © Copyright 2023, InfluxData The new storage engine that powers InfluxDB Cloud 4
  • 5. | © Copyright 2023, InfluxData | © Copyright 2023, InfluxData InfluxDB’s new storage engine is built on ● Rust ● Apache Arrow ● Apache Parquet ● Arrow Flight ● DataFusion
  • 6. | © Copyright 2023, InfluxData | © Copyright 2023, InfluxData Release Details ● Release Jan 31st ● Available in InfluxDB Cloud on AWS in: ○ AWS us-east-1 ○ AWS eu-central-1 6
  • 7. | © Copyright 2023, InfluxData Requirements for the new storage engine Requirement/Feature Rust Arrow DataFusion Parquet 1. No limits on cardinality. Write any kind of event data and don't worry about what a tag or field is. X X X X 2. Best-in-class performance on analytics queries in addition to our already well-served metrics queries. X X X X 3. Separate compute from storage and tiered data storage. The DB should use cheaper object storage as its long-term durable store. X X 4. Operator control over memory usage. The operator should be able to define how much memory is used for each buffering, caching, and query processing. X 5. Bulk data export and import. X 6. Broader ecosystem compatibility. Where possible, we should aim to use and embrace emerging standards in the data and analytics ecosystem. X X X X 7. Run at the edge and in the datacenter. Federated by design. X X
  • 8. | © Copyright 2023, InfluxData Rust and InfluxDB Requirements 8
  • 9. | © Copyright 2023, InfluxData Rust and Requirement 1. Requirement: No limits on cardinality. Write any kind of event data and don't worry about what a tag or field is. Rust Contributions: • InfluxDB’s new storage engine is built on the Rust implementation of Apache Arrow which contributes heavily to meeting this requirement. • Handling unlimited cardinality use cases requires non-trivial CPU during query processing. Rust supports optimizing resources for increased performance.
  • 10. | © Copyright 2023, InfluxData Rust and Requirement 2. Requirement: Best-in-class performance on analytics queries in addition to our already well-served metrics queries. Rust Contributions: • Arrow and DataFusion are built on Rust.
  • 11. | © Copyright 2023, InfluxData Rust and Requirement 4. Requirement: Operator control over memory usage. The operator should be able to define how much memory is used for each buffering, caching, and query processing. Rust Contributions: • Rust is used for memory control.
  • 12. | © Copyright 2023, InfluxData Rust and Requirement 6. Requirement: Broader ecosystem compatibility. Where possible, we should aim to use and embrace emerging standards in the data and analytics ecosystem. Rust Contributions: • Rust helps support the implementation of Arrow, DataFusion, and Parquet.
  • 13. | © Copyright 2023, InfluxData Rust and Requirement 7. Requirement: Run at the edge and in the datacenter. Federated by design. Rust Contributions: • Optimizing your memory usage with Rust, means that InfluxDB Cloud will also contain these memory optimizations at the edge or in the datacenter.
  • 14. | © Copyright 2023, InfluxData Rust and Requirement 7. Requirement: Run at the edge and in the datacenter. Federated by design. Rust Contributions: • Optimizing your memory usage with Rust, means that InfluxDB Cloud will also contain these memory optimizations at the edge or in the datacenter.
  • 15. | © Copyright 2023, InfluxData Arrow and InfluxDB Requirements 15
  • 16. | © Copyright 2023, InfluxData Arrow and Requirement 1. Requirement: No limits on cardinality. Write any kind of event data and don't worry about what a tag or field is. Arrow Contributions: • Apache Arrow overcomes memory challenges associated with large-cardinality use cases by providing efficient columnar data exchange.
  • 17. | © Copyright 2023, InfluxData Advantages of Columnar Data Storage (sidebar) 17
  • 18. | © Copyright 2023, InfluxData | © Copyright 2023, InfluxData Sidebar–Advantages of Columnar Data Storage 18 measurement1,tag1=tagvalue1 field1=1i timestamp1 measurement1,tag1=tagvalue2 field1=2i timestamp2 measurement1,tag2=tagvalue3 field1=3i timestamp3 measurement1,tag1=tagvalue1,tag2=tagvalue3 field1=4i,field2=true timestamp4 measurement1, field1=1i timestamp5
  • 19. | © Copyright 2023, InfluxData | © Copyright 2023, InfluxData Sidebar–Advantages of Columnar Data Storage 19 Name: measurement1 field1 field2 tag1 tag2 tag3 time 1i null tagvalue1 null null timestamp1 2i null tagvalue2 null null timestamp2 3i null null tagvalue3 null timestamp3 4i true tagvalue1 tagvalue3 tagvalue4 timestamp4 1i null null null null timestamp5
  • 20. | © Copyright 2023, InfluxData | © Copyright 2023, InfluxData Sidebar–Advantages of Columnar Data Storage 20 1i 2i 3i 4i 1i null null null true null tagvalue1 tagvalue2 null tagvalue1 null null null tagvalue3 tagvalue3 null null null null tagvalue4 null timestamp1 timestamp2 timestamp3 timestamp4 timestamp5 1i, 2i, 3i, 4i, 1i; null, null, null, true, null; tagvalue1, tagvalue2, null, tagvalue1, null; null, null, null, tagvalue3, tagvalue3, null; null, null, null, tagvalue4, null; timestamp1, timestamp2, timestamp3, timestamp4, timestamp5.
  • 21. | © Copyright 2023, InfluxData Arrow and Requirement 2. Requirement: Best-in-class performance on analytics queries in addition to our already well-served metrics queries. Arrow Contributions: • Arrow offers best in class performance on analytics through the memory optimizations and efficient data exchange.
  • 22. | © Copyright 2023, InfluxData Arrow and Requirement 3. Requirement: Separate compute from storage and tiered data storage. The DB should use cheaper object storage as its long-term durable store. Arrow Contributions: • Arrow provides the in-memory columnar storage while Parquet will provide the column-oriented data file format on disk.
  • 23. | © Copyright 2023, InfluxData Arrow and Requirement 4. Requirement: Operator control over memory usage. The operator should be able to define how much memory is used for each buffering, caching, and query processing Arrow Contributions: • The Rust implementation of Apache Arrow provides fine grained memory control.
  • 24. | © Copyright 2023, InfluxData Arrow and Requirement 6. Requirement: Broader ecosystem compatibility. Where possible, we should aim to use and embrace emerging standards in the data and analytics ecosystem. Arrow Contributions: • Leveraging Arrow is easier with the 12 libraries it supports for C, C++, Java, JavaScript, Python, Ruby, and more.
  • 25. | © Copyright 2023, InfluxData DataFusion and InfluxDB Requirements 25
  • 26. | © Copyright 2023, InfluxData DataFusion and Requirement 1. Requirement: No limits on cardinality. Write any kind of event data and don't worry about what a tag or field is. DataFusion Contributions: • What use is unlimited cardinality data if you can’t query it? DataFusion provides the query, processing, and transformation of this data.
  • 27. | © Copyright 2023, InfluxData DataFusion and Requirement 2. Requirement: Best-in-class performance on analytics queries in addition to our already well-served metrics queries. DataFusion Contributions: • What use is unlimited cardinality data if you can’t query it? DataFusion provides the query, processing, and transformation of this data.
  • 28. | © Copyright 2023, InfluxData DataFusion and Requirement 3. Requirement: Separate compute from storage and tiered data storage. The DB should use cheaper object storage as its long-term durable store. DataFusion Contributions: • DataFusion enables fast query against data stored on cheaper object store and separate compute.
  • 29. | © Copyright 2023, InfluxData DataFusion and Requirement 6. Requirement: Broader ecosystem compatibility. Where possible, we should aim to use and embrace emerging standards in the data and analytics ecosystem. DataFusion Contributions: • DataFusion supports both a postgres compatible SQL and DataFrame API.
  • 30. | © Copyright 2023, InfluxData Parquet and InfluxDB Requirements 30
  • 31. | © Copyright 2023, InfluxData Parquet and Requirement 2. Requirement: Best-in-class performance on analytics queries in addition to our already well-served metrics queries. Parquet Contributions: • Efficient compression and interoperability with ML and analytics tooling.
  • 32. | © Copyright 2023, InfluxData Parquet and Requirement 3. Requirement: Separate compute from storage and tiered data storage. The DB should use cheaper object storage as its long-term durable store. Parquet Contributions: • Parquet files take up little disk space and are fast to scan.
  • 33. | © Copyright 2023, InfluxData Parquet and Requirement 5. Requirement: Bulk data export and import. Parquet Contributions: • Parquet files enable bulk data export and import.
  • 34. | © Copyright 2023, InfluxData Parquet and Requirement 5. Requirement: Broader ecosystem compatibility. Where possible, we should aim to use and embrace emerging standards in the data and analytics ecosystem. Parquet Contributions: • Parquet offers interoperability with modern ML and analytics tools.
  • 35. | © Copyright 2023, InfluxData Parquet and Requirement 5. Requirement: Run at the edge and in the datacenter. Federated by design. Parquet Contributions: • Because Parquet files are so efficient, they will facilitate and increase the capacity for data storage at the edge.
  • 36. | © Copyright 2023, InfluxData New Data Explorer 36
  • 37. | © Copyright 2023, InfluxData New Data Explorer 37
  • 38. | © Copyright 2023, InfluxData iox.from() vs from() 38
  • 39. | © Copyright 2023, InfluxData iox.from() vs from() 39 from(bucket: "anais-iox") |> range(start: 2022-12-01T19:05:41.000Z, stop: now()) |> filter(fn: (r) => r._measurement == "airSensors") |> filter(fn: (r) => r._field == "temperature") |> filter(fn: (r) => r.sensor_id == "TLM0100")
  • 40. | © Copyright 2023, InfluxData iox.from() vs from() 40 import "experimental/iox" data = iox.from(bucket: "anais-iox", measurement: "airSensors") |> range(start: 2022-12-01T19:05:41.000Z, stop: now()) |> filter(fn: (r) => r.sensor_id == "TLM0100") |> yield()
  • 41. | © Copyright 2023, InfluxData SQL Support 41
  • 42. | © Copyright 2023, InfluxData SQL Functions Supported by Jan 31 • Supported Statements: SELECT, FROM, WHERE, GROUP BY, ORDER BY, JOIN (left and inner), WITH clause, HAVING, UNION, LIMIT, OVER • Subqueries: EXISTS, NOT EXISTS, IN, NOT IN • Agg Functions: COUNT(), AVG(), MEAN(), SUM(), MIN(), MAX() • Time Series Functions: TIME_BUCKET_GAPFILL(), DATEBIN(), NOW() • Other: EXPLAIN, EXPLAIN ANALYZE 42
  • 43. | © Copyright 2023, InfluxData Interoperability plans • FlightSQL plugins (Timeline TBD): • Apache Superset • Tableau • PowerBI • Grafana 43
  • 44. | © Copyright 2023, InfluxData Survey 44 ● Please take 5 minutes to fill out this anonymous survey. Your feedback is extremely valuable to us. ● https://bit.ly/3imWP7Y
  • 45. | © Copyright 2023, InfluxData Get Started 45 influxdata.com/influxdb-engine-beta/
  • 46. | © Copyright 2023, InfluxData InfluxDB Community Slack workspace Please join us in the InfluxDB Community Slack at www.influxdata.com/slack. To participate in conversations, join the #influxdb_iox channel.
  • 47. | © Copyright 2023, InfluxData 47 influxdata.com https://github.com/InfluxCommunity
  • 48. | © Copyright 2023, InfluxData 48 Try it yourself https://www.influxdata.com https://github.com/InfluxCommunity Get Started
  • 49. | © Copyright 2023, InfluxData Related Blogs • Understanding InfluxDB IOx and the Commitment to Open Source • Querying Data in InfluxDB using Flux and SQL • Intro to InfluxDB IOx • Welcome to InfluxDB IOx: InfluxData’s New Storage Engine • The Journey of InfluxDB by Paul Dix • InfluxData Deploys Next-Generation InfluxDB Time Series Engine with Unlimited Scale • Announcing InfluxDB IOx - The Future Core of InfluxDB Built with Rust and Arrow • Evolving InfluxDB into the Smart Data Platform for Time Series • InfluxData is Building a Fast Implementation of Apache Arrow in Go Using c2goasm and SIMD • On InfluxData’s New Storage Engine. Q&A with Andrew Lamb • Apache Arrow, Parquet, Flight and Their Ecosystem are a Game Changer for OLAP 49
  • 50. | © Copyright 2023, InfluxData Get Help + Resources! 50 Forums: community.influxdata.com Slack: influxcommunity.slack.com GH: github.com/InfluxCommunity Book: awesome.influxdata.com Docs: docs.influxdata.com Blogs: influxdata.com/blog InfluxDB University: influxdata.com/university
  • 51. | © Copyright 2023, InfluxData Questions? 51
  • 52. | © Copyright 2023, InfluxData T H A N K Y O U