High performance, as measured by sub-millisecond query response times, is a key characteristic of Redis and one of the main reasons it is the most popular key-value database in the world.
To keep improving performance across all of the different Redis components, we’ve developed a framework for automatically triggering performance tests, telemetry gathering, profiling, and data visualization on each code commit.
In this talk, we describe how this automation and “zero-touch” profiling scaled our ability to pursue performance regressions and to find opportunities to make our code more efficient, helping us as a company start shifting from a reactive to a more proactive performance mindset.
Open Source Experience Conference 2022
1. E2E Performance Testing, Profiling, and Analysis at Redis
Open Source Experience Conference, Nov 2022
Filipe Oliveira
Senior Performance Engineer
@Redis
4. PERFORMANCE @REDIS
OSS REDIS + Redis Ltd Projects
1. foster benchmark and observability standards
2. support contributions to the OSS projects
3. optimize an industry-leading solution
5. Ordinarily, on Our Company’s Core Products
We have...
● extensive automated tests to catch functional failures
...but when
● we accidentally commit a performance regression, nothing intercepts it*!
7. A Real Case From 2019
Simple request
1. RediSearch minor version bump
2. Required multiple patches
a. Feedback cycle took us at least 1 day
b. prioritized over other projects
c. Siloed
d. Jul. 30, Nov. 27, 2019
You can relate to...
● your team runs performance tests before releasing
8. Ordinarily, on Our Company’s Core Products
You can state...
● your team runs performance tests before releasing
...but solving slowdowns just before releasing is...
● dangerous
● time-consuming
● one of the most difficult tasks to estimate
...is just buffering potential issues!
9. Goal: Reduce Feedback Cycle. Avoid Silos
Requirements for valid tests
- Stable testing environment
- Deterministic testing tools
- Deterministic outcomes
- Reduced testing/probing overhead
- Reduce tested changes to the minimum
Requirements for acceptance in products
- Acceptable duration
- No manual work
- Actionable items
- Well-defined key performance indicators (see the sketch below)
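To make “actionable items” and “well-defined KPIs” concrete, here is a minimal sketch of what an automated acceptance gate could look like. The metric names (ops_per_sec, p50_latency_ms), the file layout, and the 5% threshold are illustrative assumptions, not the actual Redis framework.

```python
# Minimal sketch of an automated KPI gate; names and threshold are assumptions.
import json
import sys

REGRESSION_THRESHOLD = 0.05  # fail if a KPI worsens by more than 5% vs. baseline

def check_kpis(baseline_path: str, result_path: str) -> int:
    with open(baseline_path) as f:
        baseline = json.load(f)
    with open(result_path) as f:
        result = json.load(f)

    failures = []
    for metric in ("ops_per_sec", "p50_latency_ms"):  # hypothetical KPI names
        base, new = baseline[metric], result[metric]
        # Higher throughput is better; lower latency is better.
        delta = (new - base) / base if metric == "ops_per_sec" else (base - new) / base
        if delta < -REGRESSION_THRESHOLD:
            failures.append(f"{metric}: {base} -> {new} ({delta:+.1%})")

    if failures:
        print("PERFORMANCE REGRESSION DETECTED:\n  " + "\n  ".join(failures))
        return 1
    print("All KPIs within threshold.")
    return 0

if __name__ == "__main__":
    sys.exit(check_kpis(sys.argv[1], sys.argv[2]))
```

A non-zero exit code makes the result actionable: CI can block the merge and point the author at the exact metric that moved.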
(Pipeline diagrams: before — CODE REVIEW → PREVIEW/UNSTABLE → RELEASE, with a single MANUAL PERF CHECK at the end; after — a ZERO TOUCH PERF CHECK at each of the CODE REVIEW, PREVIEW/UNSTABLE, and RELEASE stages.)
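The “zero touch” boxes above amount to running the same check automatically at every stage instead of a manual pass just before release. A minimal sketch of such a hook, with stub helpers standing in for the real build/benchmark/compare stages (none of these names are from the actual Redis CI):

```python
# Illustrative "zero touch" perf check: every commit gets built, benchmarked,
# and compared against a baseline with no human in the loop.
import subprocess

def build(sha: str) -> None:
    subprocess.run(["git", "checkout", sha], check=True)
    subprocess.run(["make", "-j"], check=True)  # build the server at this commit

def run_benchmarks(sha: str) -> dict:
    # Stub: the real framework would run the full registered suite here.
    return {"ops_per_sec": 180_000.0}

def compare_to_baseline(results: dict, baseline: dict) -> list[str]:
    # Flag any metric that drops more than 5% below the baseline.
    return [k for k, v in results.items() if v < baseline[k] * 0.95]

def zero_touch_perf_check(sha: str, baseline: dict) -> None:
    build(sha)
    regressions = compare_to_baseline(run_benchmarks(sha), baseline)
    if regressions:
        raise SystemExit(f"commit {sha} regressed: {regressions}")
```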
10. This is Not New/Disruptive
Elastic
https://elasticsearch-benchmarks.elastic.co/#
Lucene
https://home.apache.org/~mikemccand/lucenebench/
13. Our Approach
(Charts: benchmark results by branch, by version, and scalability analysis)
1. Initial focus on OSS deployments
2. local and remote triggers (see the sketch below)
3. Used for testing, profiling
a. Regression analysis and fixes
b. Approval of features
c. Proactive optimization
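As an illustration of what one locally triggered test run might execute, here is a sketch that drives Redis with memtier_benchmark (the open source load generator commonly used with Redis). The flags shown are real memtier options; the thread/connection counts, the duration, and the JSON handling are assumptions, since the output layout varies by version.

```python
# Sketch of a locally triggered benchmark run against a Redis instance.
import json
import subprocess

def run_local_benchmark(host: str = "127.0.0.1", port: int = 6379) -> dict:
    subprocess.run(
        ["memtier_benchmark",
         "-s", host, "-p", str(port),
         "-t", "4", "-c", "50",          # 4 threads x 50 connections (illustrative)
         "--test-time", "60",            # fixed-duration run for stable numbers
         "--hide-histogram",
         "--json-out-file", "run.json"],
        check=True,
    )
    with open("run.json") as f:
        return json.load(f)   # exact JSON layout depends on memtier version
```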
16. Our Approach
Profiling artifacts produced on each run:
1. Full process Flame Graph + main thread Flame Graph
2. perf report per dso
3. perf report per dso,sym (w/wout callgraph)
4. perf report per dso,sym,srcline (w/wout callgraph)
5. identical stacks collapsed
6. hotpath callgraph
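All of the artifacts above can be derived from a single perf recording. A hedged sketch of how such a pipeline might produce them: the --sort keys are real perf options, while the FlameGraph script paths assume Brendan Gregg’s FlameGraph repo is checked out alongside.

```python
# Sketch: turn one perf recording into per-dso/sym/srcline reports + flame graph.
import subprocess

def profile(pid: int, seconds: int = 60) -> None:
    # Sample on-CPU stacks of the running redis-server process.
    subprocess.run(
        ["perf", "record", "-g", "--pid", str(pid), "--", "sleep", str(seconds)],
        check=True,
    )
    # Artifacts 2-4: text reports aggregated per dso / symbol / source line.
    for keys in ("dso", "dso,symbol", "dso,symbol,srcline"):
        with open(f"report_{keys.replace(',', '_')}.txt", "w") as out:
            subprocess.run(["perf", "report", "--stdio", "--sort", keys],
                           stdout=out, check=True)
    # Artifacts 1 and 5: collapse identical stacks, then render a flame graph.
    script = subprocess.run(["perf", "script"], capture_output=True, check=True)
    collapsed = subprocess.run(["./FlameGraph/stackcollapse-perf.pl"],
                               input=script.stdout, capture_output=True, check=True)
    with open("flame.svg", "wb") as svg:
        subprocess.run(["./FlameGraph/flamegraph.pl"],
                       input=collapsed.stdout, stdout=svg, check=True)
```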
18. What We’ve Gained
● Up to 68% performance boost on the covered commands
● Dramatically reduced the feedback cycle (days → 1 hour)
● Devs can easily add tests (243 full suites)
● Scaled with the team and took on more challenging work!
19. What We’ve Gained
● Finding performance improvements is now everyone’s power/responsibility
● A/B test new tech / state-of-the-art HW/SW components
● Continuous, up-to-date numbers for the use cases that matter
● Foster openness
20. What’s Next
● aggregate performance data across a group of benchmarks
● better statistical analysis methods (see the sketch below)
● more visibility across the API
● Increase OSS / Company adoption
○ expose data in the docs
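“Better statistical analysis methods” could mean, for example, treating each benchmark as repeated samples and flagging a regression only when the difference clears the measurement noise. An illustrative sketch using the standard library; the choice of test (a one-sided two-sample z-test) and the alpha level are assumptions, not the methods the team has committed to.

```python
# Flag a regression only when the throughput drop is statistically significant.
from statistics import NormalDist, mean, stdev

def significantly_slower(baseline: list[float], candidate: list[float],
                         alpha: float = 0.05) -> bool:
    """One-sided two-sample z-test on throughput samples (normal approximation)."""
    n1, n2 = len(baseline), len(candidate)
    se = (stdev(baseline) ** 2 / n1 + stdev(candidate) ** 2 / n2) ** 0.5
    z = (mean(baseline) - mean(candidate)) / se
    # p-value for "candidate's mean throughput is lower than baseline's"
    return (1 - NormalDist().cdf(z)) < alpha

# e.g. significantly_slower([201e3, 199e3, 200e3], [188e3, 190e3, 189e3]) -> True
```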