This is a presentation that describes how Oracle uses histograms to make decisions on SQL query execution. To see the actual webinar and demo, go to https://portal.hotsos.com/events/webinars/
Note that without properly collected statistics, the CBO will do one of two things. If no statistics exist for any object used in the SQL statement, the CBO may use rule-based optimization (prior to v10) or use dynamic sampling. If statistics exist for some objects but not others in the SQL statement, the CBO may use a set of default statistics for the objects without statistics, or use dynamic sampling.

CBO default statistics for objects without collected stats (prior to v10; in v10, dynamic sampling is typically used instead of defaults):

Table defaults:
- cardinality: number of blocks * (block size - cache layer) / average row length
- average row length: 100 bytes
- number of blocks: 100, or the actual value based on the extent map
- remote cardinality (distributed): 2,000 rows
- remote average row length: 100 bytes

Index defaults:
- levels: 1
- leaf blocks: 25
- leaf blocks/key: 1
- data blocks/key: 1
- distinct keys: 100
- clustering factor: 800
Plot A illustrates a situation in which the execution plan does not change, but the query response time varies significantly as the number of rows in the table changes. This kind of thing occurs when an application chooses a TABLE ACCESS (FULL) execution plan for a growing table. It's what causes RBO-based applications to appear fast in a small development environment, but then behave poorly in the production environment.

Plot B illustrates the marginal improvement that's achievable, for example, by distributing an inefficient application's workload more uniformly across the disks in a disk array. Notice that the execution plan (or "shape of the performance curve") isn't necessarily changed by such an operation (although, if the output of dbms_stats.gather_system_statistics changes as a result of the configuration change, then the plan might change). The performance for a given number of rows might change, however, as the plot here indicates.

Plot C illustrates what is commonly the most profound type of performance change: an execution plan change. This situation can be caused by a change to any of the CBO's inputs. For example, an accidental deletion of a segment's statistics can change a plan from a nice fast plan (depicted by the green curve, which is O(log n)) to a horrifically slow plan (depicted by the red curve, which is O(n^2)). The phenomenon illustrated in plot C is what has happened when a query that was fast last week now runs for 14 hours without completing before you finally give up and kill the session.
Since the CBO determines the selectivity of predicates that appear in queries, it is important that there be adequate information for the CBO to make its estimates properly. By gathering histogram data, the CBO can make improved selectivity estimates in the presence of data skew, resulting in better execution plans when data distributions are non-uniform. The histogram approach provides an efficient and compact way to represent data distributions. Selectivity estimates are used to decide when to use an index and the order in which to join tables. Many table columns are not uniformly distributed, so the normal calculations for selectivity may not be accurate without the use of histograms.
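To make the arithmetic concrete: without a histogram, the CBO estimates the selectivity of an equality predicate as 1/num_distinct, which is badly wrong for skewed data. A minimal sketch of gathering a histogram on a skewed column (the SALES table and STATUS column are hypothetical):

```sql
-- Hypothetical skewed column: STATUS has 2 distinct values, but
-- 99% of rows have STATUS = 0 and only 1% have STATUS = 1.
-- Without a histogram: estimated selectivity = 1/2 for either value.
-- With a frequency histogram: ~0.99 for 0 and ~0.01 for 1.
BEGIN
  DBMS_STATS.GATHER_TABLE_STATS(
    ownname    => USER,
    tabname    => 'SALES',
    method_opt => 'FOR COLUMNS STATUS SIZE 254');
END;
/
```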
Height-balanced histograms put approximately the same number of values into each interval, so the endpoints of the intervals are determined by how many values fall into each one. Only the last (largest) value in each bucket appears as a bucket end point value. A height-balanced histogram is created when the specified number of histogram buckets ( SIZE ) is smaller than the number of distinct values in the column. Frequency histograms (sometimes called value-based histograms) are created when the number of histogram buckets ( SIZE ) specified is greater than or equal to the number of distinct column values. In a frequency histogram, every individual value in the column has a corresponding bucket, and the bucket number reflects the repetition count of each value. The type of histogram is stored in the HISTOGRAM column of the *_TAB_COL_STATISTICS views; the column can have the values HEIGHT BALANCED, FREQUENCY , or NONE . The SIZE of a histogram can be set by you or automatically by Oracle when the histogram is collected. The default SIZE (when no SIZE is specified) is 75. The maximum SIZE is 254.
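You can see which histogram type Oracle chose by querying the dictionary directly (the SALES table name is hypothetical):

```sql
-- FREQUENCY appears when SIZE >= the number of distinct values;
-- HEIGHT BALANCED appears when SIZE < the number of distinct values;
-- NONE means no histogram was collected for the column.
SELECT column_name, num_distinct, num_buckets, histogram
  FROM user_tab_col_statistics
 WHERE table_name = 'SALES';
```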
DBMS_STATS Constants

SIZE REPEAT: Causes the histograms to be created with the same options as the last time you created them. Oracle reads the data dictionary to figure out what to do.

SIZE AUTO: Oracle looks at the data and, using an undocumented (and changing) algorithm, decides for itself which columns to gather stats on and how many buckets to use. It collects histograms in memory only for those columns which are used by your applications (columns appearing in a predicate involving an equality, range, or LIKE operator). It knows that a particular column was used by an application because, at parse time, it stores workload information in the SGA. It then stores histograms in the data dictionary only for columns with skewed data (those worthy of a histogram).

SIZE SKEWONLY: When you collect histograms with the SIZE option set to SKEWONLY, Oracle collects histogram data in memory for all specified columns (if you do not specify any, all columns are used). Once an "in-memory" histogram is computed for a column, it is stored in the data dictionary only if the column has "popular" values (multiple end points with the same value, which is what is meant by "there is skew in the data").
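Each of these constants is passed through the METHOD_OPT argument. A sketch of all three (the SALES table is hypothetical):

```sql
BEGIN
  -- Let Oracle decide which columns need histograms, based on
  -- recorded workload and observed skew:
  DBMS_STATS.GATHER_TABLE_STATS(USER, 'SALES',
    method_opt => 'FOR ALL COLUMNS SIZE AUTO');

  -- Compute histograms for all columns, but store only those
  -- whose data is actually skewed:
  DBMS_STATS.GATHER_TABLE_STATS(USER, 'SALES',
    method_opt => 'FOR ALL COLUMNS SIZE SKEWONLY');

  -- Re-gather using the same histogram options as last time:
  DBMS_STATS.GATHER_TABLE_STATS(USER, 'SALES',
    method_opt => 'FOR ALL COLUMNS SIZE REPEAT');
END;
/
```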
In Oracle version 8, the use of bind variables in a predicate effectively disables the use of histograms. This is because the optimizer needs to know the value ( WHERE col = 'x' ) in order to check the histogram statistics for selectivity for that value. When a bind variable is used, it is not actually bound into the query until execution time. Since the execution plan is determined in the parse phase, the optimizer won't know the value and thus can't use the histogram to make its decision.

In Oracle version 9, the optimizer behavior regarding bind variables changed slightly: when a query is initially parsed, the optimizer will "peek" at the value of the bind variable and use the value it finds to make its decisions. Does that make the situation better or worse? It depends. Let's say that when the query is initially parsed, it has a bind variable value of 1 in the predicate. If the column has a histogram and the histogram indicates that selectivity is low for that value (few rows match), the optimizer will likely choose to use an index on that column if one is available. Everything works well, performance is sub-second, and everyone is happy. Now, what happens if the query is executed a second time but passes the value 0 in the bind variable, and the selectivity for the value 0 is high (lots of rows match)? The original plan is still used, and the query will attempt to use the same index. If there are thousands of matching records in the row source, it is likely that the index scan will perform significantly worse than simply doing a full table scan. In this case, everything works, but performance stinks and complaints arise.

So, what do you do? For some, the best solution is to avoid bind variables when a column has a limited number of values and those values are skewed, and to just hard-code the value you need. The best way to know what to do is to test different approaches and find what works best for your environment.
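The scenario above can be sketched in SQL*Plus (the table, column, and data distribution are all hypothetical):

```sql
-- Hypothetical skew: STATUS = 1 is rare (selective),
-- STATUS = 0 matches most of the rows in SALES.
VARIABLE status NUMBER

EXEC :status := 1
-- First parse: the optimizer peeks at :status = 1, sees low
-- selectivity in the histogram, and picks an index range scan.
SELECT COUNT(*) FROM sales WHERE status = :status;

EXEC :status := 0
-- Same SQL text, so the cached cursor (and its index plan) is
-- reused, even though STATUS = 0 would favor a full table scan.
SELECT COUNT(*) FROM sales WHERE status = :status;
```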
The RBO workaround is forgivable because it's all the RBO environment could offer as an option. The CBO technique shown here is particularly bad because it makes the application less flexible and therefore less able to respond appropriately to system changes. Ideally, if you (the developer) already know that the data in certain columns tends to skew, you can write code to account for it. A good guideline is to look at the number of distinct values in the column. If the column has only a few distinct values, hard-coding the value will allow the optimizer to correctly choose the plan based on histogram data. If there are many distinct values, but you know the actual skewed values in advance, you could write conditional code that uses a bind variable in all cases except when one of the known skewed values is requested. In that case, the conditional code would branch to a version of the SQL statement that hard-codes the skewed value.
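The conditional approach could be sketched in PL/SQL roughly like this (the procedure, the SALES table, and the assumption that 0 is the known skewed value are all hypothetical):

```sql
CREATE OR REPLACE PROCEDURE get_sales(
    p_status IN  NUMBER,
    p_result OUT SYS_REFCURSOR) IS
BEGIN
  IF p_status = 0 THEN
    -- Known skewed value: hard-code the literal so the optimizer
    -- can see it, consult the histogram, and pick a full scan.
    OPEN p_result FOR
      SELECT * FROM sales WHERE status = 0;
  ELSE
    -- Selective values: static SQL, so p_status is passed as a
    -- bind variable and the cursor is shared across values.
    OPEN p_result FOR
      SELECT * FROM sales WHERE status = p_status;
  END IF;
END;
/
```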