Addison Wesley SQL Performance Tuning Sep 2002 ISBN 0201791692

Therefore the rule-based optimizer makes a plan:find matching rows using the index on column2.. And the whole table can be scanned using two pagereads, whereas an index lookup would take

Trang 1

The non-standard EXPLAIN statement (see Table 17-1) is thevital way to find out what the optimizer has done We haven'tmentioned it up to now because this book's primary goal has

been to show what you can do before the fact But EXPLAIN is

the way to measure whether your estimates correspond to

DBMS reality In many shops, it's customary to get an EXPLAINfor every SQL statement before submitting it for execution That

is quite reasonable What's perhaps less reasonable is the

custom of trying out every transformation one can think of andsubmitting them all for explanation That is mere floundering.Understanding principles—in other words, estimating what's

Trang 2

rows.) That is a narrow search, and usually it's faster to

perform a narrow search with a B-tree rather than scan all rows

in the table Therefore the rule-based optimizer makes a plan:find matching rows using the index on column2

index for column2 Those facts change everything The equalsoperation will match on 60% of the rows, so it's not a narrowsearch And the whole table can be scanned using two pagereads, whereas an index lookup would take three page reads(one to lookup in the index, two more to fetch the data later).Therefore the cost-based optimizer makes a different plan: findmatching rows using a table scan

Trang 3

optimizers, as you can see from Table 17-1 The claims don'tmean much by themselves What's important is whether theoptimizer estimates cost correctly and how it acts on the

"Updates" Statistics for the Optimizer

Informix Yes SET EXPLAIN UPDATE STATISTICSIngres Yes EXECUTE QEP optimizedb utility

InterBase Yes SELECT … PLAN SET STATISTICS

Microsoft Yes EXPLAIN UPDATE STATISTICSMySQL No EXPLAIN ANALYZE TABLE

Oracle Yes EXPLAIN PLAN

Trang 4

Claims to be CBO column

This column is "Yes" if the DBMS's documentation makesthe claim that it operates with a cost-based optimizer

"Explains" the Access Plan column

This column shows the non-standard statement provided bythe DBMS so that you can examine the access plan theoptimizer will use to resolve an SQL statement For

example, if you want to know how Oracle will resolve a

specific SELECT statement, just execute an EXPLAIN PLANFOR statement for the SELECT

"Updates" Statistics for the Optimizer column

This column shows the non-standard statement or utilitythe DBMS provides so that you can update volatile

information, or statistics, for the optimizer For example, ifyour DBMS is MySQL and you've just added many rows to atable and want the optimizer to know about them, just

execute an ANALYZE TABLE statement for the table

Trang 5

This glossary contains only terms that specifically are used forSQL optimization For terms that apply to the subject of SQL ingeneral, consult the 1,000-term glossary on our Web site,

ourworld.compuserve.com/homepages/OCELOTSQL/glossary.htm

Before the definition there may be a "Used by" note For

example, "Used by: Microsoft, Sybase" indicates that Microsoftand Sybase authorities prefer the term and/or definition Thewords "Used by: this book only" indicate a temporary and non-standard term that exists only for this book's purposes

Trang 6

Second normal form table, a 1NF table that contains onlycolumns that are dependent upon the entire primary key

3NF

Third normal form table, a 2NF table whose non-key

columns are also mutually independent; that is, eachcolumn can be updated independently of all the rest

Trang 7

Application Programming Interface, the method by which aprogrammer writing an application program can make

requests of the operating system or another application

applet

A Java program that can be downloaded and executed by abrowser

B

B-tree

A structure for storing index keys; an ordered, hierarchical,paged assortment of index keys Some people say the "B"stands for "Balanced."

back compression

Making index keys shorter by throwing bytes away from theback

See also [lock]

leaf (page of an index)

A page at the bottom level of a B-tree (the page at the toplevel is the root) Typically a leaf page contains pointers tothe data pages (if it's a non-clustered index) or to the dataitself (if it's a clustered index)

Trang 32

A method the DBMS uses to prevent concurrent

transactions from interfering with one another Physically, alock is one of three things: a latch, a mark on the wall, or aRAM record

locking level

See [granularity (of a lock)]

lock mode

The type of lock a DBMS has arranged for a transaction.Options are exclusive, shared, or update

Trang 33

Transaction #1's change never happened You can avoidLost Update by using an isolation level of READ

UNCOMMITTED or higher

LRU

Least-Recently-Used, an algorithm that replaces the pagethat hasn't been accessed for the longest time

M

mark on the wall

An ITL slot or mark put against a row by the DBMS Byputting a mark right beside the row accessed by a

Trang 34

materialize

See [materialization]

materialized view

A view whose rows take up space When you select from aview, the DBMS can elect to do one of two things: (a) it canget the rows from the original table, convert any derivedcolumns, and pass the results to the application or (b) itcan create a temporary table and put the rows from theoriginal table(s) into the temporary table, then select fromthe temporary copy The latter case results in a materializedview Materialization is often necessary when there is noone-to-one correspondence between the original table'srows and the view's rows (because there is a grouping) orwhen many tables are affected and concurrency would beharmed (because there is a join)

Trang 35

The process of finding a home for an expanding update.When a page overflows due to a data change that increasesthe length of a variable-length column, a row must be

Trang 36

Notice the for loop nested within a for loop

Trang 37

A problem arising with concurrent transactions The Non-repeatable Read by using an isolation level of REPEATABLEREAD or higher.

Trang 38

The process of designing a database so that its tables followthe rules specified by relational theory In practice, this

usually means that all database tables are in third normalform

process, according to rules based on relational theory In anormalized table, one set of columns is the primary key(which uniquely identifies a row of the table) and all othercolumns are functionally dependent upon the entire primarykey

Trang 39

Locking that assumes conflict is unlikely Generally, this

means avoiding locks and checking for conflict between twotransactions only after data changes have been made

outer table

The table in the outer loop of a nested-loop join When youwrite an SQL statement with an inner join, the outer table isdetermined by the DBMS based on its join strategy for that

Trang 40

of the join determines the outer table: for the join

expression Table1 LEFT JOIN Table2 the outer tablemust be Table1 and for Table1 RIGHT JOIN Table2 theouter table must be Table2

out-of-place update

Used by: Microsoft, Sybase A data change that causes arow to move

Trang 41

A fixed-size hopper that stores rows of data or index keys;

a minimal unit for disk I/O Depending on the DBMS, a page

is also called a data block, a block, a blocking unit, a controlinterval, or a row group

on separate disks Informix calls this fragmentation

Trang 42

A problem arising with concurrent transactions The

Phantom problem occurs when a transaction reads multiplerows twice; once before and once after another transactiondoes a data change that affects the search condition in thefirst transaction's reads The result is that Transaction #1gets a different (larger) result set back from its second

read You can avoid Phantoms by using an isolation level ofSERIALIZABLE

PL/SQL

Used by: Oracle

Trang 43

stored procedures

positioned delete

A DELETE statement that allows you to delete the row atthe current cursor position Syntax: DELETE … WHERE

CURRENT OF <cursor>

positioned update

An UPDATE statement that allows you to update the row atthe current cursor position Syntax: UPDATE … WHERE

CURRENT OF <cursor>

precompiler

A utility you use to "compile" SQL code before you compilethe host program, that is, a utility that converts SQL

statements in a host program to statements that a compilercan understand A remnant of embedded-SQL days; there is

no such thing as an SQL compiler

prepared statement

An SQL statement that has been parsed and planned, forexample, with ODBC's SQLPrepare function

Định dạng
Số trang	68
Dung lượng	245,74 KB