SQL Indexes |
Indexes are objects based on data in tables. Unlike views, indexes do indeed require physical storage. The SQL indexes are used to retrieve data super faster from database. By the end of this splash reading, you will understand the following:
- How to create indexes
- How to create composite indexes
- What indexes do
- When indexes should be used
- When it is better not to use an index
Using Indexes Effectively
Another way, besides views, to present data in a different format than it physically exists on the disk is to use an index. In addition, indexes can also reorder the data stored on the disk (something that views cannot do).
Indexes are used in a SQL database for four supreme reasons:
- To enforce referential integrity constraints by using the UNIQUE or PRIMARY KEY keywords.
- To facilitate the ordering of data based on the contents of the indexes field or fields
- To optimize the execution speed of queries
- To sort and store the data rows in the table or view based on their key values using Clustered Indexes.
What are Indexes?
Data can be retrieved from a database using two methods. The first method, often called the Sequential Access Method, requires SQL to go through each record looking for a match. This search method is inefficient, but is the only way for SQL to locate the correct record. Think back to the days when libraries had massive card catalog filing systems. Suppose the librarian removed the alphabetical index cards, tossed the cards into the air, and then placed them back into the filing cabinets.
When you wanted to look up this book's shelf location, you would probably start start at the very beginning, and then go through one card at a time until you found the information you wanted. Now suppose the librarian sorted the book titles alphabetically. You could quickly access this books information by using the knowledge of the alphabet to move through the catalog.
Imagine the flexibility if the librarian was diligent enough to not only sort the books by title, but also create another catalog sorted by authors name and another sorted by topic. This process would provide you, the library user, with a great deal of flexibility in retrieving information. Also, you would be able to retrieve you information in a fraction of the time it originally would have taken.
Adding indexes to you database enables SQL to use the Direct Access Method. SQL uses tree like structure to store and retrieve indexes data. Pointers to a group of data are stored at the top of the tree. These groups are called nodes. Each node contains pointers to other nodes. The nodes pointing to the left contain values that are less than its parent node. The pointers to the right point to values greater than the parent node.
The basic SQL syntax to create an index is as follows:
When you wanted to look up this book's shelf location, you would probably start start at the very beginning, and then go through one card at a time until you found the information you wanted. Now suppose the librarian sorted the book titles alphabetically. You could quickly access this books information by using the knowledge of the alphabet to move through the catalog.
Imagine the flexibility if the librarian was diligent enough to not only sort the books by title, but also create another catalog sorted by authors name and another sorted by topic. This process would provide you, the library user, with a great deal of flexibility in retrieving information. Also, you would be able to retrieve you information in a fraction of the time it originally would have taken.
Adding indexes to you database enables SQL to use the Direct Access Method. SQL uses tree like structure to store and retrieve indexes data. Pointers to a group of data are stored at the top of the tree. These groups are called nodes. Each node contains pointers to other nodes. The nodes pointing to the left contain values that are less than its parent node. The pointers to the right point to values greater than the parent node.
The basic SQL syntax to create an index is as follows:
For instance, to create an index on the BOOK_ID field of the BOOKS table, the CREATE INDEX statement would look like this:
If you want to remove the Index that you have created, The DROP INDEX statement removes an INDEX from the database. As usual, the DROP INDEX statement is very straightforward:
Now the BOOKS table is in its original form. Using the simplest form of the CREATE INDEX statement did not physically change the way the table was stored.
You may be wondering why database systems even provide indexes of they also enable you to use the ORDER BY clause. The difference is that an ORDER BY clause re-sorts and orders the data each time you execute the corresponding SQL statement. When using an index, the database system creates a physical index object and reuses the same index each time you query the table.
You can achieve performance gains by selecting the column with the most unique values. For instance, every value in the NAME field of the BOOKS table is unique. When using a compound index, place the most selective field first in the column list. That is, place the field that you expect to select most often at the beginning of the list.
Composite indexes are also used to combine two or more columns that, by themselves, may have low selectivity.
The field used to create the clustered index is usually the primary key field. Using Transact-SQL, you could create a clustered, unique index on the BOOK_ID field of the BOOK_LISTS table using the following syntax:
You should add tables only to clusters that are frequently joined. Do not add tables to clusters that are accessed individually through a simple select statement.
Clusters are a vendor-specific feature of SQL. To know more about Clustered Index, you can go through the Microsoft documentation here.
Now the BOOKS table is in its original form. Using the simplest form of the CREATE INDEX statement did not physically change the way the table was stored.
You may be wondering why database systems even provide indexes of they also enable you to use the ORDER BY clause. The difference is that an ORDER BY clause re-sorts and orders the data each time you execute the corresponding SQL statement. When using an index, the database system creates a physical index object and reuses the same index each time you query the table.
How to Index on more than One Field?
SQL also enables you to index on more than one field. This type of index is a composite index. The following code illustrates a simple composite index. Note that even though two fields are being combined, only one physical index in created.You can achieve performance gains by selecting the column with the most unique values. For instance, every value in the NAME field of the BOOKS table is unique. When using a compound index, place the most selective field first in the column list. That is, place the field that you expect to select most often at the beginning of the list.
Composite indexes are also used to combine two or more columns that, by themselves, may have low selectivity.
Using Clustered Indexes
A special type of index supported by many database systems allows the database manager or developer to cluster data. When tables are clustered, the data is stored in the same data blocks, allowing fewer database block reads and resulting in quicker performance. When a clustered index is used, the physical arrangement of the data within a table is modified. Using a clustered index usually results in faster data retrieval than using a traditional, non clustered index. However, many database systems allow only one clustered index per table.The field used to create the clustered index is usually the primary key field. Using Transact-SQL, you could create a clustered, unique index on the BOOK_ID field of the BOOK_LISTS table using the following syntax:
You should add tables only to clusters that are frequently joined. Do not add tables to clusters that are accessed individually through a simple select statement.
Clusters are a vendor-specific feature of SQL. To know more about Clustered Index, you can go through the Microsoft documentation here.
Comments