DbSchema | Diagram Design Tips - The Secrets of Indexes & Fks
Indexes and foreign keys are great tools when confronted with large databases. They can be the answer to a good design and
great performance. In this article,, I will go through some tips that helped me understand how to use these tools efficiently
and streamline my work with complex databases.
Foreign Keys
It’s a common mistake to avoid creating foreign keys in a database because they negatively impact the performance.
It is true that foreign keys will impact INSERT, UPDATE and DELETE statements because they are data checking,
but they improve the overall performance of a database.
The main benefit of foreign keys is that they enforce data consistency, meaning that they keep the database clean.
As an example, consider the two tables below. If the two are missing foreign keys, and we delete a department, then the
employees associated with it will remain in the employee table as “bad records”. These scenarios happens very often and
leads to a database full of junk that decreases performance.
Without foreign keys:
With foreign keys:
Indexes
An index for a database is like a table of contents for a book. Searching for data based on a column that is part of the
index will allow us to make use of the index to quickly access the record.
Consider the employee table with an index on __firstname`. Executing the query below will use this index.
SELECT * from employees WHERE firstname = ?
Important things to know about indexes:
1.Databases Only Use One Index per Table and Query
Let’s say we create two indexes on the employee table, __idx_firstname and __idx_lastname
. If we execute the query below,
the database will decide to use only one of the two indexes.
SELECT * from employees WHERE firstname = ? and lastname=?
2.Index Comparison
When executing a query, the database will compare the indexes with regard to the number of different entries that they contain.
For example, if we have 2,000 different records in for __firstname and 5,000 different records for __lastname
, it is more probable
that using the __lastname` index will return a smaller number of rows that fit our search criteria. Therefore, it will use that index.
3.Composite Indexes
An index that contains two or more columns is a composite index. When using a composite index, the query should always
contain the first column of the index.
For example, consider we have the index __idx(firstname, lastname)`. This index will work perfectly on the next query:
SELECT * from employees WHERE firstname = ? and lastname=?
But won’t work for this one:
SELECT * from employees WHERE lastname=?
4.Clustered Indexes
Giving the analogy from the beginning, in a clustered index, the table of contents is located at the end of the book.
In a non-clustered index, the table of contents is located in a different place, outside the book.
Clustered indexes are recommended when you have fewer updates in your data.
When creating a clustered index, the table itself becomes the index.
On the left side of the employees table, we can see how the indexes have different notations.
The __employee_id is a primary key, __firstname
and __lastname form a composite index, while __deparment_id
is a unique index.
Conclusion
Foreign keys are useful for keeping a database clean and won’t affect the SELECT statements.
Indexes help you quickly browse the database and find the data you need. Knowing how and when to use them will improve database performance and make your work easier.