2012-04-17NOT NULL Constraints
To index an IS NULL condition in the Oracle Database, the index must have a column that can never be NULL.
But it is not enough that there are no NULL entries. The database has to know that there cannot be any NULL entry. Otherwise, the database must assume there are rows that are not in the index.
The following index supports the select only if the LAST_NAME column has a NOT NULL constraint:
DROP INDEX emp_dob;
CREATE INDEX emp_dob_name
ON employees (date_of_birth, last_name);
SELECT *
FROM employees
WHERE date_of_birth IS NULL;
--------------------------------------------------------------- |Id |Operation | Name | Rows | Cost | --------------------------------------------------------------- | 0 |SELECT STATEMENT | | 1 | 3 | | 1 | TABLE ACCESS BY INDEX ROWID| EMPLOYEES | 1 | 3 | |*2 | INDEX RANGE SCAN | EMP_DOB_NAME | 1 | 2 | --------------------------------------------------------------- Predicate Information (identified by operation id): --------------------------------------------------- 2 - access("DATE_OF_BIRTH" IS NULL)
Removing the NOT NULL constraint makes the index unusable for this query:
ALTER TABLE employees MODIFY last_name NULL; SELECT * FROM employees WHERE date_of_birth IS NULL; ---------------------------------------------------- | Id | Operation | Name | Rows | Cost | ---------------------------------------------------- | 0 | SELECT STATEMENT | | 1 | 477 | |* 1 | TABLE ACCESS FULL| EMPLOYEES | 1 | 477 | ----------------------------------------------------
Tip
A missing NOT NULL constraint can prevent using an index. Particularly for count(*) queries.
Besides NOT NULL constraints, the database also know that constant expressions like in the previous section cannot become NULL.
An index on a user defined function, however, does not impose any NOT NULL constraint on the index expression:
CREATE OR REPLACE FUNCTION blackbox(id IN NUMBER) RETURN NUMBER DETERMINISTIC IS BEGIN RETURN id; END; DROP INDEX emp_dob_name; CREATE INDEX emp_dob_bb ON employees (date_of_birth, blackbox(employee_id));
SELECT * FROM employees WHERE date_of_birth IS NULL; ---------------------------------------------------- | Id | Operation | Name | Rows | Cost | ---------------------------------------------------- | 0 | SELECT STATEMENT | | 1 | 477 | |* 1 | TABLE ACCESS FULL| EMPLOYEES | 1 | 477 | ----------------------------------------------------
The function name BLACKBOX emphasizes the fact that the optimizer has no idea what the function does (see Section 2.1). We can see that the function passes the input value straight through. But for the database, it is just a user defined function that returns a numeric value. The NOT NULL property of the parameter is lost. Although the index has all rows, the database does not know it. It can therefore not use the index for the query.
But if you know that the function never returns NULL, you can change the query accordingly:
SELECT * FROM employees WHERE date_of_birth IS NULL AND blackbox(employee_id) IS NOT NULL; ------------------------------------------------------------- |Id |Operation | Name | Rows | Cost | ------------------------------------------------------------- | 0 |SELECT STATEMENT | | 1 | 3 | | 1 | TABLE ACCESS BY INDEX ROWID| EMPLOYEES | 1 | 3 | |*2 | INDEX RANGE SCAN | EMP_DOB_BB | 1 | 2 | -------------------------------------------------------------
The extra condition in the where clause only selects rows that are in the index—per definition.
There is, unfortunately, no way to tag a function that never returns NULL. But you can move the function call to a virtual column (since 11g) and put a NOT NULL constraint on this column.
ALTER TABLE employees ADD bb_expression GENERATED ALWAYS AS (blackbox(employee_id)) NOT NULL; DROP INDEX emp_dob_bb; CREATE INDEX emp_dob_bb ON employees (date_of_birth, bb_expression);
SELECT * FROM employees WHERE date_of_birth IS NULL; ------------------------------------------------------------- |Id |Operation | Name | Rows | Cost | ------------------------------------------------------------- | 0 |SELECT STATEMENT | | 1 | 3 | | 1 | TABLE ACCESS BY INDEX ROWID| EMPLOYEES | 1 | 3 | |*2 | INDEX RANGE SCAN | EMP_DOB_BB | 1 | 2 | -------------------------------------------------------------
The Oracle Database knows that some functions only return NULL if NULL is provided as input. But that works only with for internal functions:
DROP INDEX emp_dob_bb;
CREATE INDEX emp_dob_upname
ON employees (date_of_birth, upper(last_name));
SELECT *
FROM employees
WHERE date_of_birth IS NULL;
----------------------------------------------------------
|Id |Operation | Name | Cost |
----------------------------------------------------------
| 0 |SELECT STATEMENT | | 3 |
| 1 | TABLE ACCESS BY INDEX ROWID| EMPLOYEES | 3 |
|*2 | INDEX RANGE SCAN | EMP_DOB_UPNAME | 2 |
----------------------------------------------------------
The UPPER function preserves the NOT NULL property of the LAST_NAME column. Removing the constraint, however, renders the index unusable:
ALTER TABLE employees MODIFY last_name NULL; SELECT * FROM employees WHERE date_of_birth IS NULL; ---------------------------------------------------- | Id | Operation | Name | Rows | Cost | ---------------------------------------------------- | 0 | SELECT STATEMENT | | 1 | 477 | |* 1 | TABLE ACCESS FULL| EMPLOYEES | 1 | 477 | ----------------------------------------------------

share and subscribe
RSS FeedFlattr this! Follow me on TwitterShare at Google+Like on Facebook