PostgreSQL – How to Efficiently Delete Records When Counter Reaches Zero

cursorsdeletepostgresqlpostgresql-performanceupdate

Consider a record with a counter field, which is to be decremented. When its value reaches zero (which is the common case), I want the record to be deleted. What is the most efficient way to do this in PostgreSQL?

The naive way involves two SQL statements and two searches in the table: a SELECT to fetch the counter value, followed by a DELETE or an UPDATE.

One alternative involves an UPDATE … RETURNING, followed by a DELETE only if the returned value of the counter is zero. However, this performs an unneeded record change in the case of a counter having a value of one, optimizing the uncommon case (counter has a value higher than one) at the expense of the expected common case.

Another alternative, which indeed optimizes the expected common case, involves a DELETE … WHERE … AND counter = 1, followed by an UPDATE when no deletion takes place.

Both alternatives may require a wasteful second search for the record in the table.

Can perhaps the two table searches be always avoided and the operation's efficiency increased by using a cursor? I haven't seen an example for this use case in the PostgreSQL documentation.

Best Answer

Targeting a single row, this avoids an "unneeded record change", i.e. writing a new row version without need:

DO
$$
BEGIN
   DELETE FROM tbl WHERE … AND counter = 1;  -- common case first!

   IF NOT FOUND THEN
      UPDATE tbl SET counter = counter - 1 WHERE …;
   END IF;
END
$$;

Should also be cheaper than a trigger solution, where a trigger function is executed for every affected row (and may or may not interfere).

However, this will not fly with possible concurrent write operations. When two or more transactions try the same at virtually the same time, the logic can break. Or additional rows could become visible in between the two commands in default READ COMMITTED transaction isolation.

This is inherent to the problem itself, rather than to my solution. (Applies to other solutions all the same.) Under concurrent write load, you'll have to at least write-lock the row to avoid race conditions, so you are back to finding the row twice. You can avoid writing rows without need (like UPDATE + DELETE) in any case, though.

Related Solutions

Postgresql – WorkAround for PHP PDO(with libpq V 9.1.4) binding for use of CITEXT

Locate the namespace inside which the citext type resides:

select nspname from pg_type t join pg_namespace n
  on n.oid=t.typnamespace where typname='citext';

Prepend that namespace (normally, 'public', but it might be different in your case and it might explain the problem) to the cast to citext:

$sql = "SELECT column1 from schema1.column1 where column1 = 'value'::public.citext";

If that solves the problem but in a way you find inelegant, you might reconsider how you're using schemas and search_path: make sure that all your custom types are accessible no matter what.

SQL Server 2012 – How Many Records to Delete in Batch Delete?

Note that regardless of whether you trigger a table lock, a 1000+ record INSERT transaction will lock many pages in your indexes, and has a high risk of causing deadlocking or other problems. The good news is that your post does not seem to provide a business case for taking on a such a large transactional operation.

The data source table is not going to be read by other users...

Is it crucial for the business to ensure that a large block records has been transferred in a large batch, such that anyone accessing the source table will not see them at the exact point in time anyone access the target time will be able to see them? Or, is it necessary for users of the target table to see large blocks of records appear in the target table all exactly at the same time?

If no to both of those, I second Spörri's below statement...

If you have a clustered index on an identity column, which is not fragmented and you are deleting records from a range thats old you can safely ask the engine to lock the extends that the old records are using and delete ~64Kb at a time without fearing of blocking readers.

Now 64KB is not a very large value. If your records are each 1000 bytes, that means you are talking at most 64 records. As such, I recommend you do these transactions one record at a time. If the migration terminates on an error, one of these records will get rolled back, and you can certainly pick up where you left off because the source table and target table will be in alignment. This will ensure maximum availability for your users.

Best Answer

Related Solutions

Postgresql – WorkAround for PHP PDO(with libpq V 9.1.4) binding for use of CITEXT

SQL Server 2012 – How Many Records to Delete in Batch Delete?

Related Question