From 4b0292253cfca558b76c7a869ba0930a5e6d82fe Mon Sep 17 00:00:00 2001 From: Tom Lane Date: Tue, 22 Dec 2020 13:23:49 -0500 Subject: [PATCH] Improve autoprewarm's handling of early-shutdown scenarios. Bad things happen if the DBA issues "pg_ctl stop -m fast" before autoprewarm finishes loading its list of blocks to prewarm. The current worker process successfully terminates early, but (if this wasn't the last database with blocks to prewarm) the leader process will just try to launch another worker for the next database. Since the postmaster is now in PM_WAIT_BACKENDS state, it ignores the launch request, and the leader just sits until it's killed manually. This is mostly the fault of our half-baked design for launching background workers, but a proper fix for that is likely to be too invasive to be back-patchable. To ameliorate the situation, fix apw_load_buffers() to check whether SIGTERM has arrived just before trying to launch another worker. That leaves us with only a very narrow window in each worker launch where SIGTERM could occur between the launch request and successful worker start. Another issue is that if the leader process does manage to exit, it unconditionally rewrites autoprewarm.blocks with only the blocks currently in shared buffers, thus forgetting any blocks that we hadn't reached yet while prewarming. This seems quite unhelpful, since the next database start will then not have the expected prewarming benefit. Fix it to not modify the file if we shut down before the initial load attempt is complete. Per bug #16785 from John Thompson. Back-patch to v11 where the autoprewarm code was introduced. Discussion: https://postgr.es/m/16785-c0207d8c67fb5f25@postgresql.org --- contrib/pg_prewarm/autoprewarm.c | 30 +++++++++++++++++++++++++----- 1 file changed, 25 insertions(+), 5 deletions(-) diff --git a/contrib/pg_prewarm/autoprewarm.c b/contrib/pg_prewarm/autoprewarm.c index 1d2ac2cabc..edec1dfd0b 100644 --- a/contrib/pg_prewarm/autoprewarm.c +++ b/contrib/pg_prewarm/autoprewarm.c @@ -157,6 +157,7 @@ void autoprewarm_main(Datum main_arg) { bool first_time = true; + bool final_dump_allowed = true; TimestampTz last_dump_time = 0; /* Establish signal handlers; once that's done, unblock signals. */ @@ -197,10 +198,15 @@ autoprewarm_main(Datum main_arg) * There's not much point in performing a dump immediately after we finish * preloading; so, if we do end up preloading, consider the last dump time * to be equal to the current time. + * + * If apw_load_buffers() is terminated early by a shutdown request, + * prevent dumping out our state below the loop, because we'd effectively + * just truncate the saved state to however much we'd managed to preload. */ if (first_time) { apw_load_buffers(); + final_dump_allowed = !got_sigterm; last_dump_time = GetCurrentTimestamp(); } @@ -258,7 +264,8 @@ autoprewarm_main(Datum main_arg) * Dump one last time. We assume this is probably the result of a system * shutdown, although it's possible that we've merely been terminated. */ - apw_dump_now(true, true); + if (final_dump_allowed) + apw_dump_now(true, true); } /* @@ -391,6 +398,18 @@ apw_load_buffers(void) if (!have_free_buffer()) break; + /* + * Likewise, don't launch if we've already been told to shut down. + * + * There is a race condition here: if the postmaster has received a + * fast-shutdown signal, but we've not heard about it yet, then the + * postmaster will ignore our worker start request and we'll wait + * forever. However, that's a bug in the general background-worker + * logic, not the fault of this module. + */ + if (got_sigterm) + break; + /* * Start a per-database worker to load blocks for this database; this * function will return once the per-database worker exits. @@ -408,10 +427,11 @@ apw_load_buffers(void) apw_state->pid_using_dumpfile = InvalidPid; LWLockRelease(&apw_state->lock); - /* Report our success. */ - ereport(LOG, - (errmsg("autoprewarm successfully prewarmed %d of %d previously-loaded blocks", - apw_state->prewarmed_blocks, num_elements))); + /* Report our success, if we were able to finish. */ + if (!got_sigterm) + ereport(LOG, + (errmsg("autoprewarm successfully prewarmed %d of %d previously-loaded blocks", + apw_state->prewarmed_blocks, num_elements))); } /*