Annual campaign playbook (operators)

Goal: keep the annual capture cycle predictable and operationally boring.

Canonical references:

Annual scope/seeds (source of truth): ../../annual-campaign.md
Automation implementation plan: ../../automation-implementation-plan.md
systemd units + enablement: ../../../deployment/systemd/README.md

Before Jan 01 UTC (readiness)

Review and update scope/seeds in ../../annual-campaign.md (docs-only change).
Ensure you have storage headroom and backups are healthy.
Run the crawl preflight audit (recommended):
YEAR=2026; ./scripts/vps-preflight-crawl.sh --year "$YEAR"
See: crawl-preflight.md
If the annual scheduler is enabled, dry-run it:
sudo systemctl start healtharchive-schedule-annual-dry-run.service
sudo journalctl -u healtharchive-schedule-annual-dry-run.service -n 200 --no-pager
Enable the annual campaign sentinel (recommended; sends notification on failure):
sudo systemctl enable --now healtharchive-annual-campaign-sentinel.timer
Optional: configure Healthchecks ping URL at /etc/healtharchive/healthchecks.env:
- HEALTHARCHIVE_HC_PING_ANNUAL_SENTINEL=https://hc-ping.com/UUID_HERE
- Note: this env file may also contain legacy HC_* variables (DB backup + disk check).

During/after the campaign (high level)

Use the automation plan (../../automation-implementation-plan.md) to decide what is enabled and what is manual.
Prefer safe, idempotent entrypoints (systemd services/timers or the provided scripts).
Annual jobs are scheduled with crawler monitoring enabled so stalls / error storms can trigger adaptive worker reduction.

If a crawl stalls or is interrupted

If a crawl is stalled, check the job logs under its output_dir (look for archive_*_attempt_*_*.combined.log) and the worker journal:
sudo journalctl -u healtharchive-worker -n 200 --no-pager
If the VPS reboots (or the worker/service is killed) mid-crawl, a job can be left in status=running. Recover safely:
Load env: set -a; source /etc/healtharchive/backend.env; set +a
Dry-run: /opt/healtharchive/.venv/bin/healtharchive recover-stale-jobs --older-than-minutes 180
Apply: /opt/healtharchive/.venv/bin/healtharchive recover-stale-jobs --older-than-minutes 180 --apply

Manual trigger (day-of)

If you want to run the annual sentinel immediately (safe; read-only except for metrics output):

sudo systemctl start healtharchive-annual-campaign-sentinel.service
sudo journalctl -u healtharchive-annual-campaign-sentinel.service -n 200 --no-pager

What “done” means

Annual scope is current in ../../annual-campaign.md.
If automation is enabled, the scheduler and follow-up tasks run as intended and are verifiable in logs/artifacts.