Quick Start: Manual Operations

Purpose: Quick reference for running scrapers and checking system health locally.


Prerequisites

cd backend
source .venv/bin/activate
export DATABASE_URL="your_neon_connection_string"
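Before running anything, it helps to sanity-check that DATABASE_URL at least looks like a Postgres/Neon connection string. A minimal sketch (the accepted schemes are an assumption; your Neon string may differ):

```python
import os
from urllib.parse import urlparse

def looks_like_postgres_url(url: str) -> bool:
    """Rough shape check for a Postgres/Neon connection string."""
    parsed = urlparse(url)
    # Neon strings typically use the postgresql:// scheme and include a host.
    return parsed.scheme in ("postgres", "postgresql") and bool(parsed.hostname)

if __name__ == "__main__":
    url = os.environ.get("DATABASE_URL", "")
    print("DATABASE_URL looks valid" if looks_like_postgres_url(url)
          else "DATABASE_URL missing or malformed")
```

This only validates the shape of the string; the "Database connection failed" section below covers an actual connection test.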

Run Scrapers

Run All Scrapers

python -m waittime.cli.scraper --all

Run Single Province

# Quebec
python -m waittime.cli.scraper --source quebec-msss

# Ontario
python -m waittime.cli.scraper --source ontario-health

# Alberta
python -m waittime.cli.scraper --source alberta-ahs

# British Columbia
python -m waittime.cli.scraper --source bc-phsa

Dry Run (No Database Writes)

python -m waittime.cli.scraper --all --dry-run

List Available Scrapers

python -m waittime.cli.scraper --list
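If you script these invocations (from cron or a wrapper), the flags above combine into a predictable argv. A hypothetical helper, not part of the waittime package, that keeps the commands consistent:

```python
def scraper_command(source=None, run_all=False, dry_run=False, list_only=False):
    """Build the argv for the scraper CLI from the flags documented above.

    source / run_all / dry_run / list_only map onto
    --source / --all / --dry-run / --list respectively.
    """
    cmd = ["python", "-m", "waittime.cli.scraper"]
    if list_only:
        return cmd + ["--list"]
    if run_all:
        cmd.append("--all")
    elif source:
        cmd += ["--source", source]
    else:
        raise ValueError("pass a source, run_all=True, or list_only=True")
    if dry_run:
        cmd.append("--dry-run")
    return cmd
```

For example, `scraper_command(source="quebec-msss", dry_run=True)` reproduces the dry-run invocation for Quebec shown above.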

Check System Health

Heartbeat Status

python -m waittime.cli.check_heartbeat --max-age 120

Check Specific Source

python -m waittime.cli.check_heartbeat --source quebec-msss

Dry Run (No Alerts)

python -m waittime.cli.check_heartbeat --max-age 120 --dry-run

Detailed Operational View (Last Known Good + Last Error)

python -m waittime.cli.check_heartbeat --max-age 120 --dry-run --verbose
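The heartbeat check's core question, "has a source reported within --max-age minutes?", can be sketched as follows (a hypothetical helper illustrating the logic, not the CLI's actual implementation):

```python
from datetime import datetime, timedelta, timezone

def is_stale(last_run_utc, max_age_minutes=120, now=None):
    """True if the source's last run is older than the allowed age.

    last_run_utc: timezone-aware datetime of the last successful run.
    max_age_minutes: staleness threshold (the CLI's --max-age value).
    """
    now = now or datetime.now(timezone.utc)
    return (now - last_run_utc) > timedelta(minutes=max_age_minutes)
```

With the production threshold of 120 minutes, a source that last ran 130 minutes ago is flagged stale; one that ran 30 minutes ago is not.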

Database Queries

Recent Measurements

SELECT
  source_id,
  COUNT(*) AS count,
  MAX(timestamp_utc) AS latest
FROM measurements
WHERE timestamp_utc > NOW() - INTERVAL '1 hour'
GROUP BY source_id;

Scraper Status

SELECT
  source_id,
  last_run,
  status,
  error_message,
  EXTRACT(EPOCH FROM (NOW() - last_run))/60 AS minutes_ago
FROM scraper_status
ORDER BY last_run DESC;

Hospital Visibility

SELECT
  province,
  COUNT(*) AS total,
  COUNT(*) FILTER (WHERE is_visible) AS visible
FROM hospitals
GROUP BY province;

Transfer Alert Triage (Neon)

SELECT
  source_id,
  COUNT(*) AS measurements_24h
FROM measurements
WHERE timestamp_utc > NOW() - INTERVAL '24 hours'
GROUP BY source_id
ORDER BY source_id;
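When reading those per-source counts, a quick way to triage is to compare each against a rough expected baseline (hourly scrapers imply on the order of 24 runs per source per day, scaled by rows per run; the numbers below are placeholders, not real baselines):

```python
def flag_anomalies(counts_24h, expected, tolerance=0.5):
    """Return sources whose 24h measurement count deviates from expectation.

    counts_24h / expected: dicts mapping source_id -> row count.
    tolerance: allowed fractional deviation before flagging (0.5 = 50%).
    """
    flagged = {}
    for source, expected_count in expected.items():
        actual = counts_24h.get(source, 0)  # a missing source counts as 0
        if expected_count and abs(actual - expected_count) / expected_count > tolerance:
            flagged[source] = (actual, expected_count)
    return flagged
```

A source that is flagged low here points at a scraper problem rather than a transfer problem; normal counts with high transfer point at the frontend/API path covered below.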

If counts are normal but transfer is still high:

  • keep the existing scraper/frontend guardrails in place
  • use docs/operations/neon-production-upgrade.md for the current production Launch reference and post-upgrade monitoring posture
  • use docs/operations/scraper-scheduling.md under "Neon Public Transfer Guardrails" for cadence/guardrail review, not as a substitute for production plan monitoring

Current production guardrails already applied on the frontend path:

  • SystemStatus polls /api/health every 5 minutes, and only while the page is visible.
  • Read-heavy anonymous API routes use short cache headers plus a short-lived in-process response cache on the shared VPS runtime.
  • If transfer pressure persists, verify the deployed frontend revision and health before changing scraper cadence.

GitHub Actions

View Workflow Runs

  1. Go to: https://github.com/yourusername/waittimecanada/actions
  2. Select workflow: "Scraper Cron Job" or "Heartbeat Monitor"
  3. Click latest run to view logs

Manual Trigger

  1. Go to Actions tab
  2. Select "Scraper Cron Job"
  3. Click "Run workflow" button
  4. Select branch (usually main)
  5. Click green "Run workflow" button

Common Issues

"No module named 'waittime'"

cd backend
pip install -e .

"Database connection failed"

# Verify DATABASE_URL is set
echo $DATABASE_URL

# Test connection
python -c "import os; from waittime.services import DatabaseService; db = DatabaseService(); print('✅ Connected')"

"Playwright browsers not found"

playwright install chromium

Production Status

Current Schedule:

  • Scrapers run hourly on GitHub Actions
  • Heartbeat checks every 30 minutes
  • Heartbeat stale threshold is 120 minutes
  • All 4 provinces operational

Monitoring:

  • Pushover alerts on failures
  • GitHub Actions logs
  • Database scraper_status table

See Also:

  • Full Operations Guide
  • Methodology documentation lives in backend/docs/methodologies/