diff --git a/homeguard-iot/Dockerfile b/homeguard-iot/Dockerfile
new file mode 100644
index 0000000..3b2dace
--- /dev/null
+++ b/homeguard-iot/Dockerfile
@@ -0,0 +1,23 @@
+# Two-stage build. Produces a small image containing both binaries; the
+# docker-compose file decides which one to run via `command:`.
+
+FROM golang:1.22-alpine AS build
+WORKDIR /src
+
+COPY go.mod go.sum* ./
+RUN go mod download || true
+
+COPY . .
+
+RUN CGO_ENABLED=0 go build -o /out/server    ./cmd/server
+RUN CGO_ENABLED=0 go build -o /out/simulator ./cmd/simulator
+
+FROM gcr.io/distroless/static:nonroot
+WORKDIR /app
+COPY --from=build /out/server    /app/server
+COPY --from=build /out/simulator /app/simulator
+USER nonroot:nonroot
+
+# Use CMD (not ENTRYPOINT) so docker-compose's per-service `command:` fully
+# REPLACES this rather than being appended as argv to the default binary.
+CMD ["/app/server"]
diff --git a/homeguard-iot/README.md b/homeguard-iot/README.md
new file mode 100644
index 0000000..a03dbb5
--- /dev/null
+++ b/homeguard-iot/README.md
@@ -0,0 +1,385 @@
+# HomeGuard IoT — CedarDB Operator Console
+
+![Screenshot of demo running on 192 cores, 384 GB RAM](./view_of_app_on_192_cores.jpg)
+
+A small Go project that simulates an alarm-monitoring company's IoT
+backplane and showcases CedarDB serving both the OLTP-style operator
+console *and* the OLAP-style ingest-rate analytics off the same table at
+the same time.
+
+The pitch: a customer running ~3-4 TB/day of incoming sensor events is
+almost certainly running two stacks today — BigQuery for offline analytics
+plus some streaming pipeline (Pub/Sub + Dataflow + a KV store) for the
+monitoring center operators. This demo collapses both onto a single
+CedarDB instance, with a multi-table normalized schema that lets the
+dashboard run real joins instead of operating over pre-denormalized flat
+tables.
+
+## Talking points for the demo
+
+- This is one database. The operator queue, the live event stream, the
+  drill-down, the footer ingest counters, and the storage growth gauge
+  all read against the same `events` table while the simulator is
+  writing to it at hundreds of thousands of rows/sec — the `-rate` knob
+  goes from 2,500 (gentle demo) to 1,500,000+ (driving toward the
+  3 TB/day target on real hardware).
+
+- Today you'd run BigQuery for the aggregates and Pub/Sub → Dataflow →
+  Bigtable for the operator console. Two pipelines, two SLAs, one ETL
+  step in between with replication lag measured in seconds-to-minutes.
+
+- Notice the joins — Plan tier determines SLA, dispatch center comes
+  from Region, device code comes from the catalog. BigQuery prefers
+  denormalized flat tables; CedarDB does these joins on hot data without
+  flinching.
+
+## What's inside
+
+```
+schema.sql                  -- canonical reference (see internal/db/)
+docker-compose.yml          -- CedarDB + simulator + dashboard
+Dockerfile                  -- builds both Go binaries (CMD, not ENTRYPOINT!)
+cmd/simulator/main.go       -- drives the event stream
+cmd/server/main.go          -- serves the operator dashboard
+internal/db/                -- connection pool + embedded schema bootstrap
+internal/sim/               -- catalog data, fleet synthesis, alert rules
+internal/web/               -- HTTP server, SSE + HTMX, embedded UI assets
+```
+
+## Schema (8 tables, real joins)
+
+```
+plans            (dimension)   monthly_price, sla_seconds
+regions          (dimension)   dispatch center + timezone
+device_types     (dimension)   MOTION, DOOR, SMOKE, CO, WATER, ...
+households       (dimension)   plan_id + region_id; armed/disarmed
+devices          (dimension)   one row per sensor; FK to household + type
+events           (HOT)         100 K+ rows/sec sustained; the firehose
+alerts           (HOT, small)  derived from triggered events via rules
+storage_samples  (telemetry)   (sampled_at, uncompressed_bytes) per
+                               HG_STORAGE_SAMPLER_INTERVAL; drives the
+                               dashboard growth gauge
+```
+
+`events.event_id` is a plain `BIGINT` (not `BIGSERIAL`) — the simulator
+allocates IDs from an in-process `atomic.Int64` so it can keep the hot
+write path on `pgx.CopyFrom`. CedarDB rejects the binary `COPY` frame
+when the destination column carries a sequence default ("unable to
+cast from void to bigint"), and we never wanted the round trip a
+server-side sequence would imply.
+
+Indexes are sized for the join-heavy reads: `alerts (status, raised_at
+DESC)`, `events (household_id, ts DESC)`, `events (kind, ts DESC)`.
+See `internal/db/schema.sql` for the full list.
+
+## Run it (Docker)
+
+```
+docker compose up --build
+# open http://localhost:8080
+```
+
+Three containers come up: `hg-cedardb` (the database), `hg-simulator` (the
+data generator), and `hg-server` (the dashboard). The simulator embeds
+`schema.sql` and applies it automatically on first run.
+
+The simulator logs every DDL statement it runs and prints its target
+event rate; you should see something like:
+
+```
+hg-simulator | … connected to CedarDB
+hg-simulator | … schema-presence probe: households table missing
+hg-simulator | … applying schema: 15 statements
+hg-simulator | …   [ 1/15] CREATE TABLE IF NOT EXISTS plans … — ok
+…
+hg-simulator | … synthesized fleet: 30000 households · 297218 devices
+hg-simulator | … event_id counter seeded at 0
+hg-simulator | … storage sampler: interval=5s
+hg-simulator | … ingestor: batchSize=10000 flushInterval=50ms
+hg-simulator | … simulator running: writers=8 tickHz=10 target=2500 ev/s hb/tick/writer=31 fleet=297218 devices · ingestors=1 batch=10000 flush=50ms
+hg-simulator | … heartbeat: queue=0/64 eventID=24320 delta=2432 rows/s lastCopy=12ms ago copyFails=0
+hg-server    | … dashboard listening on :8080
+```
+
+If you need a clean slate (drop all tables and re-create), the simulator
+takes `-reset-schema`:
+
+```
+docker compose run --rm simulator /app/simulator -reset-schema
+docker compose up
+```
+
+## Dashboard layout
+
+```
+┌────────────────────────────────────────────────────────────────────────────┐
+│ ACTIVE ALERTS · SLA-aware              refresh HG_ALERTS_REFRESH (1s)      │
+│ ┌────────────────────────────────────────────────────────────────────────┐ │
+│ │ SEV · HH · PLAN · REGION / DC · DETAIL · AGE · SLA REMAINING           │ │
+│ │  5  #1024131  Premium  Atlanta DC  SMOKE detected         12 s   18 s  │ │
+│ │  4  #1009823  Plus     Boston DC   GLASS_BREAK            03 s   57 s  │ │
+│ │  …                                                                     │ │
+│ └────────────────────────────────────────────────────────────────────────┘ │
+├──────────────────────┬──────────────────────┬──────────────────────────────┤
+│ LIVE EVENT STREAM    │ CUSTOMER DRILL-DOWN  │ STORAGE GROWTH               │
+│ SSE HG_SSE_INTERVAL  │ HG_DRILLDOWN_REFRESH │ HG_STORAGE_REFRESH (1s)      │
+│ (200 ms)             │ (2s · auto-rotates)  │                              │
+│                      │                      │       ╭───────────╮          │
+│ 15:42:08 SMOKE  kit. │ #1019823  Premium    │       │   25.3    │  MB/s    │
+│ 15:42:07 MOTION hall │ ● ARMED              │       ╰───────────╯  1m avg  │
+│ 15:42:06 DOOR   front│ ──────────────────── │  target 34.7 MB/s · 3 TB/day │
+│ …                    │ 15:42:08 SMOKE kit.  │  total uncompressed 62.0 GB  │
+│                      │ 15:42:01 MOTION hall │  1m 25.3 · 5m 23.1 · 15m …   │
+└──────────────────────┴──────────────────────┴──────────────────────────────┘
+INSERTs: 312,049 ev/sec · 1,251,514,277 total · 1,270,594 active / 2,022,994 alerts
+                              one table, five concurrent reads · CedarDB · SQL queries↗
+```
+
+Each panel is driven by a different query against the same `events`,
+`alerts`, and `storage_samples` tables the simulator is still writing
+into. Cadences are configurable per-panel — see *Tuning at runtime*
+below.
+
+## The queries that matter
+
+Hit `http://localhost:8080/static/queries.html` once the dashboard is up
+for a syntax-highlighted reference of every SQL statement the app runs,
+where it lives in the code, and how often it fires. The three to draw
+attention to during a demo:
+
+**Active-alerts queue (joins 4 tables, refreshes every `HG_ALERTS_REFRESH`):**
+
+```sql
+SELECT a.alert_id, a.severity, a.detail,
+       EXTRACT(EPOCH FROM (now() - a.raised_at))::int AS age_s,
+       p.sla_seconds,
+       p.sla_seconds - EXTRACT(EPOCH FROM (now() - a.raised_at))::int AS sla_remaining,
+       h.household_id, h.address_hash,
+       p.name AS plan_name,
+       r.name AS region_name, r.dispatch_center
+FROM   alerts a
+JOIN   households h ON h.household_id = a.household_id
+JOIN   plans      p ON p.plan_id      = h.plan_id
+JOIN   regions    r ON r.region_id    = h.region_id
+WHERE  a.status = 'active'
+ORDER  BY a.severity DESC, a.raised_at ASC
+LIMIT  25
+```
+
+**Live event stream (joins 5 tables, server-pushed every `HG_SSE_INTERVAL`):**
+
+```sql
+SELECT e.event_id, e.ts, e.household_id, h.address_hash,
+       dt.code, d.location, e.kind, e.severity,
+       COALESCE(e.battery_pct, -1), r.name
+FROM   events e
+JOIN   devices       d  ON d.device_id      = e.device_id
+JOIN   device_types  dt ON dt.device_type_id = d.device_type_id
+JOIN   households    h  ON h.household_id   = e.household_id
+JOIN   regions       r  ON r.region_id      = h.region_id
+WHERE  e.kind > 0
+ORDER  BY e.ts DESC
+LIMIT  25
+```
+
+**Live ingest rate (the meta-query, refreshes every `HG_STATS_REFRESH`):**
+
+The footer's events/sec and total-events counters come from
+`storage_samples` — *not* from a `COUNT(*)` over a billion-row events
+table. The simulator writes `(now(), 48 × eventID)` into
+`storage_samples` every `HG_STORAGE_SAMPLER_INTERVAL` from its
+in-process atomic counter, so the two most recent rows give both the
+size and the rate without ever scanning the hot table.
+
+```sql
+SELECT
+    (SELECT uncompressed_bytes FROM storage_samples
+         ORDER BY sampled_at DESC LIMIT 1) AS latest_bytes,
+    (SELECT sampled_at         FROM storage_samples
+         ORDER BY sampled_at DESC LIMIT 1) AS latest_ts,
+    (SELECT uncompressed_bytes FROM storage_samples
+         WHERE sampled_at < (SELECT MAX(sampled_at) FROM storage_samples)
+         ORDER BY sampled_at DESC LIMIT 1) AS prior_bytes,
+    (SELECT sampled_at         FROM storage_samples
+         WHERE sampled_at < (SELECT MAX(sampled_at) FROM storage_samples)
+         ORDER BY sampled_at DESC LIMIT 1) AS prior_ts;
+-- total_events = latest_bytes / 48
+-- rows_per_sec = (latest_bytes - prior_bytes) / dt / 48
+```
+
+The published `rows_per_sec` is therefore an average over the sampler
+interval (default 5 s), not a strict 1-second window.
+
+## Knobs
+
+```
+docker compose run --rm simulator /app/simulator \
+    -households 50000 \
+    -devices-per-household 12 \
+    -rate 5000 \
+    -hz 20 \
+    -writers 8
+```
+
+`-rate` sets the sustained events/sec target (heartbeats + triggered).
+`-hz` sets the simulator's tick rate; higher Hz = smoother bursts but
+more network round-trips. `-writers` is the number of producer
+goroutines that partition the device fleet and generate rows in
+parallel; they hand off to a single ingestor goroutine that runs
+`CopyFrom` against CedarDB. The fleet sizing knobs control how many
+`households` and `devices` rows get synthesized on startup.
+
+## Tuning at runtime
+
+Most dashboard cadences and a few pipeline knobs are configurable via
+`HG_*` environment variables in `docker-compose.yml`. Defaults match
+the original hardcoded values, so leaving everything unset preserves
+the standard demo behaviour. All durations are Go duration strings
+(`200ms`, `1s`, `30s`, `1m30s`).
+
+### Dashboard refresh cadences (server container)
+
+| Env var | Default | UI region |
+|---|---|---|
+| `HG_SSE_INTERVAL` | `200ms` | **Live Event Stream** panel (bottom-left). Server-side SSE push rate — the browser holds one long-lived connection and the server pushes a frame at this interval. The "last frame" timestamp in the bottom-right of the footer tracks this one. |
+| `HG_ALERTS_REFRESH` | `1s` | **Active Alerts** queue (top, full-width). htmx polls `/api/alerts`. |
+| `HG_DRILLDOWN_REFRESH` | `2s` | **Customer Drill-Down** (bottom-middle). htmx polls `/api/drilldown`. This runs three SQL queries per tick (pick highest-severity household → fetch its plan/region header → fetch its last 20 events), so it's the biggest cost-per-tick and the most useful one to dial down if dashboard load is competing with ingest. |
+| `HG_STATS_REFRESH` | `1s` | **Footer counters** *and* the **header counters** ("Active alerts N · Events ingested N"). JS polls `/api/stats`. The only env var that updates two regions at once, so slowing it down has the most visible effect. |
+| `HG_STORAGE_REFRESH` | `1s` | **Storage Growth** gauge (bottom-right): the arc, the 1m/5m/15m rate table, the projected-daily figure. The cheapest poll of the bunch — it just reads `storage_samples`, a few hundred rows — so there's rarely a reason to raise this. |
+
+### Pipeline knobs (simulator container)
+
+| Env var | Default | Effect |
+|---|---|---|
+| `HG_STORAGE_SAMPLER_INTERVAL` | `5s` | How often the simulator writes a row into `storage_samples`. Doesn't drive any UI poll directly, but sets the minimum window over which the gauge can compute a rate — set this to `30s` and the dashboard's 1m/5m/15m rates become 30-second moving averages. Cheap to leave at `5s`. |
+| `HG_INGEST_BATCH` | `10000` | Rows the ingestor accumulates before firing one `pgx.CopyFrom`. Bigger → better per-COPY amortization; smaller → lower latency to the live event stream. Watch the heartbeat `lastCopy` and `delta` numbers when tuning. |
+| `HG_INGESTORS` | `1` | Number of ingestor goroutines running `CopyFrom` in parallel. Default `1` is safe everywhere — older CedarDB rejected overlapping COPYs with `SQLSTATE 40P01`; newer versions accept concurrent COPYs, in which case `2`, `4`, `8` may give a meaningful throughput bump. If the heartbeat's `queue=CAP/CAP` ceiling drops as you raise this, the single ingestor was the bottleneck and CedarDB can absorb more parallelism. If the queue stays pegged, CedarDB is serializing the work server-side and adding more ingestors won't help. |
+| `HG_RESOLVE_INTERVAL` | `2s` | How often the background alert resolver fires. |
+| `HG_RESOLVE_AUTOTUNE` | `true` | When on, the resolver picks each tick's LIMIT as `max(floor, ceil(deltaFired × 1.2 + backlog × 0.01))`, capped at 100 K. `deltaFired` and `backlog` are tracked via in-process atomic counters — no `COUNT(*)` on the alerts table needed. Set to `false` to pin the limits at their floors (handy when you want to demo what happens to AGE/SLA as a backlog grows). |
+| `HG_RESOLVE_LOW_LIMIT` | `2000` | **Floor** for the per-tick low-severity (1-2) resolver, which marks alerts `false_alarm`. Auto-tune can push higher; without auto-tune this is just the limit. |
+| `HG_RESOLVE_HIGH_LIMIT` | `600` | **Floor** for the per-tick high-severity (3+) resolver, which marks alerts `resolved`. High-severity alerts must also have aged at least 20 seconds to be eligible, so they sit in the operator queue long enough to look like real triage work. |
+
+### Reading the simulator heartbeat
+
+Once per second the simulator logs a one-line status report you can use
+to triage ingest behaviour. Tail it with `docker logs hg-simulator`:
+
+```
+heartbeat: queue=12/64 eventID=1483920475 delta=748520 rows/s lastCopy=12ms ago copyFails=0
+```
+
+- **`queue=N/CAP`** — depth of the producer→ingestor channel. Near `CAP` means CedarDB is the bottleneck.
+- **`eventID`** — monotonic atomic counter; each row generated bumps it. Doubles as a precise total-rows figure.
+- **`delta=N rows/s`** — row generation rate computed from the eventID counter.
+- **`lastCopy=Xms ago`** — wall time since the most recent successful `CopyFrom`. Should be under one second when ingest is healthy.
+- **`copyFails=N`** — cumulative `CopyFrom` errors since startup. Non-zero means CedarDB is rejecting; the error text appears on the line above the heartbeat.
+
+Three patterns worth recognising:
+
+- `delta=0 queue=0` → producers stopped. Look for goroutine panics in the simulator log.
+- `delta=N queue=CAP lastCopy growing` → CedarDB stopped accepting writes. Check `hg-cedardb` logs, disk space, and the compactor.
+- `delta=N queue=CAP lastCopy<1s copyFails=0` → healthy steady-state at the write-path cap. To push past it, try larger `HG_INGEST_BATCH` and/or more `HG_INGESTORS`, or move to bigger hardware.
+
+The resolver emits its own status line once every 10 seconds:
+
+```
+resolver: drain low=42/2000 high=3187/3825 · backlog low=0 high=14
+```
+
+`drain low=X/Y` is "Y rows of low-severity LIMITed, X actually resolved" — when X < Y the queue is empty for that tier; when X = Y the resolver is at the cap. `backlog` is the in-process `(fired − resolved)` estimate; if it climbs steadily, auto-tune is falling behind (rare — the controller's 1% backlog decay is normally enough to keep up).
+
+## Scaling up
+
+The defaults in `docker-compose.yml` are sized for a developer laptop
+(~10 cores, 16–32 GB RAM). On real demo hardware — for example a
+192-core x86 box with 384 GB RAM — there's a lot of headroom that the
+laptop config simply can't use. A reasonable starting point on a box
+like that:
+
+```yaml
+simulator:
+  command: ["/app/simulator",
+    "-households=200000",
+    "-devices-per-household=12",
+    "-rate=2000000",        # 2 M ev/s ≈ 96 MB/s ≈ 8 TB/day uncompressed
+    "-hz=20",
+    "-writers=64"]
+  environment:
+    HG_INGESTORS:    "8"        # parallel CopyFroms (requires recent CedarDB)
+    HG_INGEST_BATCH: "50000"    # bigger batches amortise the COPY round trip
+    HG_STORAGE_SAMPLER_INTERVAL: "5s"
+    # Alert generation scales with -writers, but the resolver auto-tunes
+    # its LIMITs each tick from the in-process generation rate and
+    # backlog, so no manual sizing is needed when you bump -writers.
+    # The defaults stay fine. If you'd rather see the queue grow (to
+    # demo SLA breach), set HG_RESOLVE_AUTOTUNE=false.
+```
+
+The pool's `MaxConns=64` in `internal/db/db.go` will need to grow if
+`-writers` × `HG_INGESTORS` plus the dashboard's concurrent reads
+exceed it — roughly speaking, set it to `writers + ingestors + 16`.
+
+The heartbeat is your scoreboard while you push the dial up:
+
+- **`delta` rises and `queue` no longer pegs at CAP** as you raise
+  `HG_INGESTORS` → the single ingestor was the bottleneck and CedarDB
+  can absorb more parallel COPYs.
+- **`delta` is flat regardless of `HG_INGESTORS`** → CedarDB is
+  serializing the work internally; the next move is `HG_INGEST_BATCH`
+  or CedarDB-side tuning.
+- **`copyFails > 0`** → CedarDB is rejecting. The error message on the
+  preceding log line tells you why (most likely you've raised
+  `HG_INGESTORS` past what your CedarDB version allows and gone back
+  to the 40P01 territory).
+
+For the dashboard side at higher rates, keep the read cadences honest
+about what the queries cost:
+
+```yaml
+server:
+  environment:
+    HG_SSE_INTERVAL:      "200ms"  # joins 5 tables, kind > 0 filter
+    HG_ALERTS_REFRESH:    "1s"     # joins 4 tables on the small alerts table
+    HG_DRILLDOWN_REFRESH: "2s"     # three queries per tick; the costliest
+    HG_STATS_REFRESH:     "1s"     # reads storage_samples only — cheap
+    HG_STORAGE_REFRESH:   "1s"     # reads storage_samples only — cheap
+```
+
+On a 192-core box these are easily affordable; on a laptop you'll
+want to crank them up to give CedarDB more headroom for the write
+path.
+
+### Which indexes to keep
+
+There's a real tension between write throughput and dashboard read
+latency, but it isn't symmetric across tables:
+
+| Table | Indexes? | Why |
+|---|---|---|
+| `alerts` | **Keep them.** | The table is in the millions, never billions; index maintenance is trivial. *All four read paths* on the dashboard (queue, drilldown via household, resolver inner SELECT) need them. Without an `(status, raised_at DESC)` index the active-alerts query becomes a multi-million-row scan that the resolver fights with every 2 s, and you'll see the panel intermittently render "no active alerts" because the iteration timed out mid-scan. |
+| `events` | **Optional.** Drop if you want to maximise write rate. | One index entry per inserted row at ~30 K/s is real write cost; the dashboard's events queries are all small-LIMIT scans of the recent tail and CedarDB's column store handles them respectably even without the index. The trade-off is that the SSE event-stream panel and the drill-down's per-household scan will be slower at very large table sizes — usually still tolerable. |
+
+If you dropped the alerts indexes during a write-throughput experiment,
+put them back before treating the dashboard as canonical:
+
+```sql
+CREATE INDEX alerts_status_raised_idx    ON alerts (status, raised_at DESC);
+CREATE INDEX alerts_household_raised_idx ON alerts (household_id, raised_at DESC);
+```
+
+### When a panel intermittently shows empty
+
+The dashboard handlers now log `rows.Err()` after each iteration, so a
+mid-scan context cancellation no longer looks identical to an empty
+result. If you see "no active alerts" in the panel, check the server
+container's logs:
+
+```
+docker logs hg-server 2>&1 | grep "rows.Err"
+```
+
+A non-empty stream of `context canceled` or `deadline exceeded` lines
+means the underlying SQL is taking long enough that requests are
+aborting before it returns. The fix is almost always either to add the
+missing index for that query or to lower the polling frequency.
+
diff --git a/homeguard-iot/build.sh b/homeguard-iot/build.sh
new file mode 100755
index 0000000..7e63092
--- /dev/null
+++ b/homeguard-iot/build.sh
@@ -0,0 +1,5 @@
+#!/bin/bash
+
+CGO_ENABLED=0 go build -o ./out/server ./cmd/server
+CGO_ENABLED=0 go build -o ./out/simulator ./cmd/simulator
+
diff --git a/homeguard-iot/clean.sh b/homeguard-iot/clean.sh
new file mode 100755
index 0000000..dd0277e
--- /dev/null
+++ b/homeguard-iot/clean.sh
@@ -0,0 +1,5 @@
+#!/bin/bash
+
+rm -f ./out/server
+rm -f ./out/simulator
+
diff --git a/homeguard-iot/cmd/server/main.go b/homeguard-iot/cmd/server/main.go
new file mode 100644
index 0000000..de2a764
--- /dev/null
+++ b/homeguard-iot/cmd/server/main.go
@@ -0,0 +1,52 @@
+// Command server runs the HomeGuard IoT operator dashboard.
+package main
+
+import (
+	"context"
+	"flag"
+	"log"
+	"net/http"
+	"os/signal"
+	"syscall"
+	"time"
+
+	"github.com/cedardb-demo/homeguard-iot/internal/db"
+	"github.com/cedardb-demo/homeguard-iot/internal/web"
+)
+
+func main() {
+	addr := flag.String("addr", ":8080", "http listen address")
+	flag.Parse()
+
+	ctx, cancel := signal.NotifyContext(context.Background(),
+		syscall.SIGINT, syscall.SIGTERM)
+	defer cancel()
+
+	pool, err := db.Connect(ctx)
+	if err != nil {
+		log.Fatalf("db: %v", err)
+	}
+	defer pool.Close()
+
+	srv, err := web.NewServer(pool)
+	if err != nil {
+		log.Fatalf("server: %v", err)
+	}
+
+	httpSrv := &http.Server{
+		Addr:              *addr,
+		Handler:           srv.Routes(),
+		ReadHeaderTimeout: 5 * time.Second,
+	}
+	go func() {
+		<-ctx.Done()
+		shutdown, c := context.WithTimeout(context.Background(), 5*time.Second)
+		defer c()
+		_ = httpSrv.Shutdown(shutdown)
+	}()
+
+	log.Printf("dashboard listening on %s", *addr)
+	if err := httpSrv.ListenAndServe(); err != nil && err != http.ErrServerClosed {
+		log.Fatalf("listen: %v", err)
+	}
+}
diff --git a/homeguard-iot/cmd/simulator/main.go b/homeguard-iot/cmd/simulator/main.go
new file mode 100644
index 0000000..ff3571b
--- /dev/null
+++ b/homeguard-iot/cmd/simulator/main.go
@@ -0,0 +1,70 @@
+// Command simulator generates the IoT event stream and writes it to
+// CedarDB. Run it once per demo session; run cmd/server in parallel for
+// the dashboard.
+package main
+
+import (
+	"context"
+	"flag"
+	"log"
+	"os"
+	"os/signal"
+	"syscall"
+
+	"github.com/cedardb-demo/homeguard-iot/internal/db"
+	"github.com/cedardb-demo/homeguard-iot/internal/sim"
+)
+
+func main() {
+	householdCount := flag.Int("households", 30000, "synthesized household fleet size")
+	devicesPerHH := flag.Int("devices-per-household", 10, "avg devices per household")
+	tickHz := flag.Int("hz", 10, "simulator tick rate (events flush each tick)")
+	rate := flag.Int("rate", 2500, "sustained events/sec target (heartbeats + triggered)")
+	writers := flag.Int("writers", 8,
+		"number of parallel writer goroutines; each owns a slice of the device fleet "+
+			"and its own pgx conn. Bump alongside -rate to scale ingest toward 3 TB/day.")
+	resetSchema := flag.Bool("reset-schema", false,
+		"drop & recreate all tables on startup — destroys all data")
+	flag.Parse()
+
+	ctx, cancel := signal.NotifyContext(context.Background(),
+		os.Interrupt, syscall.SIGTERM)
+	defer cancel()
+
+	pool, err := db.Connect(ctx)
+	if err != nil {
+		log.Fatalf("db: %v", err)
+	}
+	defer pool.Close()
+	log.Printf("connected to CedarDB")
+
+	if *resetSchema {
+		log.Printf("-reset-schema set: wiping all data and re-creating tables")
+		if err := db.ResetSchema(ctx, pool); err != nil {
+			log.Fatalf("reset schema: %v", err)
+		}
+	} else {
+		present, err := db.SchemaPresent(ctx, pool)
+		switch {
+		case err != nil:
+			log.Printf("schema-presence probe failed (continuing anyway): %v", err)
+		case present:
+			log.Printf("schema-presence probe: households table already present")
+		default:
+			log.Printf("schema-presence probe: households table missing")
+      if err := db.ApplySchema(ctx, pool); err != nil {
+			  log.Fatalf("apply schema: %v", err)
+		  }
+		}
+	}
+
+	s, err := sim.New(ctx, pool, *householdCount, *devicesPerHH, *tickHz, *rate, *writers)
+	if err != nil {
+		log.Fatalf("simulator setup: %v", err)
+	}
+
+	if err := s.Run(ctx); err != nil && ctx.Err() == nil {
+		log.Fatalf("simulator run: %v", err)
+	}
+	log.Printf("simulator shut down cleanly")
+}
diff --git a/homeguard-iot/db_error.txt b/homeguard-iot/db_error.txt
new file mode 100644
index 0000000..ee2a2a2
--- /dev/null
+++ b/homeguard-iot/db_error.txt
@@ -0,0 +1,25 @@
+hg-cedardb    | 2026-05-19 19:58:46.272801386 UTC    DEBUG1:  connection 1084378400 terminated
+hg-simulator  | 2026/05/19 19:58:48 simulator running: tickHz=10  target=2500 ev/s  per-tick=250  fleet=285028 devices
+hg-simulator  | 2026/05/19 19:58:49 len(rows): 254
+hg-cedardb    | 2026-05-19 19:58:49.038975971 UTC    ERROR:   unable to cast from void to bigint
+hg-cedardb    | Input data does not match the expected type or input format. Docs: https://cedardb.com/docs/references/datatypes/
+hg-cedardb    | 2026-05-19 19:58:49.039580971 UTC    ERROR:   invalid message in simple query mode
+hg-cedardb    | 2026-05-19 19:58:49.039624013 UTC    ERROR:   invalid message in simple query mode
+hg-simulator  | 2026/05/19 19:58:49 copy events: ERROR: unable to cast from void to bigint (SQLSTATE 42804)
+hg-simulator  | 2026/05/19 19:58:49 insert alert: ERROR: invalid message in simple query mode (SQLSTATE 08P01)
+hg-simulator  | 2026/05/19 19:58:49 insert alert: ERROR: invalid message in simple query mode (SQLSTATE 08P01)
+hg-simulator  | 2026/05/19 19:58:49 len(rows): 256
+hg-cedardb    | 2026-05-19 19:58:49.149095304 UTC    ERROR:   unable to cast from void to bigint
+hg-cedardb    | Input data does not match the expected type or input format. Docs: https://cedardb.com/docs/references/datatypes/
+hg-simulator  | panic: runtime error: index out of range [0] with length 0
+hg-simulator  |
+hg-simulator  | goroutine 441 [running]:
+hg-simulator  | github.com/jackc/pgx/v5.(*copyFrom).buildCopyBuf(0x40002030e0, {0x400009c400?, 0x0?, 0x0?}, 0x400243e280)
+hg-simulator  |     /go/pkg/mod/github.com/jackc/pgx/v5@v5.6.0/copy_from.go:235 +0x2f4
+hg-simulator  | github.com/jackc/pgx/v5.(*copyFrom).run.func1()
+hg-simulator  |     /go/pkg/mod/github.com/jackc/pgx/v5@v5.6.0/copy_from.go:177 +0x1c4
+hg-simulator  | created by github.com/jackc/pgx/v5.(*copyFrom).run in goroutine 1
+hg-simulator  |     /go/pkg/mod/github.com/jackc/pgx/v5@v5.6.0/copy_from.go:164 +0x3c0
+hg-cedardb    | 2026-05-19 19:58:49.158704471 UTC    LOG:     recv: Connection reset by peer
+hg-cedardb    | 2026-05-19 19:58:49.158726721 UTC    ERROR:   unable to read from client
+
diff --git a/homeguard-iot/docker-compose.yml b/homeguard-iot/docker-compose.yml
new file mode 100644
index 0000000..9763aa2
--- /dev/null
+++ b/homeguard-iot/docker-compose.yml
@@ -0,0 +1,106 @@
+# Brings up CedarDB alongside the Go simulator and operator dashboard.
+#
+# IMPORTANT: the CedarDB image name and env vars below mirror the Postgres
+# image conventions. If your CedarDB image differs, adjust `image:` and the
+# DATABASE_URL accordingly — the rest of the demo is pure Postgres wire
+# protocol and should not need to change.
+
+services:
+  cedardb:
+    image: cedardb/cedardb:latest
+    container_name: hg-cedardb
+    ports:
+      - "5432:5432"
+    environment:
+      CEDAR_PASSWORD: postgres
+      VERBOSITY: DEBUG1
+      LICENSE_KEY:
+    # Schema bootstrap is handled by the simulator (it embeds schema.sql via
+    # //go:embed and applies it on first run), so we do not mount any init
+    # scripts here.
+    volumes:
+      - cedar-data:/var/lib/cedardb
+    healthcheck:
+      test: ["CMD-SHELL", "pg_isready -U postgres -d homeguard || exit 1"]
+      interval: 3s
+      timeout: 2s
+      retries: 30
+
+  simulator:
+    build: .
+    container_name: hg-simulator
+    #command: ["/app/simulator", "-households=100000", "-devices-per-household=10", "-rate=1750000", "-hz=20", "-writers=32"]
+    #command: ["/app/simulator", "-households=100000", "-devices-per-household=10", "-rate=500000", "-hz=20", "-writers=16"]
+    command: ["/app/simulator", "-households=100000", "-devices-per-household=10", "-rate=600000", "-hz=20", "-writers=16"]
+    #command: ["/app/simulator", "-households=30000", "-devices-per-household=10", "-rate=2500", "-hz=10", "-writers=4"]
+    
+    depends_on:
+      cedardb:
+        condition: service_healthy
+    environment:
+      DATABASE_URL: "postgresql://postgres:postgres@cedardb:5432/postgres?sslmode=require"
+
+      # How often the storage sampler writes a row into storage_samples.
+      # Lower values → fresher gauge, more INSERTs into storage_samples.
+      # Value is any Go duration string (e.g. "5s", "30s", "1m").
+      HG_STORAGE_SAMPLER_INTERVAL: "5s"
+
+      # How many rows the ingestor accumulates before firing one
+      # pgx.CopyFrom. Bigger → better per-COPY amortization and
+      # (sometimes) higher throughput; smaller → lower end-to-end
+      # latency on the live event stream. Watch the heartbeat
+      # `lastCopy` and `delta` numbers when tuning.
+      HG_INGEST_BATCH: "25000"
+
+      # How many ingestor goroutines run CopyFrom in parallel. Default
+      # 1 is safe everywhere. Older CedarDB rejected overlapping COPYs
+      # with SQLSTATE 40P01; newer versions accept concurrent COPYs,
+      # in which case cranking this up (try 2, 4, 8) may give a
+      # meaningful throughput bump. If the heartbeat's queue=64/64
+      # ceiling disappears as you raise this, the previous bottleneck
+      # was the single ingestor (and CedarDB can take more); if queue
+      # stays pegged, CedarDB is serializing the work internally and
+      # adding ingestors won't help.
+      HG_INGESTORS: "4"
+
+      # Alert resolver. The background resolveAlertsLoop picks up the
+      # oldest active alerts every HG_RESOLVE_INTERVAL and marks them
+      # resolved/false_alarm.
+      #
+      # By default the resolver auto-tunes its per-tick LIMITs from
+      # the in-process alert generation rate and backlog (no DB-side
+      # COUNT(*) required), so the queue stays bounded as -writers
+      # and -rate scale. The two LIMIT vars are now FLOORS — auto-tune
+      # can push higher but never below them. Set HG_RESOLVE_AUTOTUNE
+      # to false to disable auto-tune (useful if you want to demo what
+      # happens to AGE/SLA when the queue grows on purpose).
+      # HG_RESOLVE_INTERVAL:   "2s"
+      # HG_RESOLVE_LOW_LIMIT:  "2000"  # floor for severity 1-2 → false_alarm
+      # HG_RESOLVE_HIGH_LIMIT: "600"   # floor for severity 3+  → resolved
+      # HG_RESOLVE_AUTOTUNE:   "true"  # auto-grow LIMITs to match gen+backlog
+    restart: on-failure
+
+  server:
+    build: .
+    container_name: hg-server
+    command: ["/app/server", "-addr=:8080"]
+    depends_on:
+      cedardb:
+        condition: service_healthy
+    environment:
+      DATABASE_URL: "postgresql://postgres:postgres@cedardb:5432/postgres?sslmode=require"
+      # Dashboard polling cadences — all Go duration strings. Crank up
+      # when you want to ease the read load CedarDB has to serve next to
+      # the simulator's writes.
+      HG_SSE_INTERVAL: "1s"      # live event stream push rate
+      HG_ALERTS_REFRESH: "5s"    # active-alerts queue panel
+      HG_DRILLDOWN_REFRESH: "5s" # customer drill-down panel
+      HG_STATS_REFRESH: "1s"     # footer ingest counters
+      HG_STORAGE_REFRESH: "1s"   # storage growth gauge
+    ports:
+      - "8080:8080"
+    restart: on-failure
+
+volumes:
+  cedar-data:
+
diff --git a/homeguard-iot/docker_compose_run.sh b/homeguard-iot/docker_compose_run.sh
new file mode 100755
index 0000000..9ed1daa
--- /dev/null
+++ b/homeguard-iot/docker_compose_run.sh
@@ -0,0 +1,4 @@
+#!/bin/bash
+
+docker compose up --build
+
diff --git a/homeguard-iot/docker_rm_images.sh b/homeguard-iot/docker_rm_images.sh
new file mode 100755
index 0000000..84acf4a
--- /dev/null
+++ b/homeguard-iot/docker_rm_images.sh
@@ -0,0 +1,4 @@
+#!/bin/bash
+
+docker image rm homeguard-iot-server:latest homeguard-iot-simulator:latest
+
diff --git a/homeguard-iot/docker_rm_volume.sh b/homeguard-iot/docker_rm_volume.sh
new file mode 100755
index 0000000..f045902
--- /dev/null
+++ b/homeguard-iot/docker_rm_volume.sh
@@ -0,0 +1,4 @@
+#!/bin/bash
+
+docker compose down -v
+
diff --git a/homeguard-iot/go.mod b/homeguard-iot/go.mod
new file mode 100644
index 0000000..e29acf4
--- /dev/null
+++ b/homeguard-iot/go.mod
@@ -0,0 +1,14 @@
+module github.com/cedardb-demo/homeguard-iot
+
+go 1.22
+
+require github.com/jackc/pgx/v5 v5.6.0
+
+require (
+	github.com/jackc/pgpassfile v1.0.0 // indirect
+	github.com/jackc/pgservicefile v0.0.0-20240606120523-5a60cdf6a761 // indirect
+	github.com/jackc/puddle/v2 v2.2.1 // indirect
+	golang.org/x/crypto v0.27.0 // indirect
+	golang.org/x/sync v0.8.0 // indirect
+	golang.org/x/text v0.18.0 // indirect
+)
diff --git a/homeguard-iot/go.sum b/homeguard-iot/go.sum
new file mode 100644
index 0000000..b9bb4b2
--- /dev/null
+++ b/homeguard-iot/go.sum
@@ -0,0 +1,28 @@
+github.com/davecgh/go-spew v1.1.0/go.mod h1:J7Y8YcW2NihsgmVo/mv3lAwl/skON4iLHjSsI+c5H38=
+github.com/davecgh/go-spew v1.1.1 h1:vj9j/u1bqnvCEfJOwUhtlOARqs3+rkHYY13jYWTU97c=
+github.com/davecgh/go-spew v1.1.1/go.mod h1:J7Y8YcW2NihsgmVo/mv3lAwl/skON4iLHjSsI+c5H38=
+github.com/jackc/pgpassfile v1.0.0 h1:/6Hmqy13Ss2zCq62VdNG8tM1wchn8zjSGOBJ6icpsIM=
+github.com/jackc/pgpassfile v1.0.0/go.mod h1:CEx0iS5ambNFdcRtxPj5JhEz+xB6uRky5eyVu/W2HEg=
+github.com/jackc/pgservicefile v0.0.0-20240606120523-5a60cdf6a761 h1:iCEnooe7UlwOQYpKFhBabPMi4aNAfoODPEFNiAnClxo=
+github.com/jackc/pgservicefile v0.0.0-20240606120523-5a60cdf6a761/go.mod h1:5TJZWKEWniPve33vlWYSoGYefn3gLQRzjfDlhSJ9ZKM=
+github.com/jackc/pgx/v5 v5.6.0 h1:SWJzexBzPL5jb0GEsrPMLIsi/3jOo7RHlzTjcAeDrPY=
+github.com/jackc/pgx/v5 v5.6.0/go.mod h1:DNZ/vlrUnhWCoFGxHAG8U2ljioxukquj7utPDgtQdTw=
+github.com/jackc/puddle/v2 v2.2.1 h1:RhxXJtFG022u4ibrCSMSiu5aOq1i77R3OHKNJj77OAk=
+github.com/jackc/puddle/v2 v2.2.1/go.mod h1:vriiEXHvEE654aYKXXjOvZM39qJ0q+azkZFrfEOc3H4=
+github.com/pmezard/go-difflib v1.0.0 h1:4DBwDE0NGyQoBHbLQYPwSUPoCMWR5BEzIk/f1lZbAQM=
+github.com/pmezard/go-difflib v1.0.0/go.mod h1:iKH77koFhYxTK1pcRnkKkqfTogsbg7gZNVY4sRDYZ/4=
+github.com/stretchr/objx v0.1.0/go.mod h1:HFkY916IF+rwdDfMAkV7OtwuqBVzrE8GR6GFx+wExME=
+github.com/stretchr/testify v1.3.0/go.mod h1:M5WIy9Dh21IEIfnGCwXGc5bZfKNJtfHm1UVUgZn+9EI=
+github.com/stretchr/testify v1.7.0/go.mod h1:6Fq8oRcR53rry900zMqJjRRixrwX3KX962/h/Wwjteg=
+github.com/stretchr/testify v1.8.1 h1:w7B6lhMri9wdJUVmEZPGGhZzrYTPvgJArz7wNPgYKsk=
+github.com/stretchr/testify v1.8.1/go.mod h1:w2LPCIKwWwSfY2zedu0+kehJoqGctiVI29o6fzry7u4=
+golang.org/x/crypto v0.27.0 h1:GXm2NjJrPaiv/h1tb2UH8QfgC/hOf/+z0p6PT8o1w7A=
+golang.org/x/crypto v0.27.0/go.mod h1:1Xngt8kV6Dvbssa53Ziq6Eqn0HqbZi5Z6R0ZpwQzt70=
+golang.org/x/sync v0.8.0 h1:3NFvSEYkUoMifnESzZl15y791HH1qU2xm6eCJU5ZPXQ=
+golang.org/x/sync v0.8.0/go.mod h1:Czt+wKu1gCyEFDUtn0jG5QVvpJ6rzVqr5aXyt9drQfk=
+golang.org/x/text v0.18.0 h1:XvMDiNzPAl0jr17s6W9lcaIhGUfUORdGCNsuLmPG224=
+golang.org/x/text v0.18.0/go.mod h1:BuEKDfySbSR4drPmRPG/7iBdf8hvFMuRexcpahXilzY=
+gopkg.in/check.v1 v0.0.0-20161208181325-20d25e280405/go.mod h1:Co6ibVJAznAaIkqp8huTwlJQCZ016jof/cbN4VW5Yz0=
+gopkg.in/yaml.v3 v3.0.0-20200313102051-9f266ea9e77c/go.mod h1:K4uyk7z7BCEPqu6E+C64Yfv1cQ7kz7rIZviUmN+EgEM=
+gopkg.in/yaml.v3 v3.0.1 h1:fxVm/GzAzEWqLHuvctI91KS9hhNmmWOoWu0XTYJS7CA=
+gopkg.in/yaml.v3 v3.0.1/go.mod h1:K4uyk7z7BCEPqu6E+C64Yfv1cQ7kz7rIZviUmN+EgEM=
diff --git a/homeguard-iot/internal/db/db.go b/homeguard-iot/internal/db/db.go
new file mode 100644
index 0000000..00dbf99
--- /dev/null
+++ b/homeguard-iot/internal/db/db.go
@@ -0,0 +1,62 @@
+// Package db is a thin wrapper around the pgx connection pool to CedarDB.
+//
+// CedarDB speaks the Postgres wire protocol, so pgx works unmodified — only
+// the DSN is supplied differently. The pool is sized generously so the
+// simulator's writer goroutines (each holding a conn for the duration of
+// a CopyFrom) don't compete with the dashboard's concurrent reads or with
+// the alert resolution / armed-state shuffle background jobs.
+package db
+
+import (
+	"context"
+	"fmt"
+	"os"
+	"time"
+
+	"github.com/jackc/pgx/v5"
+	"github.com/jackc/pgx/v5/pgxpool"
+)
+
+// Connect dials CedarDB using DATABASE_URL from the environment.
+//
+//	postgres://USER:PASS@HOST:5432/DB?sslmode=disable
+func Connect(ctx context.Context) (*pgxpool.Pool, error) {
+	dsn := os.Getenv("DATABASE_URL")
+	if dsn == "" {
+		return nil, fmt.Errorf("DATABASE_URL is not set")
+	}
+
+	cfg, err := pgxpool.ParseConfig(dsn)
+	if err != nil {
+		return nil, fmt.Errorf("parse DATABASE_URL: %w", err)
+	}
+	// Sized for the high-rate demo: up to ~16 simulator writer goroutines
+	// each holding a long-lived pgx conn during CopyFrom, plus the alert
+	// resolver, the armed-state shuffler, the inline alert inserters, and
+	// the dashboard's concurrent read paths.
+	cfg.MaxConns = 64
+	cfg.MinConns = 4
+	cfg.MaxConnLifetime = 30 * time.Minute
+
+	// CedarDB doesn't currently support the Postgres extended query protocol
+	// (Parse / Bind / Execute), and signals SQLSTATE 08P01
+	// "invalid message in simple query mode" when pgx tries to use it.
+	// Forcing simple-protocol mode makes pgx inline parameters as text
+	// literals and send each query as a single `Q` message — the only
+	// protocol path CedarDB accepts. COPY is unaffected (it uses its own
+	// dedicated protocol regardless of this setting).
+	cfg.ConnConfig.DefaultQueryExecMode = pgx.QueryExecModeSimpleProtocol
+
+	pool, err := pgxpool.NewWithConfig(ctx, cfg)
+	if err != nil {
+		return nil, fmt.Errorf("dial CedarDB: %w", err)
+	}
+
+	pingCtx, cancel := context.WithTimeout(ctx, 5*time.Second)
+	defer cancel()
+	if err := pool.Ping(pingCtx); err != nil {
+		pool.Close()
+		return nil, fmt.Errorf("ping CedarDB: %w", err)
+	}
+	return pool, nil
+}
diff --git a/homeguard-iot/internal/db/schema.go b/homeguard-iot/internal/db/schema.go
new file mode 100644
index 0000000..d060c10
--- /dev/null
+++ b/homeguard-iot/internal/db/schema.go
@@ -0,0 +1,113 @@
+package db
+
+import (
+	"context"
+	_ "embed"
+	"fmt"
+	"log"
+	"strings"
+
+	"github.com/jackc/pgx/v5"
+	"github.com/jackc/pgx/v5/pgxpool"
+)
+
+// SchemaSQL is the canonical schema, embedded at build time. Edit
+// schema.sql and `go build` re-bakes it.
+//
+//go:embed schema.sql
+var SchemaSQL string
+
+// SchemaPresent reports whether the `households` table exists. Used for a
+// diagnostic log line on startup — we always call ApplySchema anyway since
+// the file is idempotent (CREATE TABLE IF NOT EXISTS).
+func SchemaPresent(ctx context.Context, pool *pgxpool.Pool) (bool, error) {
+	var present bool
+	if err := pool.QueryRow(ctx, `
+		SELECT (SELECT COUNT(*) FROM information_schema.tables
+		         WHERE table_name = 'households') = 1
+	`).Scan(&present); err != nil {
+		return false, fmt.Errorf("schema presence check: %w", err)
+	}
+	return present, nil
+}
+
+// ApplySchema runs the embedded schema.sql one statement at a time using
+// the Postgres simple-query protocol (pgx.QueryExecModeSimpleProtocol).
+// CedarDB doesn't accept DDL through the extended Parse/Bind/Execute path
+// the same way vanilla Postgres does, so prepared-statement mode silently
+// fails to apply CREATE TABLE; simple-protocol bypasses that.
+func ApplySchema(ctx context.Context, pool *pgxpool.Pool) error {
+	statements := splitSQLStatements(SchemaSQL)
+	log.Printf("applying schema: %d statements", len(statements))
+	for i, stmt := range statements {
+		if _, err := pool.Exec(ctx, stmt, pgx.QueryExecModeSimpleProtocol); err != nil {
+			return fmt.Errorf(
+				"apply schema (statement %d of %d failed): %w\n--- failing statement ---\n%s\n",
+				i+1, len(statements), err, stmt,
+			)
+		}
+		log.Printf("  [%2d/%d] %s — ok", i+1, len(statements), firstLine(stmt))
+	}
+	log.Printf("schema applied")
+	return nil
+}
+
+// ResetSchema drops everything and re-applies — destructive. Wired behind
+// the simulator's -reset-schema flag.
+func ResetSchema(ctx context.Context, pool *pgxpool.Pool) error {
+	drops := []string{
+		"DROP TABLE IF EXISTS storage_samples CASCADE",
+		"DROP TABLE IF EXISTS alerts          CASCADE",
+		"DROP TABLE IF EXISTS events          CASCADE",
+		"DROP TABLE IF EXISTS devices         CASCADE",
+		"DROP TABLE IF EXISTS households      CASCADE",
+		"DROP TABLE IF EXISTS device_types    CASCADE",
+		"DROP TABLE IF EXISTS regions         CASCADE",
+		"DROP TABLE IF EXISTS plans           CASCADE",
+	}
+	log.Printf("resetting schema: dropping %d tables", len(drops))
+	for i, stmt := range drops {
+		if _, err := pool.Exec(ctx, stmt, pgx.QueryExecModeSimpleProtocol); err != nil {
+			return fmt.Errorf("drop %d: %w", i, err)
+		}
+		log.Printf("  [%d/%d] %s — ok", i+1, len(drops), stmt)
+	}
+	return ApplySchema(ctx, pool)
+}
+
+// splitSQLStatements strips `--` line comments and splits the script on `;`.
+// Sufficient for our DDL (no string literals or function bodies with
+// embedded semicolons).
+func splitSQLStatements(sql string) []string {
+	var clean strings.Builder
+	clean.Grow(len(sql))
+	for _, line := range strings.Split(sql, "\n") {
+		if idx := strings.Index(line, "--"); idx >= 0 {
+			line = line[:idx]
+		}
+		clean.WriteString(line)
+		clean.WriteByte('\n')
+	}
+	out := make([]string, 0, 16)
+	for _, raw := range strings.Split(clean.String(), ";") {
+		stmt := strings.TrimSpace(raw)
+		if stmt != "" {
+			out = append(out, stmt)
+		}
+	}
+	return out
+}
+
+func firstLine(stmt string) string {
+	for _, line := range strings.Split(stmt, "\n") {
+		line = strings.TrimSpace(line)
+		if line == "" {
+			continue
+		}
+		if len(line) > 70 {
+			return line[:67] + "..."
+		}
+		return line
+	}
+	return "(empty)"
+}
diff --git a/homeguard-iot/internal/db/schema.sql b/homeguard-iot/internal/db/schema.sql
new file mode 100644
index 0000000..88cfece
--- /dev/null
+++ b/homeguard-iot/internal/db/schema.sql
@@ -0,0 +1,106 @@
+-- HomeGuard IoT demo schema for CedarDB (Postgres dialect).
+--
+-- The story: a home-security operator running ~3-4 TB/day of incoming IoT
+-- events from millions of devices. Currently they pipe everything into
+-- BigQuery for offline analytics and run a parallel real-time stack
+-- (Pub/Sub + Dataflow + a KV store) for the monitoring center.
+--
+-- The CedarDB pitch is "collapse those two stacks: same table, OLTP-style
+-- writes and OLAP-style reads, no replication lag, real joins across the
+-- normalized dimension model."
+--
+-- Tables:
+--   plans          - subscription tiers; SLA in seconds per tier
+--   regions        - service regions / dispatch centers
+--   device_types   - sensor catalog (motion, door, smoke, etc.)
+--   households    -- customers; FK to plan + region
+--   devices       -- sensors; FK to household + device_type
+--   events         - HOT TABLE; up to ~500K rows/sec in the high-rate demo
+--   alerts         - rules-derived alerts; smaller volume; the operator queue
+--
+-- This file is idempotent (CREATE TABLE IF NOT EXISTS on everything) so the
+-- simulator can safely apply it on every cold start. For destructive reset
+-- pass -reset-schema to the simulator.
+
+CREATE TABLE IF NOT EXISTS plans (
+    plan_id           INTEGER PRIMARY KEY,
+    name              TEXT NOT NULL,
+    monthly_price_usd NUMERIC(8, 2) NOT NULL,
+    sla_seconds       INTEGER NOT NULL  -- monitoring response SLA
+);
+
+CREATE TABLE IF NOT EXISTS regions (
+    region_id        INTEGER PRIMARY KEY,
+    name             TEXT NOT NULL,
+    dispatch_center  TEXT NOT NULL,
+    timezone         TEXT NOT NULL
+);
+
+CREATE TABLE IF NOT EXISTS device_types (
+    device_type_id   INTEGER PRIMARY KEY,
+    code             TEXT NOT NULL,  -- MOTION, DOOR, WINDOW, GLASS_BREAK, SMOKE, CO, WATER, TEMP, DOORBELL, KEYPAD
+    name             TEXT NOT NULL,
+    default_severity SMALLINT NOT NULL
+);
+
+CREATE TABLE IF NOT EXISTS households (
+    household_id  BIGINT PRIMARY KEY,
+    plan_id       INTEGER NOT NULL,
+    region_id     INTEGER NOT NULL,
+    address_hash  TEXT NOT NULL,                  -- hashed for privacy
+    enrolled_at   TIMESTAMPTZ NOT NULL DEFAULT now(),
+    armed         BOOLEAN NOT NULL DEFAULT false  -- current armed state
+);
+
+CREATE TABLE IF NOT EXISTS devices (
+    device_id        BIGINT PRIMARY KEY,
+    household_id     BIGINT NOT NULL,
+    device_type_id   INTEGER NOT NULL,
+    location         TEXT NOT NULL,    -- 'front_door', 'kitchen', 'master_bedroom', ...
+    installed_at     TIMESTAMPTZ NOT NULL DEFAULT now(),
+    last_battery_pct SMALLINT
+);
+
+-- Hot table. ~500K rows/sec in the high-rate demo. Billions of rows over
+-- time. event_id is plain BIGINT (not BIGSERIAL) — the simulator generates
+-- ids from a client-side atomic counter so it can use the binary COPY path,
+-- which CedarDB rejects for BIGSERIAL defaults ("unable to cast from void
+-- to bigint"). Multiple writer goroutines share the same id space via the
+-- atomic counter.
+CREATE TABLE IF NOT EXISTS events (
+    event_id      BIGINT NOT NULL,
+    device_id     BIGINT NOT NULL,
+    household_id  BIGINT NOT NULL,   -- denormalized to skip a join on the hot path
+    ts            TIMESTAMPTZ NOT NULL DEFAULT now(),
+    kind          SMALLINT NOT NULL, -- 0=heartbeat, 1=triggered, 2=battery_low, 3=offline, 4=tamper
+    severity      SMALLINT NOT NULL, -- 0=normal, 1..5 escalating
+    value         DOUBLE PRECISION,  -- sensor reading (temp °C, motion confidence, etc.)
+    battery_pct   SMALLINT,
+    rssi_dbm      SMALLINT           -- wireless signal strength
+);
+
+CREATE TABLE IF NOT EXISTS alerts (
+    alert_id           BIGSERIAL PRIMARY KEY,
+    household_id       BIGINT NOT NULL,
+    triggered_event_id BIGINT NOT NULL,
+    raised_at          TIMESTAMPTZ NOT NULL DEFAULT now(),
+    severity           SMALLINT NOT NULL,
+    status             TEXT NOT NULL,        -- 'active', 'dispatched', 'resolved', 'false_alarm'
+    detail             TEXT,
+    resolved_at        TIMESTAMPTZ,
+    resolution_ms      INTEGER
+);
+
+-- Periodic storage-growth samples: one row per ~5s, sourced from
+-- CedarDB's cedardb_compression_info system view. The dashboard's
+-- /api/storage endpoint derives 1m/5m/15m ingest rates from this table.
+-- sampled_at uses now()'s microsecond precision so it is unique at the
+-- sampler's 5-second cadence.
+CREATE TABLE IF NOT EXISTS storage_samples (
+    sampled_at         TIMESTAMPTZ PRIMARY KEY,
+    uncompressed_bytes BIGINT NOT NULL
+);
+
+CREATE INDEX alerts_status_raised_idx ON alerts (status, raised_at DESC);
+CREATE INDEX alerts_household_raised_idx ON alerts (household_id, raised_at DESC);
+
diff --git a/homeguard-iot/internal/sim/catalog.go b/homeguard-iot/internal/sim/catalog.go
new file mode 100644
index 0000000..6881879
--- /dev/null
+++ b/homeguard-iot/internal/sim/catalog.go
@@ -0,0 +1,75 @@
+package sim
+
+// Static catalog data: plans, regions, device types. These get upserted on
+// simulator startup and rarely change. Keeping them in code (vs a seed file)
+// means the demo is self-contained and reproducible — no external CSVs to
+// keep in sync.
+
+// Plan describes one subscription tier with its monitoring SLA.
+type Plan struct {
+	ID         int
+	Name       string
+	PriceUSD   float64
+	SLASeconds int
+}
+
+var Plans = []Plan{
+	{ID: 1, Name: "Basic", PriceUSD: 19.99, SLASeconds: 300},   // 5 min
+	{ID: 2, Name: "Plus", PriceUSD: 39.99, SLASeconds: 120},    // 2 min
+	{ID: 3, Name: "Premium", PriceUSD: 59.99, SLASeconds: 60},  // 1 min
+	{ID: 4, Name: "Concierge", PriceUSD: 99.99, SLASeconds: 30}, // 30 s
+}
+
+// Region groups households for dispatch + timezone purposes.
+type Region struct {
+	ID             int
+	Name           string
+	DispatchCenter string
+	Timezone       string
+}
+
+var Regions = []Region{
+	{1, "Northeast", "Boston DC", "America/New_York"},
+	{2, "Mid-Atlantic", "Newark DC", "America/New_York"},
+	{3, "Southeast", "Atlanta DC", "America/New_York"},
+	{4, "Midwest", "Chicago DC", "America/Chicago"},
+	{5, "South-Central", "Dallas DC", "America/Chicago"},
+	{6, "Mountain", "Denver DC", "America/Denver"},
+	{7, "Pacific", "Phoenix DC", "America/Phoenix"},
+	{8, "Pacific-NW", "Seattle DC", "America/Los_Angeles"},
+	{9, "California", "San Jose DC", "America/Los_Angeles"},
+	{10, "Canada-East", "Toronto DC", "America/Toronto"},
+}
+
+// DeviceType is the sensor catalog. `DefaultSeverity` is the baseline
+// alert severity if a device of this type fires a triggered event with the
+// household armed — alert rules can promote or demote per-event.
+type DeviceType struct {
+	ID              int
+	Code            string
+	Name            string
+	DefaultSeverity int
+}
+
+var DeviceTypes = []DeviceType{
+	{ID: 1, Code: "MOTION", Name: "Motion sensor", DefaultSeverity: 3},
+	{ID: 2, Code: "DOOR", Name: "Door contact", DefaultSeverity: 4},
+	{ID: 3, Code: "WINDOW", Name: "Window contact", DefaultSeverity: 4},
+	{ID: 4, Code: "GLASS_BREAK", Name: "Glass-break detector", DefaultSeverity: 4},
+	{ID: 5, Code: "SMOKE", Name: "Smoke detector", DefaultSeverity: 5},
+	{ID: 6, Code: "CO", Name: "Carbon monoxide detector", DefaultSeverity: 5},
+	{ID: 7, Code: "WATER", Name: "Water leak sensor", DefaultSeverity: 3},
+	{ID: 8, Code: "TEMP", Name: "Temperature sensor", DefaultSeverity: 1},
+	{ID: 9, Code: "DOORBELL", Name: "Smart doorbell", DefaultSeverity: 1},
+	{ID: 10, Code: "KEYPAD", Name: "Entry keypad", DefaultSeverity: 1},
+}
+
+// Locations is the pool of in-home placements used when allocating devices.
+// Not every device type lives in every location, but the simulator picks
+// per-type plausibly (see allocateDevices in simulator.go).
+var Locations = []string{
+	"front_door", "back_door", "garage_door", "side_door",
+	"living_room", "kitchen", "dining_room",
+	"master_bedroom", "bedroom_2", "bedroom_3",
+	"basement", "attic", "utility_room", "hallway",
+}
diff --git a/homeguard-iot/internal/sim/rules.go b/homeguard-iot/internal/sim/rules.go
new file mode 100644
index 0000000..cfe6e85
--- /dev/null
+++ b/homeguard-iot/internal/sim/rules.go
@@ -0,0 +1,61 @@
+package sim
+
+// Alert rules: deterministically promote certain triggered events into
+// rows in the alerts table. The mapping is intentionally simple so the
+// query that joins alerts × households × plans × device_types tells a
+// readable story on the dashboard.
+//
+// kind values mirror the SMALLINT column in events:
+//
+//	0 = heartbeat       (never alerts)
+//	1 = triggered       (the alerter; consult device_type + armed state)
+//	2 = battery_low     (info-level alert if severity >= 4 plan SLA)
+//	3 = offline         (skipped for now; could become an alert if device
+//	                     was previously online for a long time)
+//	4 = tamper          (always alerts at severity 4)
+
+// AlertDecision says whether to raise an alert for a given event, and at
+// what severity. nil = don't raise.
+type AlertDecision struct {
+	Severity int
+	Detail   string
+}
+
+// EvaluateAlert applies the demo's alert rules to a candidate event.
+// Inputs are the device type code (e.g. "SMOKE"), event kind, and whether
+// the household was armed at the moment the event fired.
+func EvaluateAlert(deviceCode string, kind int, armed bool) *AlertDecision {
+	switch kind {
+	case 4: // tamper — always alert, anywhere
+		return &AlertDecision{Severity: 4, Detail: "Device tamper detected"}
+	case 1: // triggered — depends on device type and armed state
+		switch deviceCode {
+		case "SMOKE":
+			return &AlertDecision{Severity: 5, Detail: "Smoke detected"}
+		case "CO":
+			return &AlertDecision{Severity: 5, Detail: "Carbon monoxide detected"}
+		case "GLASS_BREAK":
+			return &AlertDecision{Severity: 4, Detail: "Glass break detected"}
+		case "WATER":
+			return &AlertDecision{Severity: 3, Detail: "Water leak detected"}
+		case "DOOR", "WINDOW":
+			if armed {
+				return &AlertDecision{Severity: 4, Detail: deviceCode + " opened while armed"}
+			}
+			// Door/window events with the system disarmed are routine.
+			return nil
+		case "MOTION":
+			if armed {
+				return &AlertDecision{Severity: 3, Detail: "Motion detected while armed"}
+			}
+			return nil
+		case "DOORBELL":
+			// Informational only.
+			return nil
+		default:
+			return nil
+		}
+	default:
+		return nil
+	}
+}
diff --git a/homeguard-iot/internal/sim/simulator.go b/homeguard-iot/internal/sim/simulator.go
new file mode 100644
index 0000000..927c460
--- /dev/null
+++ b/homeguard-iot/internal/sim/simulator.go
@@ -0,0 +1,926 @@
+// Package sim generates the IoT telemetry stream and writes it to CedarDB.
+//
+// On startup the simulator upserts the dimension data (plans, regions,
+// device_types) and synthesizes a fleet of households with ~10 devices
+// each. At runtime the work splits into two layers:
+//
+//  1. N row-producer goroutines ("writers"). Each owns a slice of the
+//     device fleet, ticks at TickHz, and produces its share of the per-
+//     tick budget: heartbeats rotated round-robin over its partition,
+//     a handful of triggered events, and very occasional battery_low /
+//     tamper / offline events. Producers do NOT talk to the DB on the
+//     hot path; they push completed row batches onto eventCh.
+//
+//  2. One ingestor goroutine. Drains eventCh, coalesces batches up to
+//     ingestBatch rows (or every flushInterval, whichever comes first),
+//     and writes them via a single pgx.CopyFrom. Because there is only
+//     one CopyFrom in flight at a time, CedarDB never trips the
+//     "cannot start bulk operation until previous bulk operation has
+//     become globally visible" (SQLSTATE 40P01) error that fires when
+//     two writer goroutines try to COPY in parallel.
+//
+// All producers share an atomic event_id counter so they don't collide
+// on the BIGINT primary key. (event_id is plain BIGINT rather than
+// BIGSERIAL — CedarDB rejects the COPY frame when the column has a
+// sequence default; client-side ids dodge the problem entirely.)
+//
+// Triggered events are run through the alert rules (rules.go) and matching
+// alerts are inserted into the alerts table. Background goroutines resolve
+// old active alerts and shuffle households' armed state so the dashboard
+// stays dynamic.
+package sim
+
+import (
+	"context"
+	"crypto/sha1"
+	"encoding/hex"
+	"fmt"
+	"log"
+	"math/rand"
+	"os"
+	"strconv"
+	"sync"
+	"sync/atomic"
+	"time"
+
+	"github.com/jackc/pgx/v5"
+	"github.com/jackc/pgx/v5/pgxpool"
+)
+
+// envDuration reads a Go duration string (e.g. "5s", "200ms", "1m30s")
+// from the named environment variable, falling back to the supplied
+// default if the variable is unset or unparseable. Used to externalise
+// polling intervals so they can be tuned from docker-compose without a
+// rebuild.
+func envDuration(name string, fallback time.Duration) time.Duration {
+	v := os.Getenv(name)
+	if v == "" {
+		return fallback
+	}
+	d, err := time.ParseDuration(v)
+	if err != nil {
+		log.Printf("envDuration: invalid %s=%q (%v); using default %s", name, v, err, fallback)
+		return fallback
+	}
+	return d
+}
+
+// envInt reads a positive integer from the named environment variable,
+// falling back to the default if unset, unparseable, or <= 0.
+func envInt(name string, fallback int) int {
+	v := os.Getenv(name)
+	if v == "" {
+		return fallback
+	}
+	n, err := strconv.Atoi(v)
+	if err != nil || n <= 0 {
+		log.Printf("envInt: invalid %s=%q; using default %d", name, v, fallback)
+		return fallback
+	}
+	return n
+}
+
+// envBool reads a boolean (strconv.ParseBool grammar: 1/t/T/TRUE/true/True
+// or 0/f/F/FALSE/false/False) from the named environment variable,
+// falling back to the supplied default if unset or unparseable.
+func envBool(name string, fallback bool) bool {
+	v := os.Getenv(name)
+	if v == "" {
+		return fallback
+	}
+	b, err := strconv.ParseBool(v)
+	if err != nil {
+		log.Printf("envBool: invalid %s=%q; using default %v", name, v, fallback)
+		return fallback
+	}
+	return b
+}
+
+// Tuning constants for the fan-in COPY drainer.
+const (
+	// ingestBatch is the row count at which the ingestor flushes a COPY.
+	// Larger = better amortization of CopyFrom overhead; smaller = lower
+	// latency and tighter alert-to-event ordering.
+	ingestBatch = 10000
+
+	// flushInterval caps how long pending rows can sit before a partial
+	// COPY is forced. Keeps the dashboard's live event stream fresh even
+	// when -rate is well below ingestBatch/flushInterval.
+	flushInterval = 50 * time.Millisecond
+
+	// eventChCapacity holds completed row batches awaiting ingest. Sized
+	// for ~2 batches per writer at the highest -writers we'd expect, so
+	// producers rarely block on send.
+	eventChCapacity = 64
+
+	// Auto-tune knobs for the alert resolver. The per-tick LIMIT is
+	//   max(env_floor, deltaFired * autoTuneHeadroom + backlog * backlogDecay)
+	// then capped at maxAutoLimit.
+	autoTuneHeadroom = 1.2   // drain 20% faster than current generation
+	backlogDecay     = 0.01  // chip 1% off the standing backlog per tick
+	maxAutoLimit     = 100000 // ceiling per tick so a runaway UPDATE can't bury CedarDB
+)
+
+// Device is one physical sensor placed in a household.
+type Device struct {
+	ID         int64
+	Household  int64
+	TypeID     int
+	Code       string // device type code; cached so we don't re-join per tick
+	Location   string
+	BatteryPct int
+}
+
+// Household is one customer.
+type Household struct {
+	ID          int64
+	PlanID      int
+	RegionID    int
+	AddressHash string
+	Armed       bool
+}
+
+// Simulator state. Read-only on the hot path after New() returns: writers
+// hold pointers in but don't mutate any field except via eventID's atomic
+// counter and the per-device BatteryPct (writers partition the device
+// slice so no two writers ever touch the same Device).
+//
+// One exception: Household.Armed is read by writers (when evaluating
+// alert rules on triggered events) and written by shuffleArmedLoop in a
+// separate goroutine. armedMu protects that access — writers take it as
+// a reader, the shuffler as a writer.
+//
+// eventCh is the producer→ingestor handoff. Writers push completed row
+// batches onto it; one ingestor goroutine drains it and runs CopyFrom.
+type Simulator struct {
+	Pool             *pgxpool.Pool
+	Households       []Household
+	Devices          []Device
+	TickHz           int
+	TargetRate       int          // events per second total target
+	Writers          int          // number of writer goroutines
+	eventID          atomic.Int64 // shared id generator; seeded in Run()
+	armedMu          sync.RWMutex // guards Household.Armed reads/writes
+	eventCh          chan [][]any // producer→ingestor batch queue
+	lastCopyUnixNano atomic.Int64 // UnixNano of last successful CopyFrom (0 if none yet)
+	copyFailures     atomic.Int64 // total CopyFrom errors since startup
+
+	// Resolver instrumentation. fireAlert bumps the appropriate fired
+	// counter; the resolver bumps the resolved counters with the UPDATE's
+	// CommandTag.RowsAffected. (fired - resolved) ≈ backlog in CedarDB,
+	// computed in-process so we don't need a COUNT(*) on the alerts table.
+	alertsFiredLow     atomic.Int64
+	alertsFiredHigh    atomic.Int64
+	alertsResolvedLow  atomic.Int64
+	alertsResolvedHigh atomic.Int64
+}
+
+// New builds the simulator state: upserts dimensions, synthesizes the
+// household & device fleet, but does not begin the loop. Call Run to
+// start ticking.
+func New(ctx context.Context, pool *pgxpool.Pool, householdCount, devicesPerHousehold, tickHz, targetRate, writers int) (*Simulator, error) {
+	if writers < 1 {
+		writers = 1
+	}
+	if err := upsertDimensions(ctx, pool); err != nil {
+		return nil, fmt.Errorf("upsert dimensions: %w", err)
+	}
+
+	rng := rand.New(rand.NewSource(0xC1D4A12)) // deterministic fleet, demo-friendly
+
+	hh, devs := generateFleet(rng, householdCount, devicesPerHousehold)
+	log.Printf("synthesized fleet: %d households · %d devices", len(hh), len(devs))
+
+	if err := upsertHouseholds(ctx, pool, hh); err != nil {
+		return nil, fmt.Errorf("upsert households: %w", err)
+	}
+	if err := upsertDevices(ctx, pool, devs); err != nil {
+		return nil, fmt.Errorf("upsert devices: %w", err)
+	}
+
+	return &Simulator{
+		Pool:       pool,
+		Households: hh,
+		Devices:    devs,
+		TickHz:     tickHz,
+		TargetRate: targetRate,
+		Writers:    writers,
+	}, nil
+}
+
+// Run seeds the event_id counter, launches the background loops (alert
+// resolution + armed-state shuffle + storage sampler), spawns the single
+// ingestor goroutine that owns CopyFrom, and fans out s.Writers
+// producer goroutines over partitions of the device fleet. Producers
+// build row batches and push them onto s.eventCh; the ingestor drains.
+func (s *Simulator) Run(ctx context.Context) error {
+	var maxID int64
+	if err := s.Pool.QueryRow(ctx,
+		`SELECT COALESCE(MAX(event_id), 0) FROM events`).Scan(&maxID); err != nil {
+		return fmt.Errorf("seed event_id: %w", err)
+	}
+	s.eventID.Store(maxID)
+	log.Printf("event_id counter seeded at %d", maxID)
+
+	s.eventCh = make(chan [][]any, eventChCapacity)
+
+	go s.resolveAlertsLoop(ctx)
+	go s.shuffleArmedLoop(ctx)
+	go s.storageSamplerLoop(ctx)
+
+	// Ingestor pool — N goroutines all drain eventCh and run CopyFrom in
+	// parallel. Default N=1 is the historical safe behaviour (CedarDB
+	// used to reject overlapping COPYs with SQLSTATE 40P01); newer
+	// CedarDB versions accept concurrent COPYs, in which case N > 1 may
+	// give a meaningful throughput bump. Tune with HG_INGESTORS.
+	numIngestors := envInt("HG_INGESTORS", 1)
+	var ingestorWG sync.WaitGroup
+	for i := 0; i < numIngestors; i++ {
+		ingestorWG.Add(1)
+		go func(id int) {
+			defer ingestorWG.Done()
+			s.ingestorLoop(ctx, id)
+		}(i)
+	}
+
+	// Heartbeat: one line per second describing pipeline health. Lets us
+	// distinguish "producers stopped" (delta=0) from "ingestor stuck"
+	// (queue high, lastCopy ago growing) from "CedarDB rejecting"
+	// (failures climbing).
+	go s.heartbeatLoop(ctx)
+
+	// Per-writer per-tick budget. The total target rate divides across
+	// writers; each writer then divides its share across ticks.
+	hbPerTick := s.TargetRate / s.TickHz / s.Writers
+	if hbPerTick < 1 {
+		hbPerTick = 1
+	}
+	log.Printf("simulator running: writers=%d tickHz=%d target=%d ev/s hb/tick/writer=%d fleet=%d devices · ingestors=%d batch=%d flush=%s",
+		s.Writers, s.TickHz, s.TargetRate, hbPerTick, len(s.Devices),
+		numIngestors, ingestBatch, flushInterval)
+
+	var wg sync.WaitGroup
+	for i := 0; i < s.Writers; i++ {
+		w := &writer{
+			id:        i,
+			sim:       s,
+			devices:   devicePartition(s.Devices, i, s.Writers),
+			rng:       rand.New(rand.NewSource(time.Now().UnixNano() ^ int64(i)*0xC0DE)),
+			hbPerTick: hbPerTick,
+		}
+		wg.Add(1)
+		go func() {
+			defer wg.Done()
+			w.run(ctx)
+		}()
+	}
+	wg.Wait()
+	// Producers are done; let the ingestors drain anything still pending.
+	// Closing the channel is broadcast to every receiver, so all N
+	// ingestors observe (zero, false) and exit cleanly.
+	close(s.eventCh)
+	ingestorWG.Wait()
+	return ctx.Err()
+}
+
+// devicePartition returns the i-th slice of d, split into n contiguous
+// chunks. The last partition absorbs any remainder.
+func devicePartition(d []Device, i, n int) []Device {
+	chunk := len(d) / n
+	lo := i * chunk
+	hi := lo + chunk
+	if i == n-1 {
+		hi = len(d)
+	}
+	return d[lo:hi]
+}
+
+// writer is one ingest goroutine. Owns a slice of the device fleet,
+// keeps its own deviceCursor and rng, allocates fresh event_ids from
+// sim.eventID, and writes each tick via pgx.CopyFrom on its own pool
+// connection. Writers never share any mutable state on the hot path
+// except the atomic id counter.
+type writer struct {
+	id           int
+	sim          *Simulator
+	devices      []Device
+	rng          *rand.Rand
+	deviceCursor int
+	hbPerTick    int
+}
+
+func (w *writer) run(ctx context.Context) {
+	interval := time.Second / time.Duration(w.sim.TickHz)
+	tick := time.NewTicker(interval)
+	defer tick.Stop()
+	for {
+		select {
+		case <-ctx.Done():
+			return
+		case <-tick.C:
+			w.emitTick(ctx)
+		}
+	}
+}
+
+// emitTick generates one tick's worth of events from this writer's
+// device partition and hands the batch off to the ingestor goroutine
+// via the simulator's eventCh. No DB I/O happens here.
+func (w *writer) emitTick(ctx context.Context) {
+	if len(w.devices) == 0 {
+		return
+	}
+	now := time.Now()
+	rows := make([][]any, 0, w.hbPerTick+8)
+
+	// 1) Heartbeats: rotate through this writer's partition.
+	for i := 0; i < w.hbPerTick; i++ {
+		d := &w.devices[w.deviceCursor%len(w.devices)]
+		w.deviceCursor++
+		// Battery slowly drains over the demo; clamp at 5%.
+		if w.rng.Intn(10000) < 3 && d.BatteryPct > 5 {
+			d.BatteryPct--
+		}
+		rssi := -50 - w.rng.Intn(40) // -50 to -90 dBm
+		var val float64
+		if d.Code == "TEMP" {
+			val = 18.0 + w.rng.Float64()*10
+		}
+		rows = append(rows, w.row(d, now, 0 /*heartbeat*/, 0, val, rssi))
+	}
+
+	// 2) Triggered events: a handful per tick from random devices in
+	//    this writer's partition.
+	triggers := 3 + w.rng.Intn(5)
+	for i := 0; i < triggers; i++ {
+		d := &w.devices[w.rng.Intn(len(w.devices))]
+		hh := &w.sim.Households[d.Household%int64(len(w.sim.Households))]
+		rssi := -50 - w.rng.Intn(40)
+		rows = append(rows, w.row(d, now, 1 /*triggered*/, int(severityFor(d.Code)), 1.0, rssi))
+		w.sim.armedMu.RLock()
+		armed := hh.Armed
+		w.sim.armedMu.RUnlock()
+		if dec := EvaluateAlert(d.Code, 1, armed); dec != nil {
+			w.sim.fireAlert(ctx, d.Household, dec.Severity, dec.Detail)
+		}
+	}
+
+	// 3) Occasional battery_low / tamper / offline.
+	if w.rng.Intn(100) < 5 {
+		d := &w.devices[w.rng.Intn(len(w.devices))]
+		kind := []int{2, 4, 3}[w.rng.Intn(3)]
+		sev := 1
+		if kind == 4 {
+			sev = 4
+		}
+		rows = append(rows, w.row(d, now, kind, sev, 0.0, -60-w.rng.Intn(30)))
+		if kind == 4 {
+			if dec := EvaluateAlert(d.Code, 4, false); dec != nil {
+				w.sim.fireAlert(ctx, d.Household, dec.Severity, dec.Detail)
+			}
+		}
+	}
+
+	// 4) Hand the batch to the ingestor. The send blocks if the channel
+	//    is full — which is the right backpressure: it means the
+	//    ingestor (CedarDB) can't keep up, so we shouldn't generate more
+	//    rows until it can. ctx.Done unblocks for clean shutdown.
+	if len(rows) == 0 {
+		return
+	}
+	select {
+	case w.sim.eventCh <- rows:
+	case <-ctx.Done():
+	}
+}
+
+// row builds one events row with a freshly-allocated event_id. The
+// shared atomic counter means writers never collide on the BIGINT PK.
+func (w *writer) row(d *Device, ts time.Time, kind, sev int, val float64, rssi int) []any {
+	return []any{
+		w.sim.eventID.Add(1),
+		d.ID, d.Household, ts,
+		int16(kind), int16(sev), val,
+		int16(d.BatteryPct), int16(rssi),
+	}
+}
+
+// ingestorLoop is the single CopyFrom point. Coalesces batches off of
+// eventCh up to ingestBatch rows (or every flushInterval, whichever
+// comes first) and runs one pgx.CopyFrom per flush. Because there's
+// only ever one CopyFrom in flight, CedarDB never trips the
+// "previous bulk operation must become globally visible" (SQLSTATE
+// 40P01) error that fires when multiple goroutines COPY concurrently.
+//
+// Shutdown contract: Run() closes eventCh after all producers have
+// exited, the ingestor drains everything still in flight, runs one
+// final flush, and returns.
+func (s *Simulator) ingestorLoop(ctx context.Context, id int) {
+	batchSize := envInt("HG_INGEST_BATCH", ingestBatch)
+	if id == 0 {
+		log.Printf("ingestor: batchSize=%d flushInterval=%s", batchSize, flushInterval)
+	}
+
+	pending := make([][]any, 0, batchSize*2)
+	cols := []string{
+		"event_id", "device_id", "household_id", "ts",
+		"kind", "severity", "value", "battery_pct", "rssi_dbm",
+	}
+	flush := func() {
+		if len(pending) == 0 {
+			return
+		}
+		if _, err := s.Pool.CopyFrom(ctx,
+			pgx.Identifier{"events"}, cols, pgx.CopyFromRows(pending),
+		); err != nil {
+			s.copyFailures.Add(1)
+			log.Printf("ingestor %d: copy events: %v (rows=%d)", id, err, len(pending))
+		} else {
+			s.lastCopyUnixNano.Store(time.Now().UnixNano())
+		}
+		pending = pending[:0]
+	}
+
+	flushTicker := time.NewTicker(flushInterval)
+	defer flushTicker.Stop()
+
+	for {
+		select {
+		case <-ctx.Done():
+			flush()
+			return
+		case batch, ok := <-s.eventCh:
+			if !ok {
+				flush()
+				return
+			}
+			pending = append(pending, batch...)
+			if len(pending) >= batchSize {
+				flush()
+			}
+		case <-flushTicker.C:
+			flush()
+		}
+	}
+}
+
+// heartbeatLoop emits a one-line status report once a second so we can
+// tell from the logs whether the pipeline is healthy or stuck, and where
+// the stall is if it isn't:
+//
+//	queue=N/CAP        — eventCh depth. Near CAP means ingestor (CedarDB) is the bottleneck.
+//	delta=N rows/s     — row generation rate observed via the atomic counter.
+//	lastCopy=Xs ago    — wall time since the last successful CopyFrom. Should be < 1s under load.
+//	copyFails=N        — cumulative CopyFrom errors; non-zero means CedarDB is rejecting.
+//
+// 0 ev/s in the dashboard footer + queue=CAP + lastCopy growing → CedarDB stopped accepting writes.
+// 0 ev/s + queue=0 → producers stopped (look for goroutine panics).
+func (s *Simulator) heartbeatLoop(ctx context.Context) {
+	var prev int64
+	tick := time.NewTicker(1 * time.Second)
+	defer tick.Stop()
+	for {
+		select {
+		case <-ctx.Done():
+			return
+		case <-tick.C:
+			cur := s.eventID.Load()
+			delta := cur - prev
+			prev = cur
+			ago := time.Duration(-1)
+			if last := s.lastCopyUnixNano.Load(); last > 0 {
+				ago = time.Since(time.Unix(0, last)).Round(time.Millisecond)
+			}
+			log.Printf("heartbeat: queue=%d/%d eventID=%d delta=%d rows/s lastCopy=%v ago copyFails=%d",
+				len(s.eventCh), eventChCapacity, cur, delta, ago, s.copyFailures.Load())
+		}
+	}
+}
+
+// fireAlert inserts a row into alerts. Status starts 'active'; the
+// background resolveAlertsLoop will resolve it after a short delay.
+// We also bump the in-process generation counter the resolver uses to
+// auto-tune its drain rate — we don't wait for the DB INSERT to settle
+// because the resolver's controller wants the request rate, not the
+// committed rate (they differ only by transient pool / fsync latency).
+func (s *Simulator) fireAlert(ctx context.Context, householdID int64, severity int, detail string) {
+	if severity <= 2 {
+		s.alertsFiredLow.Add(1)
+	} else {
+		s.alertsFiredHigh.Add(1)
+	}
+	_, err := s.Pool.Exec(ctx, `
+		INSERT INTO alerts (household_id, triggered_event_id, raised_at, severity, status, detail)
+		VALUES ($1, 0, now(), $2, 'active', $3)
+	`, householdID, int16(severity), detail)
+	if err != nil {
+		log.Printf("insert alert: %v", err)
+	}
+}
+
+// resolveAlertsLoop runs in the background; every HG_RESOLVE_INTERVAL it
+// resolves the oldest active alerts so the operator queue doesn't grow
+// without bound. The mix of severities resolved varies (high-severity
+// stick around longer, low-severity get auto-cleared faster) to mimic
+// real triage.
+//
+// When HG_RESOLVE_AUTOTUNE is on (default), the per-tick LIMITs are
+// chosen at runtime as
+//
+//     limit = max(env_floor,
+//                 ceil(deltaFired * autoTuneHeadroom + backlog * backlogDecay))
+//
+// where deltaFired is alerts inserted since the previous tick, and
+// backlog ≈ alertsFired − alertsResolved tracked in two atomic counters
+// per severity tier. Both inputs come from in-process state so we never
+// have to COUNT(*) the alerts table to size the next tick.
+//
+// HG_RESOLVE_LOW_LIMIT / HG_RESOLVE_HIGH_LIMIT remain the floors —
+// auto-tune can push higher but never below them. Set
+// HG_RESOLVE_AUTOTUNE=false to pin the limits at the floors (useful when
+// you want to demo what happens to AGE/SLA as a backlog grows).
+func (s *Simulator) resolveAlertsLoop(ctx context.Context) {
+	interval := envDuration("HG_RESOLVE_INTERVAL", 2*time.Second)
+	lowFloor := envInt("HG_RESOLVE_LOW_LIMIT", 2000)
+	highFloor := envInt("HG_RESOLVE_HIGH_LIMIT", 600)
+	autoTune := envBool("HG_RESOLVE_AUTOTUNE", true)
+	log.Printf("alert resolver: interval=%s lowFloor=%d highFloor=%d autoTune=%v",
+		interval, lowFloor, highFloor, autoTune)
+
+	var lastFiredLow, lastFiredHigh int64
+	var lastLogAt time.Time
+	ticker := time.NewTicker(interval)
+	defer ticker.Stop()
+	for {
+		select {
+		case <-ctx.Done():
+			return
+		case <-ticker.C:
+			curFiredLow := s.alertsFiredLow.Load()
+			curFiredHigh := s.alertsFiredHigh.Load()
+			deltaLow := curFiredLow - lastFiredLow
+			deltaHigh := curFiredHigh - lastFiredHigh
+			lastFiredLow, lastFiredHigh = curFiredLow, curFiredHigh
+
+			backlogLow := curFiredLow - s.alertsResolvedLow.Load()
+			backlogHigh := curFiredHigh - s.alertsResolvedHigh.Load()
+			if backlogLow < 0 {
+				backlogLow = 0
+			}
+			if backlogHigh < 0 {
+				backlogHigh = 0
+			}
+
+			lowLimit, highLimit := lowFloor, highFloor
+			if autoTune {
+				autoLow := int(float64(deltaLow)*autoTuneHeadroom + float64(backlogLow)*backlogDecay)
+				autoHigh := int(float64(deltaHigh)*autoTuneHeadroom + float64(backlogHigh)*backlogDecay)
+				if autoLow > lowLimit {
+					lowLimit = autoLow
+				}
+				if autoHigh > highLimit {
+					highLimit = autoHigh
+				}
+			}
+			if lowLimit > maxAutoLimit {
+				lowLimit = maxAutoLimit
+			}
+			if highLimit > maxAutoLimit {
+				highLimit = maxAutoLimit
+			}
+
+			// Resolve low-severity (info / sev 1-2) → false_alarm.
+			lowTag, _ := s.Pool.Exec(ctx, `
+				UPDATE alerts
+				SET status = 'false_alarm',
+				    resolved_at = now(),
+				    resolution_ms = (EXTRACT(EPOCH FROM (now() - raised_at)) * 1000)::int
+				WHERE alert_id IN (
+					SELECT alert_id FROM alerts
+					WHERE status = 'active' AND severity <= 2
+					ORDER BY raised_at ASC LIMIT $1
+				)
+			`, lowLimit)
+			// Dispatch then resolve a sample of higher-severity older alerts.
+			highTag, _ := s.Pool.Exec(ctx, `
+				UPDATE alerts
+				SET status = 'resolved',
+				    resolved_at = now(),
+				    resolution_ms = (EXTRACT(EPOCH FROM (now() - raised_at)) * 1000)::int
+				WHERE alert_id IN (
+					SELECT alert_id FROM alerts
+					WHERE status = 'active' AND severity >= 3
+					  AND raised_at < now() - interval '20 seconds'
+					ORDER BY raised_at ASC LIMIT $1
+				)
+			`, highLimit)
+			s.alertsResolvedLow.Add(lowTag.RowsAffected())
+			s.alertsResolvedHigh.Add(highTag.RowsAffected())
+
+			// Once every 10 s, log what we drained and how big the queue is.
+			if time.Since(lastLogAt) >= 10*time.Second {
+				log.Printf("resolver: drain low=%d/%d high=%d/%d · backlog low=%d high=%d",
+					lowTag.RowsAffected(), lowLimit,
+					highTag.RowsAffected(), highLimit,
+					backlogLow, backlogHigh)
+				lastLogAt = time.Now()
+			}
+		}
+	}
+}
+
+// shuffleArmedLoop runs as its own goroutine (replaces the
+// armedTimer-driven path in the old emitTick). Every 10 seconds it
+// flips ~5% of households' armed state so the rules engine outputs
+// stay varied over the demo.
+func (s *Simulator) shuffleArmedLoop(ctx context.Context) {
+	rng := rand.New(rand.NewSource(time.Now().UnixNano() ^ 0xA1ED))
+	tick := time.NewTicker(10 * time.Second)
+	defer tick.Stop()
+	for {
+		select {
+		case <-ctx.Done():
+			return
+		case <-tick.C:
+			s.shuffleArmedState(ctx, rng)
+		}
+	}
+}
+
+func (s *Simulator) shuffleArmedState(ctx context.Context, rng *rand.Rand) {
+	// Flip ~5% of households' armed state in memory; push the new value
+	// in batches. armedMu guards the in-memory mutation against
+	// concurrent reads from the writer goroutines.
+	batch := &pgx.Batch{}
+	flipped := 0
+	s.armedMu.Lock()
+	for i := range s.Households {
+		if rng.Intn(20) == 0 {
+			s.Households[i].Armed = !s.Households[i].Armed
+			batch.Queue(`UPDATE households SET armed = $1 WHERE household_id = $2`,
+				s.Households[i].Armed, s.Households[i].ID)
+			flipped++
+		}
+	}
+	s.armedMu.Unlock()
+	if flipped == 0 {
+		return
+	}
+	if err := s.Pool.SendBatch(ctx, batch).Close(); err != nil {
+		log.Printf("armed-state shuffle: %v", err)
+	}
+}
+
+// eventsRowBytes is the per-row uncompressed footprint of the events
+// table, derived directly from the schema column widths:
+//
+//	event_id BIGINT(8) + device_id BIGINT(8) + household_id BIGINT(8)
+//	+ ts TIMESTAMPTZ(8) + kind SMALLINT(2) + severity SMALLINT(2)
+//	+ value DOUBLE(8) + battery_pct SMALLINT(2) + rssi_dbm SMALLINT(2)
+//	= 48 bytes
+//
+// We use this rather than CedarDB's cedardb_compression_info view
+// because the view only updates once written data lands in column-store
+// blocks — at high ingest rates that lags reality by many seconds.
+// COUNT(*) on events tracks live row counts in real time.
+const eventsRowBytes = 48
+
+// storageSamplerLoop records a (now(), 48 * eventID) sample into
+// storage_samples every 5 seconds. The dashboard's /api/storage endpoint
+// derives 1m/5m/15m ingest rates from this table.
+//
+// We compute bytes from the in-process atomic counter (s.eventID, which
+// is seeded from MAX(event_id) at startup and incremented per generated
+// row) rather than running COUNT(*) on the events table. Two reasons:
+//
+//  1. Cost. At demo scale the events table can reach billions of rows
+//     in well under an hour. COUNT(*) becomes a multi-second full scan
+//     and may hold a snapshot that pressures the write path — a likely
+//     cause of the 30-minute ingest stall we saw.
+//  2. Currency. The atomic counter is updated at the moment a row is
+//     generated, ahead of the CopyFrom that actually persists it. The
+//     gauge can therefore appear up to (channel queue + in-flight
+//     batch) rows ahead of what's truly in the table — at most ~160 K
+//     rows ≈ 8 MB at the demo's default backpressure ceiling, which is
+//     well under one second of headway and a fine trade-off given the
+//     scan cost we avoid.
+//
+// The number is "uncompressed-bytes-equivalent" — what the data would
+// weigh as a flat file. CedarDB's actual on-disk footprint after column
+// encoding (truncate, frame-of-reference, dictionary, etc.) is 5–10×
+// smaller.
+func (s *Simulator) storageSamplerLoop(ctx context.Context) {
+	interval := envDuration("HG_STORAGE_SAMPLER_INTERVAL", 5*time.Second)
+	log.Printf("storage sampler: interval=%s", interval)
+	tick := time.NewTicker(interval)
+	defer tick.Stop()
+	for {
+		select {
+		case <-ctx.Done():
+			return
+		case <-tick.C:
+			bytes := int64(eventsRowBytes) * s.eventID.Load()
+			if _, err := s.Pool.Exec(ctx, `
+				INSERT INTO storage_samples (sampled_at, uncompressed_bytes)
+				VALUES (now(), $1)
+			`, bytes); err != nil {
+				log.Printf("storage sampler: insert: %v", err)
+			}
+		}
+	}
+}
+
+// severityFor maps a device type code to its baseline severity for
+// triggered events. Used when writing the event row itself; the alert
+// decision (rules.go) does the more nuanced household-armed logic.
+func severityFor(code string) int16 {
+	for _, dt := range DeviceTypes {
+		if dt.Code == code {
+			return int16(dt.DefaultSeverity)
+		}
+	}
+	return 1
+}
+
+// --- fleet synthesis ------------------------------------------------------
+
+func generateFleet(rng *rand.Rand, householdCount, devicesPerHousehold int) ([]Household, []Device) {
+	hh := make([]Household, 0, householdCount)
+	devs := make([]Device, 0, householdCount*devicesPerHousehold)
+
+	for i := 0; i < householdCount; i++ {
+		id := int64(1000000 + i) // start at a friendly-looking ID
+		plan := pickWeighted(rng, []int{40, 35, 18, 7})         // basic-heavy
+		region := rng.Intn(len(Regions)) + 1
+		armed := rng.Intn(100) < 35 // ~35% armed at any time
+		hh = append(hh, Household{
+			ID:          id,
+			PlanID:      plan + 1,
+			RegionID:    region,
+			AddressHash: hashAddr(id),
+			Armed:       armed,
+		})
+
+		// Each household gets between (devicesPerHousehold/2) and
+		// (devicesPerHousehold*3/2) devices, distributed across plausible
+		// types and locations.
+		n := devicesPerHousehold/2 + rng.Intn(devicesPerHousehold)
+		for j := 0; j < n; j++ {
+			dt := pickDeviceType(rng)
+			loc := pickLocation(rng, dt.Code)
+			devs = append(devs, Device{
+				ID:         int64(len(devs)) + 100000,
+				Household:  id,
+				TypeID:     dt.ID,
+				Code:       dt.Code,
+				Location:   loc,
+				BatteryPct: 70 + rng.Intn(30), // 70..99
+			})
+		}
+	}
+	return hh, devs
+}
+
+func pickWeighted(rng *rand.Rand, weights []int) int {
+	total := 0
+	for _, w := range weights {
+		total += w
+	}
+	r := rng.Intn(total)
+	for i, w := range weights {
+		if r < w {
+			return i
+		}
+		r -= w
+	}
+	return len(weights) - 1
+}
+
+func pickDeviceType(rng *rand.Rand) DeviceType {
+	// Reasonable distribution: motion + door/window common; smoke/CO/water
+	// less common but always at least one. Doorbell + keypad moderate.
+	weights := []int{30, 22, 15, 5, 6, 4, 6, 4, 5, 3}
+	idx := pickWeighted(rng, weights)
+	return DeviceTypes[idx]
+}
+
+func pickLocation(rng *rand.Rand, code string) string {
+	// Map device types to plausible locations.
+	switch code {
+	case "DOOR":
+		return []string{"front_door", "back_door", "garage_door", "side_door"}[rng.Intn(4)]
+	case "WINDOW":
+		return []string{"living_room", "kitchen", "dining_room", "master_bedroom", "bedroom_2", "bedroom_3"}[rng.Intn(6)]
+	case "GLASS_BREAK":
+		return []string{"living_room", "dining_room", "kitchen"}[rng.Intn(3)]
+	case "SMOKE", "CO":
+		return []string{"kitchen", "hallway", "master_bedroom", "bedroom_2", "basement"}[rng.Intn(5)]
+	case "WATER":
+		return []string{"kitchen", "basement", "utility_room"}[rng.Intn(3)]
+	case "TEMP":
+		return []string{"living_room", "master_bedroom", "basement", "attic"}[rng.Intn(4)]
+	case "DOORBELL":
+		return "front_door"
+	case "KEYPAD":
+		return "front_door"
+	case "MOTION":
+		return Locations[rng.Intn(len(Locations))]
+	}
+	return Locations[rng.Intn(len(Locations))]
+}
+
+func hashAddr(id int64) string {
+	h := sha1.Sum([]byte("addr-" + strconv.FormatInt(id, 10)))
+	return hex.EncodeToString(h[:6]) // short, opaque
+}
+
+// --- dimension + fleet upserts -------------------------------------------
+
+func upsertDimensions(ctx context.Context, pool *pgxpool.Pool) error {
+	batch := &pgx.Batch{}
+	for _, p := range Plans {
+		batch.Queue(`
+			INSERT INTO plans (plan_id, name, monthly_price_usd, sla_seconds)
+			VALUES ($1, $2, $3, $4)
+			ON CONFLICT (plan_id) DO UPDATE SET
+				name = EXCLUDED.name,
+				monthly_price_usd = EXCLUDED.monthly_price_usd,
+				sla_seconds = EXCLUDED.sla_seconds
+		`, p.ID, p.Name, p.PriceUSD, p.SLASeconds)
+	}
+	for _, r := range Regions {
+		batch.Queue(`
+			INSERT INTO regions (region_id, name, dispatch_center, timezone)
+			VALUES ($1, $2, $3, $4)
+			ON CONFLICT (region_id) DO UPDATE SET
+				name = EXCLUDED.name,
+				dispatch_center = EXCLUDED.dispatch_center,
+				timezone = EXCLUDED.timezone
+		`, r.ID, r.Name, r.DispatchCenter, r.Timezone)
+	}
+	for _, dt := range DeviceTypes {
+		batch.Queue(`
+			INSERT INTO device_types (device_type_id, code, name, default_severity)
+			VALUES ($1, $2, $3, $4)
+			ON CONFLICT (device_type_id) DO UPDATE SET
+				code = EXCLUDED.code,
+				name = EXCLUDED.name,
+				default_severity = EXCLUDED.default_severity
+		`, dt.ID, dt.Code, dt.Name, int16(dt.DefaultSeverity))
+	}
+	return pool.SendBatch(ctx, batch).Close()
+}
+
+func upsertHouseholds(ctx context.Context, pool *pgxpool.Pool, hh []Household) error {
+	// CopyFrom into a temp staging approach would be faster, but a Batch of
+	// INSERTs with ON CONFLICT is simple, idempotent, and fast enough for
+	// ~30 000 rows once at startup.
+	const batchSize = 1000
+	for start := 0; start < len(hh); start += batchSize {
+		end := start + batchSize
+		if end > len(hh) {
+			end = len(hh)
+		}
+		batch := &pgx.Batch{}
+		for _, h := range hh[start:end] {
+			batch.Queue(`
+				INSERT INTO households (household_id, plan_id, region_id, address_hash, armed)
+				VALUES ($1, $2, $3, $4, $5)
+				ON CONFLICT (household_id) DO UPDATE SET
+					plan_id = EXCLUDED.plan_id,
+					region_id = EXCLUDED.region_id,
+					armed = EXCLUDED.armed
+			`, h.ID, h.PlanID, h.RegionID, h.AddressHash, h.Armed)
+		}
+		if err := pool.SendBatch(ctx, batch).Close(); err != nil {
+			return err
+		}
+	}
+	return nil
+}
+
+func upsertDevices(ctx context.Context, pool *pgxpool.Pool, devs []Device) error {
+	const batchSize = 1000
+	for start := 0; start < len(devs); start += batchSize {
+		end := start + batchSize
+		if end > len(devs) {
+			end = len(devs)
+		}
+		batch := &pgx.Batch{}
+		for _, d := range devs[start:end] {
+			batch.Queue(`
+				INSERT INTO devices (device_id, household_id, device_type_id, location, last_battery_pct)
+				VALUES ($1, $2, $3, $4, $5)
+				ON CONFLICT (device_id) DO UPDATE SET
+					household_id = EXCLUDED.household_id,
+					device_type_id = EXCLUDED.device_type_id,
+					location = EXCLUDED.location,
+					last_battery_pct = EXCLUDED.last_battery_pct
+			`, d.ID, d.Household, d.TypeID, d.Location, int16(d.BatteryPct))
+		}
+		if err := pool.SendBatch(ctx, batch).Close(); err != nil {
+			return err
+		}
+	}
+	return nil
+}
diff --git a/homeguard-iot/internal/web/server.go b/homeguard-iot/internal/web/server.go
new file mode 100644
index 0000000..960b5d8
--- /dev/null
+++ b/homeguard-iot/internal/web/server.go
@@ -0,0 +1,613 @@
+// Package web is the operator-facing dashboard.
+//
+// Layout (top → bottom): active-alerts queue, live event stream, customer
+// drill-down, ingest-rate footer. Each panel is driven by a different
+// query against the same CedarDB instance the simulator is INSERTing into,
+// at refresh rates from 200 ms to 2 s — that's the whole story this demo
+// is selling.
+package web
+
+import (
+	"context"
+	"database/sql"
+	"embed"
+	"encoding/json"
+	"fmt"
+	"html/template"
+	"io/fs"
+	"log"
+	"net/http"
+	"os"
+	"strconv"
+	"time"
+
+	"github.com/jackc/pgx/v5/pgxpool"
+)
+
+// envDuration reads a Go duration string from the named environment
+// variable, falling back to the supplied default. Mirrors the helper in
+// the sim package so neither has to import the other.
+func envDuration(name string, fallback time.Duration) time.Duration {
+	v := os.Getenv(name)
+	if v == "" {
+		return fallback
+	}
+	d, err := time.ParseDuration(v)
+	if err != nil {
+		log.Printf("envDuration: invalid %s=%q (%v); using default %s", name, v, err, fallback)
+		return fallback
+	}
+	return d
+}
+
+//go:embed templates/*.html
+var templatesFS embed.FS
+
+//go:embed static/*
+var staticFS embed.FS
+
+type Server struct {
+	Pool   *pgxpool.Pool
+	tpl    *template.Template
+	static http.Handler
+}
+
+func NewServer(pool *pgxpool.Pool) (*Server, error) {
+	tpl, err := template.ParseFS(templatesFS, "templates/*.html")
+	if err != nil {
+		return nil, fmt.Errorf("parse templates: %w", err)
+	}
+	sub, err := fs.Sub(staticFS, "static")
+	if err != nil {
+		return nil, fmt.Errorf("static sub: %w", err)
+	}
+	return &Server{
+		Pool:   pool,
+		tpl:    tpl,
+		static: http.FileServer(http.FS(sub)),
+	}, nil
+}
+
+func (s *Server) Routes() http.Handler {
+	mux := http.NewServeMux()
+	mux.HandleFunc("/", s.handleIndex)
+	mux.HandleFunc("/sse/events", s.handleSSE)
+	mux.HandleFunc("/api/alerts", s.handleAlerts)
+	mux.HandleFunc("/api/drilldown", s.handleDrilldown)
+	mux.HandleFunc("/api/stats", s.handleStats)
+	mux.HandleFunc("/api/storage", s.handleStorage)
+	mux.Handle("/static/", http.StripPrefix("/static/", s.static))
+	return mux
+}
+
+// --------------------------------------------------------------------- index
+
+// indexData carries the dashboard's polling cadences into the index
+// template. The four refresh intervals are millisecond integers so the
+// JS setInterval() calls and htmx "every Nms" triggers can use them
+// verbatim. Defaults match the original hardcoded values.
+type indexData struct {
+	AlertsRefreshMs    int64
+	DrilldownRefreshMs int64
+	StatsRefreshMs     int64
+	StorageRefreshMs   int64
+}
+
+func (s *Server) handleIndex(w http.ResponseWriter, r *http.Request) {
+	if r.URL.Path != "/" {
+		http.NotFound(w, r)
+		return
+	}
+	data := indexData{
+		AlertsRefreshMs:    envDuration("HG_ALERTS_REFRESH", 1*time.Second).Milliseconds(),
+		DrilldownRefreshMs: envDuration("HG_DRILLDOWN_REFRESH", 2*time.Second).Milliseconds(),
+		StatsRefreshMs:     envDuration("HG_STATS_REFRESH", 1*time.Second).Milliseconds(),
+		StorageRefreshMs:   envDuration("HG_STORAGE_REFRESH", 1*time.Second).Milliseconds(),
+	}
+	if err := s.tpl.ExecuteTemplate(w, "index.html", data); err != nil {
+		log.Printf("template: %v", err)
+	}
+}
+
+// --------------------------------------------------------------------- alerts
+
+// handleAlerts returns the operator's active-alerts queue as an HTML
+// fragment. Joins alerts × households × plans so we can compute the
+// per-alert SLA countdown (plan.sla_seconds - age) directly in the query.
+func (s *Server) handleAlerts(w http.ResponseWriter, r *http.Request) {
+	ctx := r.Context()
+	rows, err := s.Pool.Query(ctx, `
+		SELECT a.alert_id, a.severity, a.detail,
+		       EXTRACT(EPOCH FROM (now() - a.raised_at))::int AS age_s,
+		       p.sla_seconds,
+		       p.sla_seconds - EXTRACT(EPOCH FROM (now() - a.raised_at))::int AS sla_remaining,
+		       h.household_id, h.address_hash,
+		       p.name AS plan_name,
+		       r.name AS region_name,
+		       r.dispatch_center
+		FROM alerts a
+		JOIN households h ON h.household_id = a.household_id
+		JOIN plans      p ON p.plan_id      = h.plan_id
+		JOIN regions    r ON r.region_id    = h.region_id
+		WHERE a.status = 'active'
+		ORDER BY a.severity DESC, a.raised_at ASC
+		LIMIT 25
+	`)
+	if err != nil {
+		http.Error(w, err.Error(), http.StatusInternalServerError)
+		return
+	}
+	defer rows.Close()
+
+	type row struct {
+		AlertID      int64
+		Severity     int
+		Detail       string
+		AgeSec       int
+		SLASec       int
+		SLARemaining int
+		HouseholdID  int64
+		AddressHash  string
+		PlanName     string
+		RegionName   string
+		Dispatch     string
+		Breached     bool
+		AgeFmt       string
+		SLAFmt       string
+	}
+	var out []row
+	for rows.Next() {
+		var rr row
+		if err := rows.Scan(
+			&rr.AlertID, &rr.Severity, &rr.Detail, &rr.AgeSec,
+			&rr.SLASec, &rr.SLARemaining, &rr.HouseholdID, &rr.AddressHash,
+			&rr.PlanName, &rr.RegionName, &rr.Dispatch,
+		); err != nil {
+			http.Error(w, err.Error(), http.StatusInternalServerError)
+			return
+		}
+		rr.Breached = rr.SLARemaining < 0
+		rr.AgeFmt = fmtDuration(rr.AgeSec)
+		rr.SLAFmt = fmtDuration(absInt(rr.SLARemaining))
+		out = append(out, rr)
+	}
+	// If the iteration ended because of an error (most commonly a
+	// context-cancelled mid-scan on an unindexed query), surface it.
+	// Without this, a half-completed scan looks identical to "no rows"
+	// and the dashboard silently renders "no active alerts."
+	if err := rows.Err(); err != nil {
+		log.Printf("alerts query: rows.Err: %v", err)
+		http.Error(w, err.Error(), http.StatusInternalServerError)
+		return
+	}
+	w.Header().Set("Content-Type", "text/html; charset=utf-8")
+	if err := s.tpl.ExecuteTemplate(w, "alerts.html", out); err != nil {
+		log.Printf("alerts tpl: %v", err)
+	}
+}
+
+// --------------------------------------------------------------------- SSE
+
+// handleSSE streams a JSON snapshot of the most-recent non-heartbeat
+// events every 200 ms. Heartbeats are filtered out server-side so the
+// operator panel stays readable; the analytical queries elsewhere
+// continue to count every row.
+func (s *Server) handleSSE(w http.ResponseWriter, r *http.Request) {
+	flusher, ok := w.(http.Flusher)
+	if !ok {
+		http.Error(w, "streaming not supported", http.StatusInternalServerError)
+		return
+	}
+	w.Header().Set("Content-Type", "text/event-stream")
+	w.Header().Set("Cache-Control", "no-cache")
+	w.Header().Set("Connection", "keep-alive")
+	w.Header().Set("X-Accel-Buffering", "no")
+
+	ctx := r.Context()
+	tick := time.NewTicker(envDuration("HG_SSE_INTERVAL", 200*time.Millisecond))
+	defer tick.Stop()
+
+	if err := s.pushEventStream(ctx, w, flusher); err != nil {
+		return
+	}
+	for {
+		select {
+		case <-ctx.Done():
+			return
+		case <-tick.C:
+			if err := s.pushEventStream(ctx, w, flusher); err != nil {
+				return
+			}
+		}
+	}
+}
+
+type eventRow struct {
+	EventID     int64   `json:"event_id"`
+	Ts          string  `json:"ts"`
+	HouseholdID int64   `json:"household_id"`
+	AddressHash string  `json:"address_hash"`
+	DeviceCode  string  `json:"device_code"`
+	Location    string  `json:"location"`
+	Kind        int     `json:"kind"`
+	Severity    int     `json:"severity"`
+	BatteryPct  int     `json:"battery_pct"`
+	Region      string  `json:"region"`
+}
+
+func (s *Server) pushEventStream(ctx context.Context, w http.ResponseWriter, f http.Flusher) error {
+	// Newest 25 non-heartbeat events, joined with device + device_type +
+	// household + region for human-readable rendering. The (kind > 0)
+	// filter uses the events_kind_ts_idx index efficiently.
+	rows, err := s.Pool.Query(ctx, `
+		SELECT e.event_id, e.ts, e.household_id, h.address_hash,
+		       dt.code, d.location, e.kind, e.severity,
+		       COALESCE(e.battery_pct, -1), r.name
+		FROM events e
+		JOIN devices       d  ON d.device_id     = e.device_id
+		JOIN device_types  dt ON dt.device_type_id = d.device_type_id
+		JOIN households    h  ON h.household_id  = e.household_id
+		JOIN regions       r  ON r.region_id     = h.region_id
+		WHERE e.kind > 0
+		ORDER BY e.ts DESC
+		LIMIT 25
+	`)
+	if err != nil {
+		log.Printf("event stream query: %v", err)
+		return nil
+	}
+	defer rows.Close()
+
+	out := make([]eventRow, 0, 25)
+	for rows.Next() {
+		var er eventRow
+		var ts time.Time
+		var bp int
+		if err := rows.Scan(&er.EventID, &ts, &er.HouseholdID, &er.AddressHash,
+			&er.DeviceCode, &er.Location, &er.Kind, &er.Severity,
+			&bp, &er.Region); err != nil {
+			log.Printf("event stream scan: %v", err)
+			return nil
+		}
+		er.Ts = ts.Format("15:04:05")
+		er.BatteryPct = bp
+		out = append(out, er)
+	}
+	if err := rows.Err(); err != nil {
+		log.Printf("event stream: rows.Err: %v", err)
+		// Don't propagate as a stream error — just skip this frame and
+		// let the next tick try again.
+		return nil
+	}
+	buf, err := json.Marshal(map[string]any{
+		"events":   out,
+		"stamp_ms": time.Now().UnixMilli(),
+	})
+	if err != nil {
+		return err
+	}
+	if _, err := fmt.Fprintf(w, "data: %s\n\n", buf); err != nil {
+		return err
+	}
+	f.Flush()
+	return nil
+}
+
+// --------------------------------------------------------------------- drilldown
+
+// handleDrilldown returns the "what's going on at this household right
+// now" panel. We pick the household behind the highest-severity currently-
+// active alert (or the most-recently-active household if none) and show
+// its last 20 events with device + type joined in.
+//
+// This is the operator-flow query: alert fires → click household → see
+// context to decide dispatch.
+func (s *Server) handleDrilldown(w http.ResponseWriter, r *http.Request) {
+	ctx := r.Context()
+
+	var householdID int64
+	err := s.Pool.QueryRow(ctx, `
+		SELECT a.household_id
+		FROM alerts a
+		WHERE a.status = 'active'
+		ORDER BY a.severity DESC, a.raised_at ASC
+		LIMIT 1
+	`).Scan(&householdID)
+	if err != nil {
+		// Fallback: pick the household with the most events in the last
+		// 5 minutes — keeps the panel populated even when there are no
+		// active alerts.
+		_ = s.Pool.QueryRow(ctx, `
+			SELECT household_id
+			FROM events
+			WHERE ts > now() - interval '5 minutes'
+			GROUP BY household_id
+			ORDER BY COUNT(*) DESC
+			LIMIT 1
+		`).Scan(&householdID)
+	}
+	if householdID == 0 {
+		w.Header().Set("Content-Type", "text/html; charset=utf-8")
+		_, _ = w.Write([]byte(`<p class="muted">waiting for events…</p>`))
+		return
+	}
+
+	// Household metadata for the header.
+	var (
+		addr, planName, regionName, dispatch string
+		armed                                bool
+		slaSec                               int
+	)
+	if err := s.Pool.QueryRow(ctx, `
+		SELECT h.address_hash, h.armed, p.name, p.sla_seconds, r.name, r.dispatch_center
+		FROM   households h
+		JOIN   plans      p ON p.plan_id   = h.plan_id
+		JOIN   regions    r ON r.region_id = h.region_id
+		WHERE  h.household_id = $1
+	`, householdID).Scan(&addr, &armed, &planName, &slaSec, &regionName, &dispatch); err != nil {
+		http.Error(w, err.Error(), http.StatusInternalServerError)
+		return
+	}
+
+	// Last 20 events for this household, any kind.
+	rows, err := s.Pool.Query(ctx, `
+		SELECT e.ts, dt.code, d.location, e.kind, e.severity,
+		       COALESCE(e.battery_pct, -1)
+		FROM   events e
+		JOIN   devices      d  ON d.device_id      = e.device_id
+		JOIN   device_types dt ON dt.device_type_id = d.device_type_id
+		WHERE  e.household_id = $1
+		ORDER BY e.ts DESC
+		LIMIT 20
+	`, householdID)
+	if err != nil {
+		http.Error(w, err.Error(), http.StatusInternalServerError)
+		return
+	}
+	defer rows.Close()
+
+	type evRow struct {
+		Ts         string
+		Code       string
+		Location   string
+		Kind       int
+		Severity   int
+		BatteryPct int
+	}
+	var events []evRow
+	for rows.Next() {
+		var (
+			er evRow
+			ts time.Time
+		)
+		if err := rows.Scan(&ts, &er.Code, &er.Location, &er.Kind, &er.Severity, &er.BatteryPct); err != nil {
+			http.Error(w, err.Error(), http.StatusInternalServerError)
+			return
+		}
+		er.Ts = ts.Format("15:04:05")
+		events = append(events, er)
+	}
+	if err := rows.Err(); err != nil {
+		log.Printf("drilldown query: rows.Err: %v", err)
+		http.Error(w, err.Error(), http.StatusInternalServerError)
+		return
+	}
+
+	w.Header().Set("Content-Type", "text/html; charset=utf-8")
+	if err := s.tpl.ExecuteTemplate(w, "drilldown.html", map[string]any{
+		"HouseholdID": householdID,
+		"Address":     addr,
+		"Armed":       armed,
+		"Plan":        planName,
+		"SLASec":      slaSec,
+		"Region":      regionName,
+		"Dispatch":    dispatch,
+		"Events":      events,
+	}); err != nil {
+		log.Printf("drilldown tpl: %v", err)
+	}
+}
+
+// --------------------------------------------------------------------- stats
+
+// handleStats is the meta-query for the dashboard footer: how fast are
+// we ingesting right now, how many rows have landed total, how many
+// alerts are open. The dashboard footer reads this every
+// HG_STATS_REFRESH (default 1s).
+//
+// The event-rate side reads from storage_samples (which the simulator
+// populates every HG_STORAGE_SAMPLER_INTERVAL from its atomic eventID
+// counter), not from events itself. At billion-row scale, scanning
+// events for COUNT(*) — especially with no useful index — takes long
+// enough to make the footer effectively frozen. storage_samples is a
+// few hundred rows at most and answers instantly. The published
+// rows_per_sec is therefore the average over the last sample interval
+// (default 5s, the user may have set it longer in HG_STORAGE_SAMPLER_INTERVAL),
+// not a strict one-second window.
+func (s *Server) handleStats(w http.ResponseWriter, r *http.Request) {
+	ctx := r.Context()
+
+	var (
+		rowsPerSec int64
+		totalEv    int64
+		activeAlts int64
+		totalAlts  int64
+	)
+	// Event metrics — derived from the two most recent storage_samples
+	// rows. total_events = latest_bytes / 48. rate = bytes-delta over the
+	// elapsed time divided by 48.
+	var (
+		latestBytes, priorBytes sql.NullInt64
+		latestTs, priorTs       sql.NullTime
+	)
+	if err := s.Pool.QueryRow(ctx, `
+		SELECT
+		    (SELECT uncompressed_bytes FROM storage_samples
+		        ORDER BY sampled_at DESC LIMIT 1) AS latest_bytes,
+		    (SELECT sampled_at         FROM storage_samples
+		        ORDER BY sampled_at DESC LIMIT 1) AS latest_ts,
+		    (SELECT uncompressed_bytes FROM storage_samples
+		        WHERE sampled_at < (SELECT MAX(sampled_at) FROM storage_samples)
+		        ORDER BY sampled_at DESC LIMIT 1) AS prior_bytes,
+		    (SELECT sampled_at         FROM storage_samples
+		        WHERE sampled_at < (SELECT MAX(sampled_at) FROM storage_samples)
+		        ORDER BY sampled_at DESC LIMIT 1) AS prior_ts
+	`).Scan(&latestBytes, &latestTs, &priorBytes, &priorTs); err != nil {
+		http.Error(w, err.Error(), http.StatusInternalServerError)
+		return
+	}
+	if latestBytes.Valid {
+		totalEv = latestBytes.Int64 / int64(eventsRowBytes)
+	}
+	if latestBytes.Valid && priorBytes.Valid && latestTs.Valid && priorTs.Valid {
+		dt := latestTs.Time.Sub(priorTs.Time).Seconds()
+		if dt > 0 {
+			delta := latestBytes.Int64 - priorBytes.Int64
+			if delta < 0 {
+				delta = 0
+			}
+			rowsPerSec = int64(float64(delta) / dt / float64(eventsRowBytes))
+		}
+	}
+	// Alert metrics.
+	if err := s.Pool.QueryRow(ctx, `
+		SELECT
+		    COUNT(*) FILTER (WHERE status = 'active') AS active,
+		    COUNT(*)                                   AS total
+		FROM alerts
+	`).Scan(&activeAlts, &totalAlts); err != nil {
+		// Non-fatal; alerts table may be empty very early.
+		activeAlts = 0
+		totalAlts = 0
+	}
+
+	w.Header().Set("Content-Type", "application/json")
+	_ = json.NewEncoder(w).Encode(map[string]any{
+		"rows_per_sec":  rowsPerSec,
+		"total_events":  totalEv,
+		"active_alerts": activeAlts,
+		"total_alerts":  totalAlts,
+	})
+}
+
+// --------------------------------------------------------------------- storage
+
+// targetBytesPerSec: 3 TB/day uncompressed = 3 * 10^12 / 86 400 s ≈
+// 34.722 MB/s. The dashboard renders gauges against this target.
+const targetBytesPerSec = 34_722_222
+
+// eventsRowBytes: per-row uncompressed footprint of the events table,
+// derived from the schema column widths (matches the constant of the
+// same name in the sim package). Used to convert storage_samples
+// uncompressed_bytes back into row counts for the stats footer.
+const eventsRowBytes = 48
+
+// handleStorage returns the most recent uncompressed-size sample plus
+// rates computed across the last 1, 5, and 15 minutes. Rates are
+// (latest_bytes - bytes_at_window_start) / window_seconds — so they
+// reflect the real growth of stored data, not the wire-rate of inserts.
+func (s *Server) handleStorage(w http.ResponseWriter, r *http.Request) {
+	ctx := r.Context()
+
+	type windowResult struct {
+		LatestBytes sql.NullInt64
+		LatestTs    sql.NullTime
+		Bytes1mAgo  sql.NullInt64
+		Ts1mAgo     sql.NullTime
+		Bytes5mAgo  sql.NullInt64
+		Ts5mAgo     sql.NullTime
+		Bytes15mAgo sql.NullInt64
+		Ts15mAgo    sql.NullTime
+	}
+	var rr windowResult
+	if err := s.Pool.QueryRow(ctx, `
+		SELECT
+		    (SELECT uncompressed_bytes FROM storage_samples
+		        ORDER BY sampled_at DESC LIMIT 1) AS latest_bytes,
+		    (SELECT sampled_at         FROM storage_samples
+		        ORDER BY sampled_at DESC LIMIT 1) AS latest_ts,
+		    (SELECT uncompressed_bytes FROM storage_samples
+		        WHERE sampled_at <= now() - interval '1 minute'
+		        ORDER BY sampled_at DESC LIMIT 1) AS bytes_1m,
+		    (SELECT sampled_at         FROM storage_samples
+		        WHERE sampled_at <= now() - interval '1 minute'
+		        ORDER BY sampled_at DESC LIMIT 1) AS ts_1m,
+		    (SELECT uncompressed_bytes FROM storage_samples
+		        WHERE sampled_at <= now() - interval '5 minutes'
+		        ORDER BY sampled_at DESC LIMIT 1) AS bytes_5m,
+		    (SELECT sampled_at         FROM storage_samples
+		        WHERE sampled_at <= now() - interval '5 minutes'
+		        ORDER BY sampled_at DESC LIMIT 1) AS ts_5m,
+		    (SELECT uncompressed_bytes FROM storage_samples
+		        WHERE sampled_at <= now() - interval '15 minutes'
+		        ORDER BY sampled_at DESC LIMIT 1) AS bytes_15m,
+		    (SELECT sampled_at         FROM storage_samples
+		        WHERE sampled_at <= now() - interval '15 minutes'
+		        ORDER BY sampled_at DESC LIMIT 1) AS ts_15m
+	`).Scan(
+		&rr.LatestBytes, &rr.LatestTs,
+		&rr.Bytes1mAgo, &rr.Ts1mAgo,
+		&rr.Bytes5mAgo, &rr.Ts5mAgo,
+		&rr.Bytes15mAgo, &rr.Ts15mAgo,
+	); err != nil {
+		http.Error(w, err.Error(), http.StatusInternalServerError)
+		return
+	}
+
+	rate := func(latest, prev sql.NullInt64, latestTs, prevTs sql.NullTime) (float64, bool) {
+		if !latest.Valid || !prev.Valid || !latestTs.Valid || !prevTs.Valid {
+			return 0, false
+		}
+		dt := latestTs.Time.Sub(prevTs.Time).Seconds()
+		if dt <= 0 {
+			return 0, false
+		}
+		db := float64(latest.Int64 - prev.Int64)
+		if db < 0 {
+			db = 0 // can happen across CedarDB compactions
+		}
+		return db / dt, true
+	}
+
+	r1, ok1 := rate(rr.LatestBytes, rr.Bytes1mAgo, rr.LatestTs, rr.Ts1mAgo)
+	r5, ok5 := rate(rr.LatestBytes, rr.Bytes5mAgo, rr.LatestTs, rr.Ts5mAgo)
+	r15, ok15 := rate(rr.LatestBytes, rr.Bytes15mAgo, rr.LatestTs, rr.Ts15mAgo)
+
+	out := map[string]any{
+		"target_bytes_per_sec": targetBytesPerSec,
+	}
+	if rr.LatestBytes.Valid {
+		out["latest_bytes"] = rr.LatestBytes.Int64
+	}
+	if rr.LatestTs.Valid {
+		out["latest_ts"] = rr.LatestTs.Time.Format(time.RFC3339)
+	}
+	if ok1 {
+		out["rate_1m_bps"] = r1
+	}
+	if ok5 {
+		out["rate_5m_bps"] = r5
+	}
+	if ok15 {
+		out["rate_15m_bps"] = r15
+	}
+
+	w.Header().Set("Content-Type", "application/json")
+	_ = json.NewEncoder(w).Encode(out)
+}
+
+// --------------------------------------------------------------------- utils
+
+func fmtDuration(sec int) string {
+	if sec < 60 {
+		return strconv.Itoa(sec) + "s"
+	}
+	return strconv.Itoa(sec/60) + "m" + strconv.Itoa(sec%60) + "s"
+}
+
+func absInt(x int) int {
+	if x < 0 {
+		return -x
+	}
+	return x
+}
diff --git a/homeguard-iot/internal/web/static/cedardb-logo.svg b/homeguard-iot/internal/web/static/cedardb-logo.svg
new file mode 100644
index 0000000..192623c
--- /dev/null
+++ b/homeguard-iot/internal/web/static/cedardb-logo.svg
@@ -0,0 +1,10 @@
+<svg width="701" height="166" viewBox="0 0 701 166" fill="none" xmlns="http://www.w3.org/2000/svg">
+<path d="M8.12172 112.546C29.787 107.268 50.3235 96.1681 67.5566 80.4221C81.1261 68.0236 92.6483 52.7746 101.358 35.6885M8.12172 112.546C4.98006 106.426 3.20899 99.4162 3.01793 92.3447C2.88619 87.4693 3.49245 82.5984 4.43502 77.8384C5.26796 73.6321 6.36438 69.4924 7.71344 65.4606M8.12172 112.546C7.82961 118.663 9.0022 124.862 11.4819 130.309C15.4829 139.099 22.5997 145.551 30.036 150.757M101.358 35.6885C101.809 34.985 102.425 34.4184 103.126 34.0628C103.828 33.7071 104.612 33.5633 105.377 33.6503C106.135 33.7366 106.867 34.0462 107.518 34.4955C108.168 34.9448 108.74 35.5313 109.238 36.183C110.235 37.4865 110.939 39.0358 111.685 40.5384C113.715 44.6275 116.115 48.4669 118.222 52.5063C120.328 56.5457 122.157 60.8479 122.899 65.4606M101.358 35.6885C99.2115 28.2343 96.4329 21.0128 93.0726 14.1548C92.2297 12.4347 91.3432 10.7261 90.2039 9.23991C89.0645 7.75377 87.6502 6.48915 86.0028 5.85539C84.7971 5.39154 83.487 5.27885 82.233 5.53101C80.979 5.78318 79.7838 6.39972 78.7938 7.30511M30.036 150.757C47.6585 137.582 66.0292 125.686 84.9977 115.165C94.3728 109.965 104.064 104.973 111.718 96.9449C115.545 92.9308 118.815 88.1674 120.897 82.7569C122.979 77.3464 123.833 71.2653 122.899 65.4606M30.036 150.757C34.4461 153.844 39.0548 156.6 43.9119 158.66C57.5565 164.446 73.0564 164.447 86.7007 158.66C91.5577 156.599 96.1662 153.843 100.577 150.757M122.899 65.4606C124.248 69.4925 125.344 73.6322 126.177 77.8384C127.12 82.5984 127.728 87.4694 127.595 92.3448C127.402 99.4187 125.623 106.43 122.47 112.546M122.899 65.4606C109.466 63.7157 96.3987 58.4759 84.9977 50.2633C70.6194 39.9059 58.9547 24.8029 51.819 7.30511M78.7938 7.30511C76.6751 5.833 74.3665 4.71028 71.9638 3.98364C67.6272 2.67212 62.9855 2.67212 58.6488 3.98364C56.2462 4.71028 53.9377 5.833 51.819 7.30511M78.7938 7.30511C71.6581 24.8029 59.9932 39.9059 45.6149 50.2633C34.2138 58.476 21.1462 63.7157 7.71344 65.4606M51.819 7.30511C50.829 6.39965 49.6337 5.78306 48.3798 5.53087C47.1258 5.2787 45.8158 5.39152 44.6101 5.85539C42.9628 6.48927 41.5485 7.75394 40.4092 9.24013C39.2698 10.7263 38.3834 12.4349 37.5402 14.1548C34.1772 21.0145 31.3919 28.2355 29.2341 35.6885M29.2341 35.6885C28.7853 34.9881 28.1721 34.4234 27.4744 34.0678C26.7767 33.7123 25.9962 33.5668 25.2344 33.6503C24.4742 33.7335 23.7399 34.0415 23.0872 34.4899C22.4346 34.9383 21.8614 35.5247 21.3611 36.1768C20.3606 37.4812 19.6547 39.0331 18.9071 40.5383C16.8771 44.6253 14.4795 48.465 12.3774 52.5053C10.2753 56.5456 8.4518 60.8488 7.71344 65.4606M29.2341 35.6885C37.9525 52.7746 49.4815 68.023 63.0562 80.4221C80.2863 96.16 100.814 107.259 122.47 112.546M7.71344 65.4606C6.78406 71.2654 7.64092 77.3446 9.72413 82.7537C11.8074 88.1628 15.0767 92.9252 18.9027 96.9392C26.5549 104.967 36.2424 109.963 45.6149 115.165C64.5807 125.692 82.9511 137.588 100.577 150.757M100.577 150.757C108.014 145.551 115.135 139.102 119.131 130.309C121.607 124.861 122.772 118.661 122.47 112.546" stroke="#FB773E" stroke-width="5.76"/>
+<path d="M628.525 135.966V32.6001H664.663C671.864 32.6001 677.803 33.8451 682.48 36.335C687.157 38.7913 690.639 42.1056 692.927 46.2779C695.215 50.4166 696.359 55.0095 696.359 60.0566C696.359 64.4981 695.569 68.1657 693.987 71.0595C692.439 73.9532 690.387 76.2412 687.83 77.9236C685.306 79.606 682.564 80.8509 679.603 81.6585V82.6679C682.766 82.8698 685.945 83.9802 689.142 85.999C692.338 88.0179 695.013 90.9116 697.167 94.6802C699.32 98.4487 700.397 103.058 700.397 108.509C700.397 113.691 699.219 118.351 696.864 122.49C694.509 126.629 690.791 129.909 685.71 132.332C680.629 134.755 674.017 135.966 665.874 135.966H628.525ZM641.042 124.862H665.874C674.051 124.862 679.855 123.281 683.287 120.118C686.753 116.921 688.486 113.052 688.486 108.509C688.486 105.01 687.594 101.78 685.811 98.8188C684.027 95.8242 681.487 93.4352 678.189 91.6519C674.892 89.8349 670.989 88.9264 666.48 88.9264H641.042V124.862ZM641.042 78.0245H664.259C668.028 78.0245 671.426 77.2843 674.455 75.8038C677.517 74.3233 679.939 72.2371 681.722 69.5453C683.539 66.8535 684.448 63.6906 684.448 60.0566C684.448 55.5142 682.867 51.6615 679.704 48.4987C676.541 45.3021 671.527 43.7038 664.663 43.7038H641.042V78.0245Z" fill="#FFFFFF"/>
+<path d="M567.745 135.966H535.847V32.6001H569.158C579.185 32.6001 587.765 34.6694 594.898 38.8081C602.032 42.9131 607.5 48.8183 611.302 56.5236C615.104 64.1953 617.005 73.3811 617.005 84.0811C617.005 94.8484 615.087 104.118 611.251 111.891C607.415 119.63 601.83 125.586 594.495 129.758C587.159 133.897 578.243 135.966 567.745 135.966ZM548.364 124.862H566.937C575.484 124.862 582.567 123.213 588.186 119.916C593.805 116.618 597.994 111.925 600.753 105.834C603.512 99.7441 604.892 92.4931 604.892 84.0811C604.892 75.7365 603.529 68.5527 600.804 62.5298C598.078 56.4732 594.007 51.8298 588.59 48.5996C583.172 45.3358 576.426 43.7038 568.35 43.7038H548.364V124.862Z" fill="#FFFFFF"/>
+<path d="M485.564 135.985V58.4608H497.072V70.1702H497.88C499.293 66.3344 501.85 63.222 505.551 60.833C509.252 58.444 513.425 57.2495 518.068 57.2495C518.943 57.2495 520.037 57.2663 521.349 57.3C522.661 57.3336 523.654 57.3841 524.327 57.4514V69.5646C523.923 69.4636 522.998 69.3122 521.551 69.1103C520.138 68.8748 518.64 68.757 517.059 68.757C513.29 68.757 509.925 69.5478 506.964 71.1292C504.037 72.677 501.715 74.8304 499.999 77.5896C498.317 80.315 497.476 83.4275 497.476 86.9268V135.985H485.564Z" fill="#FFFFFF"/>
+<path d="M435.554 137.591C430.641 137.591 426.183 136.666 422.179 134.815C418.174 132.931 414.995 130.222 412.639 126.689C410.284 123.122 409.106 118.816 409.106 113.768C409.106 109.327 409.981 105.727 411.731 102.967C413.481 100.175 415.819 97.9876 418.747 96.4062C421.674 94.8247 424.904 93.6471 428.437 92.8732C432.004 92.0656 435.587 91.4263 439.187 90.9553C443.898 90.3496 447.717 89.8954 450.645 89.5925C453.606 89.256 455.759 88.7009 457.105 87.927C458.484 87.1531 459.174 85.8072 459.174 83.8892V83.4855C459.174 78.5056 457.812 74.6361 455.086 71.877C452.394 69.1179 448.306 67.7383 442.821 67.7383C437.135 67.7383 432.677 68.9833 429.446 71.4732C426.216 73.9632 423.945 76.6213 422.633 79.4477L411.327 75.41C413.346 70.6993 416.038 67.0317 419.403 64.4072C422.801 61.749 426.502 59.8984 430.506 58.8553C434.544 57.7786 438.515 57.2402 442.418 57.2402C444.908 57.2402 447.768 57.5431 450.998 58.1487C454.262 58.7207 457.408 59.9152 460.436 61.7322C463.498 63.5492 466.038 66.2915 468.057 69.9591C470.076 73.6267 471.086 78.5392 471.086 84.6968V135.774H459.174V125.276H458.569C457.761 126.958 456.415 128.758 454.531 130.676C452.647 132.594 450.14 134.226 447.011 135.572C443.881 136.918 440.062 137.591 435.554 137.591ZM437.371 126.891C442.081 126.891 446.052 125.966 449.282 124.115C452.546 122.264 455.002 119.875 456.651 116.948C458.333 114.021 459.174 110.942 459.174 107.712V96.81C458.67 97.4156 457.559 97.9708 455.843 98.4755C454.161 98.9466 452.209 99.3672 449.988 99.7373C447.801 100.074 445.665 100.377 443.579 100.646C441.526 100.881 439.86 101.083 438.582 101.251C435.486 101.655 432.593 102.311 429.901 103.22C427.243 104.095 425.089 105.424 423.44 107.207C421.825 108.957 421.018 111.346 421.018 114.374C421.018 118.513 422.549 121.642 425.611 123.762C428.706 125.848 432.626 126.891 437.371 126.891Z" fill="#FFFFFF"/>
+<path d="M362.054 137.581C355.594 137.581 349.89 135.949 344.944 132.685C339.998 129.388 336.128 124.744 333.336 118.755C330.543 112.732 329.146 105.616 329.146 97.4056C329.146 89.2629 330.543 82.1969 333.336 76.2076C336.128 70.2183 340.015 65.5917 344.995 62.3279C349.974 59.064 355.728 57.4321 362.256 57.4321C367.303 57.4321 371.29 58.2733 374.218 59.9557C377.179 61.6044 379.433 63.4887 380.981 65.6085C382.562 67.6947 383.79 69.4107 384.665 70.7566H385.675V32.6001H397.586V135.966H386.078V124.055H384.665C383.79 125.468 382.545 127.251 380.93 129.405C379.315 131.524 377.01 133.426 374.016 135.108C371.021 136.757 367.034 137.581 362.054 137.581ZM363.669 126.881C368.447 126.881 372.485 125.636 375.782 123.146C379.08 120.623 381.586 117.14 383.303 112.699C385.019 108.223 385.877 103.058 385.877 97.2037C385.877 91.4163 385.035 86.3523 383.353 82.0118C381.671 77.6376 379.181 74.2392 375.883 71.8165C372.586 69.3602 368.514 68.1321 363.669 68.1321C358.622 68.1321 354.416 69.4275 351.051 72.0184C347.72 74.5756 345.213 78.0582 343.531 82.466C341.882 86.8402 341.058 91.7528 341.058 97.2037C341.058 102.722 341.899 107.735 343.581 112.244C345.297 116.719 347.821 120.286 351.152 122.944C354.517 125.569 358.689 126.881 363.669 126.881Z" fill="#FFFFFF"/>
+<path d="M291.287 137.389C283.817 137.389 277.374 135.74 271.956 132.443C266.573 129.112 262.417 124.468 259.49 118.513C256.596 112.523 255.149 105.558 255.149 97.6175C255.149 89.6766 256.596 82.6779 259.49 76.6213C262.417 70.5311 266.489 65.7868 271.704 62.3883C276.953 58.9563 283.077 57.2402 290.076 57.2402C294.113 57.2402 298.101 57.9132 302.038 59.2591C305.974 60.605 309.558 62.7921 312.788 65.8204C316.018 68.8151 318.592 72.7855 320.51 77.7317C322.428 82.6779 323.387 88.7682 323.387 96.0024V101.05H263.629V90.7534H311.274C311.274 86.3792 310.399 82.476 308.649 79.044C306.933 75.6119 304.477 72.9033 301.28 70.918C298.118 68.9328 294.383 67.9402 290.076 67.9402C285.331 67.9402 281.226 69.1179 277.761 71.4732C274.329 73.7949 271.687 76.8232 269.837 80.5581C267.986 84.293 267.061 88.2971 267.061 92.5703V99.4345C267.061 105.289 268.07 110.252 270.089 114.324C272.142 118.361 274.985 121.44 278.619 123.56C282.253 125.646 286.475 126.689 291.287 126.689C294.416 126.689 297.243 126.252 299.766 125.377C302.324 124.468 304.527 123.122 306.378 121.339C308.229 119.522 309.659 117.268 310.668 114.576L322.176 117.806C320.964 121.709 318.929 125.141 316.069 128.102C313.209 131.03 309.676 133.318 305.47 134.966C301.264 136.582 296.536 137.389 291.287 137.389Z" fill="#FFFFFF"/>
+<path d="M249.39 64.715H236.873C236.133 61.1147 234.837 57.9518 232.986 55.2264C231.169 52.5009 228.949 50.2129 226.324 48.3622C223.733 46.478 220.856 45.0648 217.694 44.1226C214.531 43.1805 211.233 42.7094 207.801 42.7094C201.543 42.7094 195.873 44.2909 190.792 47.4537C185.745 50.6166 181.724 55.2768 178.729 61.4344C175.768 67.5919 174.288 75.1458 174.288 84.0961C174.288 93.0464 175.768 100.6 178.729 106.758C181.724 112.915 185.745 117.576 190.792 120.738C195.873 123.901 201.543 125.483 207.801 125.483C211.233 125.483 214.531 125.012 217.694 124.07C220.856 123.127 223.733 121.731 226.324 119.88C228.949 117.996 231.169 115.691 232.986 112.966C234.837 110.207 236.133 107.044 236.873 103.477H249.39C248.448 108.76 246.732 113.487 244.242 117.66C241.752 121.832 238.656 125.382 234.955 128.309C231.254 131.203 227.098 133.407 222.488 134.921C217.912 136.435 213.017 137.192 207.801 137.192C198.985 137.192 191.146 135.039 184.281 130.732C177.417 126.425 172.017 120.301 168.08 112.36C164.143 104.419 162.175 94.998 162.175 84.0961C162.175 73.1943 164.143 63.7729 168.08 55.832C172.017 47.8912 177.417 41.7673 184.281 37.4604C191.146 33.1535 198.985 31 207.801 31C213.017 31 217.912 31.7571 222.488 33.2712C227.098 34.7854 231.254 37.0061 234.955 39.9335C238.656 42.8272 241.752 46.3602 244.242 50.5325C246.732 54.6712 248.448 59.3987 249.39 64.715Z" fill="#FFFFFF"/>
+</svg>
diff --git a/homeguard-iot/internal/web/static/queries.html b/homeguard-iot/internal/web/static/queries.html
new file mode 100644
index 0000000..a3dceca
--- /dev/null
+++ b/homeguard-iot/internal/web/static/queries.html
@@ -0,0 +1,865 @@
+<!doctype html>
+<html lang="en">
+<head>
+<meta charset="utf-8">
+<title>HomeGuard IoT — SQL queries reference</title>
+<link rel="stylesheet"
+      href="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/themes/prism-tomorrow.min.css">
+<style>
+  :root {
+    --bg: #0d1117;
+    --panel: #161b22;
+    --panel-2: #1c2128;
+    --line: #30363d;
+    --text: #e6edf3;
+    --muted: #8b949e;
+    --accent: #FB773E;
+    --green: #1ee2a2;
+    --red:   #ff5577;
+    --yellow:#f4d04e;
+    --blue:  #3aa6ff;
+    --purple:#9966ff;
+  }
+  * { box-sizing: border-box; }
+  html, body { margin: 0; }
+  body {
+    font-family: ui-monospace, "SF Mono", Menlo, monospace;
+    background: var(--bg);
+    color: var(--text);
+    line-height: 1.5;
+    padding-bottom: 60px;
+  }
+
+  /* --- Header ---------------------------------------------------------- */
+  header {
+    padding: 14px 28px;
+    border-bottom: 1px solid var(--line);
+    display: flex;
+    justify-content: space-between;
+    align-items: center;
+    background: var(--panel);
+    position: sticky;
+    top: 0;
+    z-index: 10;
+  }
+  header h1 {
+    font-size: 16px;
+    margin: 0;
+    font-weight: 600;
+    letter-spacing: 0.04em;
+    display: flex;
+    align-items: center;
+    gap: 14px;
+  }
+  header h1 .logo { height: 28px; }
+  header h1 .sep  { color: var(--muted); font-weight: 400; }
+  header .back    { font-size: 12px; color: var(--accent); text-decoration: none; border-bottom: 1px dotted rgba(251,119,62,0.5); }
+  header .back:hover { border-bottom-color: var(--accent); }
+
+  /* --- Intro / TOC ----------------------------------------------------- */
+  main {
+    max-width: 1180px;
+    margin: 0 auto;
+    padding: 24px 28px;
+  }
+  .intro {
+    color: var(--muted);
+    font-size: 13px;
+    margin: 6px 0 26px;
+    max-width: 880px;
+  }
+  .intro strong { color: var(--text); }
+  .intro a { color: var(--accent); }
+
+  .toc {
+    border: 1px solid var(--line);
+    background: var(--panel);
+    border-radius: 8px;
+    padding: 14px 18px;
+    margin-bottom: 32px;
+    font-size: 12px;
+    display: grid;
+    grid-template-columns: repeat(2, 1fr);
+    gap: 8px 24px;
+  }
+  .toc h3 {
+    grid-column: 1 / -1;
+    font-size: 11px;
+    margin: 0 0 4px;
+    color: var(--muted);
+    text-transform: uppercase;
+    letter-spacing: 0.08em;
+  }
+  .toc a { color: var(--text); text-decoration: none; }
+  .toc a:hover { color: var(--accent); }
+  .toc .sec { color: var(--muted); margin-right: 6px; font-variant-numeric: tabular-nums; }
+
+  /* --- Section headers ------------------------------------------------- */
+  h2.section {
+    font-size: 13px;
+    text-transform: uppercase;
+    letter-spacing: 0.08em;
+    color: var(--accent);
+    border-bottom: 1px solid var(--line);
+    padding-bottom: 6px;
+    margin: 38px 0 16px;
+  }
+  h2.section .count { color: var(--muted); font-weight: 400; margin-left: 8px; font-size: 11px; }
+
+  /* --- Query card ------------------------------------------------------ */
+  .q {
+    background: var(--panel);
+    border: 1px solid var(--line);
+    border-radius: 8px;
+    padding: 16px 18px;
+    margin-bottom: 20px;
+  }
+  .q-hdr {
+    display: flex;
+    justify-content: space-between;
+    align-items: baseline;
+    flex-wrap: wrap;
+    gap: 8px 16px;
+    margin-bottom: 10px;
+  }
+  .q-hdr h3 {
+    margin: 0;
+    font-size: 14px;
+    font-weight: 700;
+  }
+  .q-hdr h3 a { color: inherit; text-decoration: none; }
+  .q-hdr h3 a:hover { color: var(--accent); }
+  .q-meta {
+    display: flex;
+    gap: 10px;
+    flex-wrap: wrap;
+    font-size: 11px;
+    color: var(--muted);
+  }
+  .q-meta .pill {
+    display: inline-block;
+    padding: 2px 8px;
+    border-radius: 10px;
+    border: 1px solid var(--line);
+    background: var(--panel-2);
+  }
+  .pill.freq-hot     { color: var(--red);    border-color: rgba(255,85,119,0.4); }
+  .pill.freq-fast    { color: var(--yellow); border-color: rgba(244,208,78,0.4); }
+  .pill.freq-medium  { color: var(--blue);   border-color: rgba(58,166,255,0.4); }
+  .pill.freq-slow    { color: var(--green);  border-color: rgba(30,226,162,0.4); }
+  .pill.freq-once    { color: var(--purple); border-color: rgba(153,102,255,0.4); }
+  .pill.src          { color: var(--muted); }
+
+  .q-explain {
+    font-size: 12.5px;
+    color: var(--muted);
+    margin: 0 0 10px;
+  }
+  .q-explain strong { color: var(--text); }
+
+  pre[class*="language-"] {
+    margin: 0;
+    border-radius: 6px;
+    font-size: 12.5px !important;
+    background: #0d1117 !important;
+    border: 1px solid var(--line);
+  }
+  code[class*="language-"], pre[class*="language-"] {
+    font-family: ui-monospace, "SF Mono", Menlo, monospace !important;
+  }
+
+  /* --- Legend ---------------------------------------------------------- */
+  .legend {
+    display: flex;
+    flex-wrap: wrap;
+    gap: 6px 14px;
+    margin: 0 0 22px;
+    font-size: 11px;
+    color: var(--muted);
+  }
+  .legend .pill { font-size: 11px; }
+</style>
+</head>
+<body>
+
+<header>
+  <h1>
+    <img src="/static/cedardb-logo.svg" alt="CedarDB" class="logo">
+    <span class="sep">&nbsp;</span>
+    HOMEGUARD&nbsp;IoT&nbsp;·&nbsp;SQL&nbsp;QUERIES
+  </h1>
+  <a class="back" href="/">&larr; back to operator console</a>
+</header>
+
+<main>
+
+<p class="intro">
+  Every SQL statement the simulator and dashboard run against CedarDB, in
+  one place. The interesting bit isn't any individual query — it's that
+  <strong>all of these run concurrently against the same instance</strong>:
+  N writer goroutines are <code>CopyFrom</code>-ing into <code>events</code>
+  at up to ~500 K events/sec (≈ 3 TB/day uncompressed) while four
+  different read paths (active-alerts queue, live event stream, customer
+  drill-down, ingest stats) are joining across the same hot table and the
+  normalized dimensions at 200 ms – 2 s refresh cadences. The whole pitch
+  is collapsing the BigQuery + Pub/Sub-plus-KV split onto one database
+  with real joins and no replication lag.
+</p>
+<p class="intro">
+  <strong>Tuning the cadences.</strong> Every "refresh N s" pill below is
+  an env-var-driven default — set <code>HG_ALERTS_REFRESH</code>,
+  <code>HG_DRILLDOWN_REFRESH</code>, <code>HG_STATS_REFRESH</code>,
+  <code>HG_STORAGE_REFRESH</code>, <code>HG_SSE_INTERVAL</code>
+  (on the server container), or <code>HG_STORAGE_SAMPLER_INTERVAL</code>
+  (on the simulator container) to any Go duration string (e.g.
+  <code>"500ms"</code>, <code>"2s"</code>, <code>"30s"</code>) to dial
+  the read pressure CedarDB has to serve next to the ingest stream.
+</p>
+
+<div class="legend">
+  <span class="pill freq-hot">hot · every tick (10 Hz)</span>
+  <span class="pill freq-fast">fast · 200 ms – 1 s</span>
+  <span class="pill freq-medium">medium · 1 – 2 s</span>
+  <span class="pill freq-slow">slow · 2 s+ background</span>
+  <span class="pill freq-once">once · startup</span>
+</div>
+
+<nav class="toc">
+  <h3>Contents</h3>
+  <div>
+    <a href="#hot"><span class="sec">1.</span>Hot write path</a><br>
+    <a href="#dashboard"><span class="sec">2.</span>Dashboard reads</a><br>
+    <a href="#background"><span class="sec">3.</span>Background maintenance</a>
+  </div>
+  <div>
+    <a href="#startup"><span class="sec">4.</span>Startup &amp; bootstrap</a><br>
+    <a href="#reset"><span class="sec">5.</span>Schema reset</a>
+  </div>
+</nav>
+
+<!-- ================================================================== -->
+<h2 class="section" id="hot">1. Hot write path<span class="count">2 queries — drives the firehose</span></h2>
+
+<div class="q" id="copy-events">
+  <div class="q-hdr">
+    <h3><a href="#copy-events">COPY events (fan-in, single ingestor)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-hot">~20 flushes/s · ≤ 10K rows each</span>
+      <span class="pill">up to ~500 K ev/s</span>
+      <span class="pill src">internal/sim/simulator.go · ingestorLoop()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    The firehose. <code>-writers</code> producer goroutines partition
+    the device fleet, build row batches, and push them onto a buffered
+    channel. <strong>A single ingestor goroutine</strong> drains the
+    channel, coalesces up to 10 000 rows (or every 50 ms, whichever
+    comes first), and runs one binary <code>CopyFrom</code>. This
+    sidesteps CedarDB's <em>"cannot start bulk operation until previous
+    bulk operation has become globally visible"</em> error (SQLSTATE
+    40P01) that fires when more than one goroutine COPYs in parallel,
+    while still keeping the binary protocol's ~3× wire and parsing
+    advantage over multi-row INSERT under simple-query mode.
+    <code>event_id</code> is an <code>atomic.Int64</code> seeded from
+    <code>MAX(event_id)</code> at startup, so producers share the id
+    space without locking — the column is plain <code>BIGINT</code>,
+    not <code>BIGSERIAL</code> (CedarDB rejects COPY for sequence-
+    defaulted columns with "unable to cast from void to bigint").
+  </p>
+<pre><code class="language-sql">-- One round trip per ingestor flush (≤ 10K rows, or every 50ms).
+COPY events (
+    event_id, device_id, household_id, ts,
+    kind, severity, value, battery_pct, rssi_dbm
+) FROM STDIN BINARY;
+-- ...one binary tuple per buffered event...</code></pre>
+  <p class="q-explain" style="margin-top:8px">
+    On startup, the simulator runs one read to seed the id counter:
+  </p>
+<pre><code class="language-sql">SELECT COALESCE(MAX(event_id), 0) FROM events;</code></pre>
+</div>
+
+<div class="q" id="insert-alert">
+  <div class="q-hdr">
+    <h3><a href="#insert-alert">Insert one alert</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-fast">~10–30/s</span>
+      <span class="pill">1 row/call</span>
+      <span class="pill src">internal/sim/simulator.go · fireAlert()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Called inline from <code>emitTick</code> whenever <code>EvaluateAlert</code>
+    in <code>rules.go</code> promotes a triggered/tamper event to an
+    operator-actionable alert. <code>status</code> starts <code>'active'</code>; the
+    background <code>resolveAlertsLoop</code> later marks it resolved or
+    false_alarm.
+  </p>
+<pre><code class="language-sql">INSERT INTO alerts (
+    household_id,
+    triggered_event_id,
+    raised_at,
+    severity,
+    status,
+    detail
+)
+VALUES ($1, 0, now(), $2, 'active', $3);</code></pre>
+</div>
+
+<!-- ================================================================== -->
+<h2 class="section" id="dashboard">2. Dashboard reads<span class="count">9 queries — the demo story</span></h2>
+
+<div class="q" id="active-alerts">
+  <div class="q-hdr">
+    <h3><a href="#active-alerts">Active-alerts queue (SLA-aware)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">every 1 s</span>
+      <span class="pill">joins 4 tables</span>
+      <span class="pill src">internal/web/server.go · handleAlerts()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    The top panel of the dashboard. <strong>Joins alerts × households × plans
+    × regions</strong> so the SLA countdown
+    (<code>plan.sla_seconds − age</code>) and the dispatch-center attribution
+    are computed by the database, not the app. Ordered by severity DESC
+    then raised_at ASC — escalations first, then oldest-of-equal-severity.
+  </p>
+<pre><code class="language-sql">SELECT a.alert_id,
+       a.severity,
+       a.detail,
+       EXTRACT(EPOCH FROM (now() - a.raised_at))::int AS age_s,
+       p.sla_seconds,
+       p.sla_seconds
+         - EXTRACT(EPOCH FROM (now() - a.raised_at))::int AS sla_remaining,
+       h.household_id,
+       h.address_hash,
+       p.name AS plan_name,
+       r.name AS region_name,
+       r.dispatch_center
+FROM   alerts     a
+JOIN   households h ON h.household_id = a.household_id
+JOIN   plans      p ON p.plan_id      = h.plan_id
+JOIN   regions    r ON r.region_id    = h.region_id
+WHERE  a.status = 'active'
+ORDER  BY a.severity DESC, a.raised_at ASC
+LIMIT  25;</code></pre>
+</div>
+
+<div class="q" id="event-stream">
+  <div class="q-hdr">
+    <h3><a href="#event-stream">Live event stream (non-heartbeat)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-fast">SSE every 200 ms</span>
+      <span class="pill">joins 5 tables</span>
+      <span class="pill src">internal/web/server.go · pushEventStream()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Bottom-left panel, the ticker tape. <strong>Joins events × devices ×
+    device_types × households × regions</strong> so each row renders with a
+    human-readable device code, location and region. The
+    <code>WHERE&nbsp;e.kind&nbsp;&gt;&nbsp;0</code> filter drops heartbeats and
+    rides the <code>events_kind_ts_idx</code> index.
+  </p>
+<pre><code class="language-sql">SELECT e.event_id,
+       e.ts,
+       e.household_id,
+       h.address_hash,
+       dt.code,
+       d.location,
+       e.kind,
+       e.severity,
+       COALESCE(e.battery_pct, -1),
+       r.name
+FROM   events       e
+JOIN   devices      d  ON d.device_id      = e.device_id
+JOIN   device_types dt ON dt.device_type_id = d.device_type_id
+JOIN   households   h  ON h.household_id   = e.household_id
+JOIN   regions      r  ON r.region_id      = h.region_id
+WHERE  e.kind &gt; 0
+ORDER  BY e.ts DESC
+LIMIT  25;</code></pre>
+</div>
+
+<div class="q" id="drill-pick">
+  <div class="q-hdr">
+    <h3><a href="#drill-pick">Drill-down: pick the household</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">every 2 s</span>
+      <span class="pill">first step of 3</span>
+      <span class="pill src">internal/web/server.go · handleDrilldown()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Bottom-right panel auto-rotates to whichever household has the
+    highest-severity currently-active alert. Severity DESC then raised_at
+    ASC mirrors the operator's actual triage order.
+  </p>
+<pre><code class="language-sql">SELECT a.household_id
+FROM   alerts a
+WHERE  a.status = 'active'
+ORDER  BY a.severity DESC, a.raised_at ASC
+LIMIT  1;</code></pre>
+</div>
+
+<div class="q" id="drill-fallback">
+  <div class="q-hdr">
+    <h3><a href="#drill-fallback">Drill-down: fallback household</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">when no active alerts</span>
+      <span class="pill">GROUP BY on hot table</span>
+      <span class="pill src">internal/web/server.go · handleDrilldown()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Runs only when the first query found no active alerts — picks the
+    chattiest household in the last 5 minutes so the drill-down panel never
+    sits empty during a demo. A small but real aggregate over the hot
+    table.
+  </p>
+<pre><code class="language-sql">SELECT household_id
+FROM   events
+WHERE  ts &gt; now() - interval '5 minutes'
+GROUP  BY household_id
+ORDER  BY COUNT(*) DESC
+LIMIT  1;</code></pre>
+</div>
+
+<div class="q" id="drill-meta">
+  <div class="q-hdr">
+    <h3><a href="#drill-meta">Drill-down: household header</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">every 2 s</span>
+      <span class="pill">3-way join</span>
+      <span class="pill src">internal/web/server.go · handleDrilldown()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Pulls the household's plan + region + dispatch center to render the
+    drill-down panel header (plan name, SLA seconds, armed state, region
+    DC).
+  </p>
+<pre><code class="language-sql">SELECT h.address_hash,
+       h.armed,
+       p.name,
+       p.sla_seconds,
+       r.name,
+       r.dispatch_center
+FROM   households h
+JOIN   plans      p ON p.plan_id   = h.plan_id
+JOIN   regions    r ON r.region_id = h.region_id
+WHERE  h.household_id = $1;</code></pre>
+</div>
+
+<div class="q" id="drill-events">
+  <div class="q-hdr">
+    <h3><a href="#drill-events">Drill-down: last 20 events</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">every 2 s</span>
+      <span class="pill">uses events_household_ts_idx</span>
+      <span class="pill src">internal/web/server.go · handleDrilldown()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    The "what's going on at this household right now" panel. Indexed on
+    <code>events (household_id, ts DESC)</code> so the LIMIT 20 returns
+    instantly even with the table at billion-row scale.
+  </p>
+<pre><code class="language-sql">SELECT e.ts,
+       dt.code,
+       d.location,
+       e.kind,
+       e.severity,
+       COALESCE(e.battery_pct, -1)
+FROM   events       e
+JOIN   devices      d  ON d.device_id       = e.device_id
+JOIN   device_types dt ON dt.device_type_id = d.device_type_id
+WHERE  e.household_id = $1
+ORDER  BY e.ts DESC
+LIMIT  20;</code></pre>
+</div>
+
+<div class="q" id="stats-events">
+  <div class="q-hdr">
+    <h3><a href="#stats-events">Footer: ingest rate + total events</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">every HG_STATS_REFRESH</span>
+      <span class="pill">storage_samples · ≤ a few hundred rows</span>
+      <span class="pill src">internal/web/server.go · handleStats()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    The two most recent <code>storage_samples</code> rows answer both
+    "how many rows are in events" and "how fast are they growing": the
+    table is just <code>(sampled_at, uncompressed_bytes)</code> pairs
+    the simulator writes every <code>HG_STORAGE_SAMPLER_INTERVAL</code>
+    from its in-process atomic counter, so the delta-over-time gives
+    you events/sec without ever touching <code>events</code>. Original
+    iterations of this query did <code>COUNT(*) FILTER (WHERE ts &gt;
+    now() - interval '1 second')</code> on events directly — but
+    without an <code>events (ts DESC)</code> index that's a full table
+    scan, which at billion-row scale takes long enough to make the
+    footer freeze. (The published <code>rows_per_sec</code> is a
+    sample-interval average rather than a strict 1-second window.)
+  </p>
+<pre><code class="language-sql">SELECT
+    (SELECT uncompressed_bytes FROM storage_samples
+         ORDER BY sampled_at DESC LIMIT 1) AS latest_bytes,
+    (SELECT sampled_at         FROM storage_samples
+         ORDER BY sampled_at DESC LIMIT 1) AS latest_ts,
+    (SELECT uncompressed_bytes FROM storage_samples
+         WHERE sampled_at &lt; (SELECT MAX(sampled_at) FROM storage_samples)
+         ORDER BY sampled_at DESC LIMIT 1) AS prior_bytes,
+    (SELECT sampled_at         FROM storage_samples
+         WHERE sampled_at &lt; (SELECT MAX(sampled_at) FROM storage_samples)
+         ORDER BY sampled_at DESC LIMIT 1) AS prior_ts;
+-- total_events  = latest_bytes / 48
+-- rows_per_sec  = (latest_bytes - prior_bytes) / dt / 48</code></pre>
+</div>
+
+<div class="q" id="stats-alerts">
+  <div class="q-hdr">
+    <h3><a href="#stats-alerts">Footer: active / total alerts</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">every 1 s</span>
+      <span class="pill">small table</span>
+      <span class="pill src">internal/web/server.go · handleStats()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Same shape as the events counter but over the alerts table.
+  </p>
+<pre><code class="language-sql">SELECT
+    COUNT(*) FILTER (WHERE status = 'active') AS active,
+    COUNT(*)                                  AS total
+FROM alerts;</code></pre>
+</div>
+
+<div class="q" id="storage-rates">
+  <div class="q-hdr">
+    <h3><a href="#storage-rates">Storage gauge: latest size + 1m / 5m / 15m rates</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-medium">every 1 s</span>
+      <span class="pill">8 subselects · one round trip</span>
+      <span class="pill src">internal/web/server.go · handleStorage()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Feeds the Storage Growth gauge. Picks the freshest sample and, for
+    each window, the freshest sample <em>at or before</em> the
+    window-start cutoff — handles uneven sampler timing without lying
+    about the elapsed seconds (the rate is computed in Go from the two
+    timestamps the query returns, not assumed). Reads only the
+    <code>storage_samples</code> table; the underlying compression view
+    is hit once every 5 s by the sampler, not here.
+  </p>
+<pre><code class="language-sql">SELECT
+    (SELECT uncompressed_bytes FROM storage_samples
+         ORDER BY sampled_at DESC LIMIT 1) AS latest_bytes,
+    (SELECT sampled_at         FROM storage_samples
+         ORDER BY sampled_at DESC LIMIT 1) AS latest_ts,
+
+    (SELECT uncompressed_bytes FROM storage_samples
+         WHERE sampled_at &lt;= now() - interval '1 minute'
+         ORDER BY sampled_at DESC LIMIT 1) AS bytes_1m,
+    (SELECT sampled_at         FROM storage_samples
+         WHERE sampled_at &lt;= now() - interval '1 minute'
+         ORDER BY sampled_at DESC LIMIT 1) AS ts_1m,
+
+    (SELECT uncompressed_bytes FROM storage_samples
+         WHERE sampled_at &lt;= now() - interval '5 minutes'
+         ORDER BY sampled_at DESC LIMIT 1) AS bytes_5m,
+    (SELECT sampled_at         FROM storage_samples
+         WHERE sampled_at &lt;= now() - interval '5 minutes'
+         ORDER BY sampled_at DESC LIMIT 1) AS ts_5m,
+
+    (SELECT uncompressed_bytes FROM storage_samples
+         WHERE sampled_at &lt;= now() - interval '15 minutes'
+         ORDER BY sampled_at DESC LIMIT 1) AS bytes_15m,
+    (SELECT sampled_at         FROM storage_samples
+         WHERE sampled_at &lt;= now() - interval '15 minutes'
+         ORDER BY sampled_at DESC LIMIT 1) AS ts_15m;</code></pre>
+</div>
+
+<!-- ================================================================== -->
+<h2 class="section" id="background">3. Background maintenance<span class="count">4 queries — keep the demo dynamic</span></h2>
+
+<div class="q" id="resolve-low">
+  <div class="q-hdr">
+    <h3><a href="#resolve-low">Resolve low-severity alerts (false alarm)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-slow">every 2 s</span>
+      <span class="pill">LIMIT 2000 per tick</span>
+      <span class="pill src">internal/sim/simulator.go · resolveAlertsLoop()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Severity 1–2 alerts age out as false_alarm. Without this the operator
+    queue would just grow forever during a long-running demo.
+  </p>
+<pre><code class="language-sql">UPDATE alerts
+SET status        = 'false_alarm',
+    resolved_at   = now(),
+    resolution_ms = (EXTRACT(EPOCH FROM (now() - raised_at)) * 1000)::int
+WHERE alert_id IN (
+    SELECT alert_id
+    FROM   alerts
+    WHERE  status = 'active' AND severity &lt;= 2
+    ORDER  BY raised_at ASC
+    LIMIT  2000
+);</code></pre>
+</div>
+
+<div class="q" id="resolve-high">
+  <div class="q-hdr">
+    <h3><a href="#resolve-high">Resolve high-severity alerts (dispatch)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-slow">every 2 s</span>
+      <span class="pill">LIMIT 600 per tick</span>
+      <span class="pill src">internal/sim/simulator.go · resolveAlertsLoop()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Severity 3+ alerts only get cleared after they've sat in the queue
+    for &gt; 20 seconds — gives the operator panel something to look at
+    before the simulated dispatch fires.
+  </p>
+<pre><code class="language-sql">UPDATE alerts
+SET status        = 'resolved',
+    resolved_at   = now(),
+    resolution_ms = (EXTRACT(EPOCH FROM (now() - raised_at)) * 1000)::int
+WHERE alert_id IN (
+    SELECT alert_id
+    FROM   alerts
+    WHERE  status = 'active' AND severity &gt;= 3
+      AND  raised_at &lt; now() - interval '20 seconds'
+    ORDER  BY raised_at ASC
+    LIMIT  600
+);</code></pre>
+</div>
+
+<div class="q" id="armed-shuffle">
+  <div class="q-hdr">
+    <h3><a href="#armed-shuffle">Shuffle households' armed state</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-slow">every ~10 s · batched</span>
+      <span class="pill">PK update · ~1 500 rows/round</span>
+      <span class="pill src">internal/sim/simulator.go · shuffleArmedState()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Flips ~5 % of households armed/disarmed each round, sent as a single
+    <code>pgx.Batch</code>. Keeps the rules engine outputs varying — door
+    opens flip between "routine" and "alarmable" without restarting the
+    sim.
+  </p>
+<pre><code class="language-sql">UPDATE households
+SET    armed = $1
+WHERE  household_id = $2;</code></pre>
+</div>
+
+<div class="q" id="storage-sample-insert">
+  <div class="q-hdr">
+    <h3><a href="#storage-sample-insert">Sampler: record table-size sample</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-slow">every 5 s</span>
+      <span class="pill">1 row/call · no read needed</span>
+      <span class="pill src">internal/sim/simulator.go · storageSamplerLoop()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    The size value is <code>48 × s.eventID.Load()</code> — the in-process
+    atomic counter the producers increment per generated row, no DB
+    query needed. Earlier iterations polled
+    <code>cedardb_compression_info</code> (laggy because it only
+    reflects rows packed into column-store blocks) and then
+    <code>COUNT(*) FROM events</code> (exact but increasingly expensive
+    as the table grows past a billion rows, and a likely cause of write
+    stalls because the long scan held an MVCC snapshot). The atomic
+    counter is exact for rows that producers <em>tried</em> to write —
+    diverges from the table count only by the channel queue plus the
+    in-flight COPY batch (≤ ~160 K rows ≈ 8 MB at the configured
+    backpressure ceiling). <code>sampled_at</code> is the table's PK
+    and is unique at the 5 s sampler cadence (microsecond resolution).
+  </p>
+<pre><code class="language-sql">INSERT INTO storage_samples (sampled_at, uncompressed_bytes)
+VALUES (now(), $1);</code></pre>
+</div>
+
+<!-- ================================================================== -->
+<h2 class="section" id="startup">4. Startup &amp; bootstrap<span class="count">5 queries — once per cold start</span></h2>
+
+<div class="q" id="schema-present">
+  <div class="q-hdr">
+    <h3><a href="#schema-present">Probe: is the schema present?</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-once">once</span>
+      <span class="pill">information_schema</span>
+      <span class="pill src">internal/db/schema.go · SchemaPresent()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Diagnostic log only — we always call <code>ApplySchema</code> anyway
+    because the file is idempotent (CREATE TABLE IF NOT EXISTS on
+    everything).
+  </p>
+<pre><code class="language-sql">SELECT (
+    SELECT COUNT(*) FROM information_schema.tables
+    WHERE  table_name = 'households'
+) = 1;</code></pre>
+</div>
+
+<div class="q" id="apply-schema">
+  <div class="q-hdr">
+    <h3><a href="#apply-schema">Apply schema (DDL, one stmt at a time)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-once">once</span>
+      <span class="pill">~14 statements</span>
+      <span class="pill src">internal/db/schema.go · ApplySchema() · schema.sql</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    The embedded <code>schema.sql</code> is split on <code>;</code> and each
+    statement is run individually under simple-query protocol — CedarDB
+    accepts DDL through that path but silently no-ops some of it under
+    extended Parse/Bind/Execute, so we force the simpler path.
+  </p>
+<pre><code class="language-sql">CREATE TABLE IF NOT EXISTS plans (
+    plan_id           INTEGER PRIMARY KEY,
+    name              TEXT NOT NULL,
+    monthly_price_usd NUMERIC(8, 2) NOT NULL,
+    sla_seconds       INTEGER NOT NULL
+);
+-- ... regions, device_types, households, devices, events, alerts ...
+-- ... plus the four supporting indexes (trimmed for the high-rate demo —
+--     every secondary index on events is write amplification at 500 K/s):
+CREATE INDEX IF NOT EXISTS events_household_ts_idx ON events (household_id, ts DESC);
+CREATE INDEX IF NOT EXISTS events_kind_ts_idx      ON events (kind, ts DESC);
+CREATE INDEX IF NOT EXISTS alerts_status_raised_idx
+    ON alerts (status, raised_at DESC);
+CREATE INDEX IF NOT EXISTS alerts_household_raised_idx
+    ON alerts (household_id, raised_at DESC);</code></pre>
+</div>
+
+<div class="q" id="upsert-dims">
+  <div class="q-hdr">
+    <h3><a href="#upsert-dims">Upsert dimensions (plans, regions, device_types)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-once">once · single batch</span>
+      <span class="pill">~25 rows total</span>
+      <span class="pill src">internal/sim/simulator.go · upsertDimensions()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    The three small static dimension tables are populated from
+    <code>catalog.go</code> using <code>INSERT ... ON CONFLICT DO UPDATE</code>
+    so the simulator is safe to restart against a hot database. All three
+    upserts ride a single <code>pgx.Batch</code>.
+  </p>
+<pre><code class="language-sql">-- plans
+INSERT INTO plans (plan_id, name, monthly_price_usd, sla_seconds)
+VALUES ($1, $2, $3, $4)
+ON CONFLICT (plan_id) DO UPDATE SET
+    name              = EXCLUDED.name,
+    monthly_price_usd = EXCLUDED.monthly_price_usd,
+    sla_seconds       = EXCLUDED.sla_seconds;
+
+-- regions
+INSERT INTO regions (region_id, name, dispatch_center, timezone)
+VALUES ($1, $2, $3, $4)
+ON CONFLICT (region_id) DO UPDATE SET
+    name            = EXCLUDED.name,
+    dispatch_center = EXCLUDED.dispatch_center,
+    timezone        = EXCLUDED.timezone;
+
+-- device_types
+INSERT INTO device_types (device_type_id, code, name, default_severity)
+VALUES ($1, $2, $3, $4)
+ON CONFLICT (device_type_id) DO UPDATE SET
+    code             = EXCLUDED.code,
+    name             = EXCLUDED.name,
+    default_severity = EXCLUDED.default_severity;</code></pre>
+</div>
+
+<div class="q" id="upsert-households">
+  <div class="q-hdr">
+    <h3><a href="#upsert-households">Upsert household fleet</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-once">once · batched 1 000/round</span>
+      <span class="pill">~30 000 rows default</span>
+      <span class="pill src">internal/sim/simulator.go · upsertHouseholds()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Synthesised fleet from <code>generateFleet()</code>. Run as batches of
+    1 000 INSERTs per <code>pgx.Batch.SendBatch</code> — simpler and almost
+    as fast as a staging-table COPY at this scale, idempotent on restart.
+  </p>
+<pre><code class="language-sql">INSERT INTO households (
+    household_id, plan_id, region_id, address_hash, armed
+)
+VALUES ($1, $2, $3, $4, $5)
+ON CONFLICT (household_id) DO UPDATE SET
+    plan_id   = EXCLUDED.plan_id,
+    region_id = EXCLUDED.region_id,
+    armed     = EXCLUDED.armed;</code></pre>
+</div>
+
+<div class="q" id="upsert-devices">
+  <div class="q-hdr">
+    <h3><a href="#upsert-devices">Upsert device fleet</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-once">once · batched 1 000/round</span>
+      <span class="pill">~300 000 rows default</span>
+      <span class="pill src">internal/sim/simulator.go · upsertDevices()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Same shape as the households upsert. ~10 devices per household by
+    default; total fleet is whatever <code>-households</code> ×
+    <code>-devices-per-household</code> lands on.
+  </p>
+<pre><code class="language-sql">INSERT INTO devices (
+    device_id, household_id, device_type_id, location, last_battery_pct
+)
+VALUES ($1, $2, $3, $4, $5)
+ON CONFLICT (device_id) DO UPDATE SET
+    household_id     = EXCLUDED.household_id,
+    device_type_id   = EXCLUDED.device_type_id,
+    location         = EXCLUDED.location,
+    last_battery_pct = EXCLUDED.last_battery_pct;</code></pre>
+</div>
+
+<!-- ================================================================== -->
+<h2 class="section" id="reset">5. Schema reset<span class="count">1 query · destructive · behind a flag</span></h2>
+
+<div class="q" id="reset-drops">
+  <div class="q-hdr">
+    <h3><a href="#reset-drops">DROP everything (reverse FK order)</a></h3>
+    <div class="q-meta">
+      <span class="pill freq-once">on -reset-schema</span>
+      <span class="pill">8 tables</span>
+      <span class="pill src">internal/db/schema.go · ResetSchema()</span>
+    </div>
+  </div>
+  <p class="q-explain">
+    Only runs when the simulator is started with <code>-reset-schema</code>.
+    Dropped in reverse-FK order so the CASCADE never has anything real to
+    do, then <code>ApplySchema</code> rebuilds.
+  </p>
+<pre><code class="language-sql">DROP TABLE IF EXISTS storage_samples CASCADE;
+DROP TABLE IF EXISTS alerts          CASCADE;
+DROP TABLE IF EXISTS events          CASCADE;
+DROP TABLE IF EXISTS devices         CASCADE;
+DROP TABLE IF EXISTS households      CASCADE;
+DROP TABLE IF EXISTS device_types    CASCADE;
+DROP TABLE IF EXISTS regions         CASCADE;
+DROP TABLE IF EXISTS plans           CASCADE;</code></pre>
+</div>
+
+</main>
+
+<script src="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/components/prism-core.min.js"></script>
+<script src="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/components/prism-sql.min.js"></script>
+</body>
+</html>
diff --git a/homeguard-iot/internal/web/templates/alerts.html b/homeguard-iot/internal/web/templates/alerts.html
new file mode 100644
index 0000000..8125cd3
--- /dev/null
+++ b/homeguard-iot/internal/web/templates/alerts.html
@@ -0,0 +1,32 @@
+{{if . }}
+<table class="alerts">
+  <thead>
+    <tr>
+      <th>Sev</th>
+      <th>Household</th>
+      <th>Plan</th>
+      <th>Region · Dispatch</th>
+      <th>Detail</th>
+      <th>Age</th>
+      <th>SLA</th>
+    </tr>
+  </thead>
+  <tbody>
+  {{range .}}
+    <tr class="sev{{.Severity}}">
+      <td class="sev">{{.Severity}}</td>
+      <td><span class="code">#{{.HouseholdID}}</span></td>
+      <td>{{.PlanName}}</td>
+      <td><span class="code">{{.RegionName}} · {{.Dispatch}}</span></td>
+      <td class="detail">{{.Detail}}</td>
+      <td class="sla">{{.AgeFmt}}</td>
+      <td class="sla{{if .Breached}} breached{{else if le .SLARemaining 10}} warn{{end}}">
+        {{if .Breached}}+{{.SLAFmt}} OVER{{else}}{{.SLAFmt}} left{{end}}
+      </td>
+    </tr>
+  {{end}}
+  </tbody>
+</table>
+{{else}}
+<p class="muted">no active alerts right now — system clean.</p>
+{{end}}
diff --git a/homeguard-iot/internal/web/templates/drilldown.html b/homeguard-iot/internal/web/templates/drilldown.html
new file mode 100644
index 0000000..5f09804
--- /dev/null
+++ b/homeguard-iot/internal/web/templates/drilldown.html
@@ -0,0 +1,25 @@
+<div class="drill-hdr">
+  <span class="accent">#{{.HouseholdID}}</span>
+  <span>addr&nbsp;<span class="muted">{{.Address}}</span></span>
+  <span>{{.Plan}}&nbsp;<span class="muted">(SLA&nbsp;{{.SLASec}}s)</span></span>
+  <span>{{.Region}}&nbsp;·&nbsp;{{.Dispatch}}</span>
+  <span class="{{if .Armed}}armed{{else}}disarmed{{end}}">
+    {{if .Armed}}● ARMED{{else}}○ disarmed{{end}}
+  </span>
+</div>
+{{if .Events }}
+<table class="drill">
+  <tbody>
+  {{range .Events}}
+    <tr class="{{if ge .Severity 4}}sev-high{{end}}">
+      <td class="ts">{{.Ts}}</td>
+      <td class="dev">{{.Code}}</td>
+      <td class="where">{{.Location}}</td>
+      <td>{{if eq .Kind 0}}heartbeat{{else if eq .Kind 1}}TRIGGERED{{else if eq .Kind 2}}battery low{{else if eq .Kind 3}}offline{{else if eq .Kind 4}}TAMPER{{else}}?{{end}}{{if gt .Severity 0}} · sev {{.Severity}}{{end}}</td>
+    </tr>
+  {{end}}
+  </tbody>
+</table>
+{{else}}
+<p class="muted">no recent events for this household.</p>
+{{end}}
diff --git a/homeguard-iot/internal/web/templates/index.html b/homeguard-iot/internal/web/templates/index.html
new file mode 100644
index 0000000..f76fe4e
--- /dev/null
+++ b/homeguard-iot/internal/web/templates/index.html
@@ -0,0 +1,467 @@
+<!doctype html>
+<html lang="en">
+<head>
+<meta charset="utf-8">
+<title>HomeGuard IoT — CedarDB Operator Console</title>
+<script src="https://unpkg.com/htmx.org@1.9.12"></script>
+<style>
+  :root {
+    --bg: #0d1117;
+    --panel: #161b22;
+    --line: #30363d;
+    --text: #e6edf3;
+    --muted: #8b949e;
+    --accent: #FB773E;   /* CedarDB orange */
+    --green: #1ee2a2;
+    --red:   #ff5577;
+    --yellow:#f4d04e;
+    --blue:  #3aa6ff;
+    --purple:#9966ff;
+  }
+  * { box-sizing: border-box; }
+  html, body { height: 100%; }
+  body {
+    margin: 0;
+    font-family: ui-monospace, "SF Mono", Menlo, monospace;
+    background: var(--bg);
+    color: var(--text);
+    display: flex;
+    flex-direction: column;
+  }
+  header {
+    padding: 8px 20px;
+    border-bottom: 1px solid var(--line);
+    display: flex;
+    justify-content: space-between;
+    align-items: center;
+    flex: 0 0 auto;
+  }
+  header h1 {
+    font-size: 16px;
+    margin: 0;
+    font-weight: 600;
+    letter-spacing: 0.04em;
+    display: flex;
+    align-items: center;
+    gap: 14px;
+  }
+  header h1 .logo { height: 28px; width: auto; display: block; }
+  header h1 .sep  { color: var(--muted); font-weight: 400; }
+  header .meta    { color: var(--muted); font-size: 12px; }
+
+  main {
+    display: grid;
+    grid-template-rows: 1.2fr 1fr;
+    gap: 10px;
+    padding: 10px;
+    flex: 1 1 auto;
+    min-height: 0;
+  }
+  .row { display: grid; gap: 10px; min-height: 0; }
+  .row.top    { grid-template-columns: 1fr; }
+  .row.bottom { grid-template-columns: 1.3fr 1fr 1fr; }
+
+  .panel {
+    background: var(--panel);
+    border: 1px solid var(--line);
+    border-radius: 8px;
+    padding: 10px 14px;
+    overflow: hidden;
+    display: flex;
+    flex-direction: column;
+    min-height: 0;
+  }
+  .panel h2 {
+    font-size: 11px;
+    margin: 0 0 8px;
+    color: var(--muted);
+    text-transform: uppercase;
+    letter-spacing: 0.08em;
+    flex: 0 0 auto;
+  }
+  .panel h2 .accent { color: var(--accent); }
+  .panel-body { overflow-y: auto; flex: 1 1 auto; min-height: 0; }
+
+  /* --- Alerts queue ----------------------------------------------------- */
+  table.alerts { width: 100%; border-collapse: collapse; font-size: 13px; }
+  table.alerts th {
+    text-align: left;
+    color: var(--muted);
+    font-weight: 400;
+    padding: 6px 8px;
+    border-bottom: 1px solid var(--line);
+    font-size: 11px;
+    text-transform: uppercase;
+    letter-spacing: 0.06em;
+    position: sticky;
+    top: 0;
+    background: var(--panel);
+  }
+  table.alerts td { padding: 6px 8px; border-bottom: 1px solid #1c2128; }
+  table.alerts td.sev   { font-weight: 700; text-align: center; width: 38px; }
+  table.alerts td.detail{ font-weight: 500; }
+  table.alerts td.sla   { font-variant-numeric: tabular-nums; }
+  table.alerts td.sla.breached { color: var(--red); font-weight: 700; }
+  table.alerts td.sla.warn     { color: var(--yellow); }
+  table.alerts td.code  { color: var(--muted); font-size: 12px; }
+  table.alerts tr.sev5 td.sev { color: var(--red); }
+  table.alerts tr.sev4 td.sev { color: #ff8c1a; }
+  table.alerts tr.sev3 td.sev { color: var(--yellow); }
+  table.alerts tr.sev2 td.sev { color: var(--blue); }
+  table.alerts tr.sev1 td.sev { color: var(--muted); }
+
+  /* --- Event stream ---------------------------------------------------- */
+  ul.events { list-style: none; padding: 0; margin: 0; font-size: 12px; }
+  ul.events li {
+    display: grid;
+    grid-template-columns: 60px 80px 110px 1fr 60px;
+    gap: 8px;
+    padding: 3px 4px;
+    border-bottom: 1px solid #1c2128;
+    align-items: center;
+  }
+  ul.events li .ts       { color: var(--muted); font-variant-numeric: tabular-nums; }
+  ul.events li .hh       { color: var(--accent); }
+  ul.events li .dev      { font-weight: 700; }
+  ul.events li .where    { color: var(--muted); }
+  ul.events li .meta     { color: var(--muted); text-align: right; }
+  ul.events li.kind-1    { background: rgba(244, 208, 78, 0.04); }
+  ul.events li.kind-4    { background: rgba(255, 85, 119, 0.06); }
+  ul.events li.sev-5     { background: rgba(255, 85, 119, 0.08); }
+  .kind-tag {
+    display: inline-block;
+    padding: 1px 5px;
+    border-radius: 3px;
+    font-size: 10px;
+    font-weight: 700;
+    letter-spacing: 0.04em;
+    border: 1px solid currentColor;
+  }
+  .kind-tag.k1 { color: var(--yellow); }
+  .kind-tag.k2 { color: var(--blue); }
+  .kind-tag.k3 { color: var(--muted); }
+  .kind-tag.k4 { color: var(--red); }
+
+  /* --- Drilldown ------------------------------------------------------- */
+  .drill-hdr { display: flex; flex-wrap: wrap; gap: 6px 14px; font-size: 12px; margin-bottom: 6px; color: var(--muted); }
+  .drill-hdr .accent { color: var(--accent); font-weight: 700; }
+  .drill-hdr .armed  { color: var(--red); font-weight: 700; }
+  .drill-hdr .disarmed { color: var(--muted); }
+  table.drill { width: 100%; border-collapse: collapse; font-size: 12px; }
+  table.drill td { padding: 3px 6px; border-bottom: 1px solid #1c2128; vertical-align: top; }
+  table.drill td.ts { color: var(--muted); font-variant-numeric: tabular-nums; width: 70px; }
+  table.drill td.dev { font-weight: 700; width: 100px; }
+  table.drill td.where { color: var(--muted); }
+  table.drill tr.sev-high td { background: rgba(255,85,119,0.05); }
+
+  /* --- Storage gauge --------------------------------------------------- */
+  .gauge-wrap {
+    display: flex;
+    flex-direction: column;
+    align-items: center;
+    gap: 4px;
+    margin-bottom: 8px;
+  }
+  .gauge-wrap svg { width: 100%; max-width: 220px; height: auto; display: block; }
+  .gauge-track   { stroke: #1c2128; }
+  .gauge-fill    { stroke: var(--accent); transition: stroke-dasharray 0.3s ease, stroke 0.3s ease; }
+  .gauge-fill.warn { stroke: var(--yellow); }
+  .gauge-fill.good { stroke: var(--green);  }
+  .gauge-tick    { stroke: var(--muted); stroke-width: 1; }
+  .gauge-label-target { fill: var(--muted); font: 600 9px ui-monospace, "SF Mono", Menlo, monospace; letter-spacing: 0.05em; }
+  .gauge-value   {
+    fill: var(--text);
+    font: 700 22px ui-monospace, "SF Mono", Menlo, monospace;
+    text-anchor: middle;
+    font-variant-numeric: tabular-nums;
+  }
+  .gauge-unit { fill: var(--muted); font: 400 10px ui-monospace, "SF Mono", Menlo, monospace; text-anchor: middle; }
+  .gauge-caption { font-size: 11px; color: var(--muted); text-align: center; }
+  .gauge-caption strong { color: var(--accent); font-weight: 700; }
+
+  table.storage {
+    width: 100%;
+    border-collapse: collapse;
+    font-size: 12px;
+    font-variant-numeric: tabular-nums;
+  }
+  table.storage td { padding: 3px 4px; border-bottom: 1px solid #1c2128; }
+  table.storage td.lbl   { color: var(--muted); width: 38%; }
+  table.storage td.val   { font-weight: 700; }
+  table.storage td.pct   { color: var(--muted); text-align: right; font-size: 11px; }
+  table.storage td.pct.good { color: var(--green); }
+  table.storage td.pct.warn { color: var(--yellow); }
+  table.storage td.pct.low  { color: var(--red); }
+  .storage-foot {
+    margin-top: 8px;
+    font-size: 11px;
+    color: var(--muted);
+    text-align: center;
+  }
+  .storage-foot .accent { color: var(--accent); font-weight: 700; }
+
+  /* --- Footer ---------------------------------------------------------- */
+  footer {
+    padding: 5px 20px;
+    border-top: 1px solid var(--line);
+    color: var(--muted);
+    font-size: 11px;
+    display: flex;
+    justify-content: space-between;
+    flex: 0 0 auto;
+  }
+  footer .accent  { color: var(--accent); font-weight: 700; font-variant-numeric: tabular-nums; }
+  footer .alerts  { color: var(--yellow); font-variant-numeric: tabular-nums; }
+  footer .docs-link { color: var(--accent); text-decoration: none; border-bottom: 1px dotted rgba(251,119,62,0.5); }
+  footer .docs-link:hover { border-bottom-color: var(--accent); }
+
+  .muted { color: var(--muted); }
+</style>
+</head>
+<body>
+
+<header>
+  <h1>
+    <img src="/static/cedardb-logo.svg" alt="CedarDB" class="logo">
+    <span class="sep">&nbsp;</span>
+    HOMEGUARD&nbsp;IoT&nbsp;·&nbsp;OPERATOR&nbsp;CONSOLE
+  </h1>
+  <div class="meta">
+    Active alerts <span id="hdr-active">0</span> ·
+    Events ingested <span id="hdr-total">0</span> ·
+    <span id="conn-status">connecting…</span>
+  </div>
+</header>
+
+<main>
+  <section class="row top">
+    <div class="panel">
+      <h2>Active Alerts <span class="accent">·</span> SLA-aware <span class="muted" style="float:right;font-weight:400">refresh {{.AlertsRefreshMs}} ms · joins alerts × households × plans × regions</span></h2>
+      <div class="panel-body"
+           hx-get="/api/alerts"
+           hx-trigger="load, every {{.AlertsRefreshMs}}ms"
+           hx-swap="innerHTML">
+        <p class="muted">waiting for first alert…</p>
+      </div>
+    </div>
+  </section>
+
+  <section class="row bottom">
+    <div class="panel">
+      <h2>Live Event Stream <span class="accent">·</span> non-heartbeat <span class="muted" style="float:right;font-weight:400">SSE · joins events × devices × device_types × households × regions</span></h2>
+      <ul id="events" class="events panel-body">
+        <li class="muted">connecting to stream…</li>
+      </ul>
+    </div>
+    <div class="panel">
+      <h2>Customer Drill-Down <span class="accent">·</span> highest-severity household <span class="muted" style="float:right;font-weight:400">refresh {{.DrilldownRefreshMs}} ms</span></h2>
+      <div class="panel-body"
+           hx-get="/api/drilldown"
+           hx-trigger="load, every {{.DrilldownRefreshMs}}ms"
+           hx-swap="innerHTML">
+        <p class="muted">selecting household…</p>
+      </div>
+    </div>
+    <div class="panel">
+      <h2>Storage Growth <span class="accent">·</span> uncompressed <span class="muted" style="float:right;font-weight:400">cedardb_compression_info · refresh 1 s</span></h2>
+      <div class="panel-body">
+        <div class="gauge-wrap">
+          <svg viewBox="0 0 200 130" xmlns="http://www.w3.org/2000/svg" aria-label="1-minute ingest rate gauge">
+            <!-- 180° arc, opens downward; centre (100,100) radius 75 -->
+            <path class="gauge-track"
+                  d="M 25 100 A 75 75 0 0 1 175 100"
+                  fill="none" stroke-width="14" stroke-linecap="round"/>
+            <path id="gauge-fill" class="gauge-fill"
+                  d="M 25 100 A 75 75 0 0 1 175 100"
+                  fill="none" stroke-width="14" stroke-linecap="round"
+                  pathLength="100" stroke-dasharray="0 100"/>
+            <!-- 50% / 100% tick marks for visual orientation -->
+            <line class="gauge-tick" x1="100" y1="22" x2="100" y2="28"/>
+            <text x="100" y="65" class="gauge-value" id="gauge-mbps">—</text>
+            <text x="100" y="83" class="gauge-unit">MB/s · 1m avg</text>
+            <text x="100" y="122" class="gauge-label-target" text-anchor="middle">TARGET 34.7 MB/s · 3 TB/day</text>
+          </svg>
+          <div class="gauge-caption" id="gauge-caption">collecting samples…</div>
+        </div>
+        <table class="storage" id="storage-table">
+          <tr>
+            <td class="lbl">Total uncompressed</td>
+            <td class="val" id="st-total">—</td>
+            <td class="pct"></td>
+          </tr>
+          <tr>
+            <td class="lbl">1-minute rate</td>
+            <td class="val" id="st-1m">—</td>
+            <td class="pct" id="st-1m-pct"></td>
+          </tr>
+          <tr>
+            <td class="lbl">5-minute rate</td>
+            <td class="val" id="st-5m">—</td>
+            <td class="pct" id="st-5m-pct"></td>
+          </tr>
+          <tr>
+            <td class="lbl">15-minute rate</td>
+            <td class="val" id="st-15m">—</td>
+            <td class="pct" id="st-15m-pct"></td>
+          </tr>
+        </table>
+        <div class="storage-foot">
+          Projected daily (current 1m rate):
+          <span class="accent" id="st-projection">—</span>
+        </div>
+      </div>
+    </div>
+  </section>
+</main>
+
+<footer>
+  <span>
+    INSERTs: <span id="rate" class="accent">— ev/sec</span> ·
+    <span id="total" class="accent">— total events</span> ·
+    <span id="alerts" class="alerts">— active / — total alerts</span> ·
+    one table, four concurrent reads · CedarDB ·
+    <a href="/static/queries.html" target="_blank" rel="noopener" class="docs-link">SQL queries&nbsp;↗</a>
+  </span>
+  <span id="last-frame" class="muted">—</span>
+</footer>
+
+<script>
+const kindNames = { 0: "HB", 1: "TRIG", 2: "BATT", 3: "OFFL", 4: "TAMP" };
+
+function renderEvents(snap) {
+  const ul = document.getElementById("events");
+  if (!snap.events || snap.events.length === 0) {
+    ul.innerHTML = '<li class="muted">no triggered events yet…</li>';
+    return;
+  }
+  ul.innerHTML = snap.events.map(e => {
+    const cls = `kind-${e.kind}` + (e.severity >= 4 ? " sev-5" : "");
+    const tag = `<span class="kind-tag k${e.kind}">${kindNames[e.kind] || e.kind}</span>`;
+    return `<li class="${cls}">
+      <span class="ts">${e.ts}</span>
+      <span class="hh">#${e.household_id}</span>
+      <span class="dev">${e.device_code}</span>
+      <span class="where">${e.location.replace(/_/g, " ")} · ${e.region}</span>
+      <span class="meta">${tag}</span>
+    </li>`;
+  }).join("");
+  document.getElementById("last-frame").textContent =
+    "last frame " + new Date(snap.stamp_ms).toLocaleTimeString();
+}
+
+let es;
+function openStream() {
+  es = new EventSource("/sse/events");
+  es.onopen    = () => document.getElementById("conn-status").textContent = "stream OK";
+  es.onerror   = () => document.getElementById("conn-status").textContent = "stream reconnecting…";
+  es.onmessage = (e) => {
+    try { renderEvents(JSON.parse(e.data)); } catch { /* ignore */ }
+  };
+}
+
+async function refreshStats() {
+  try {
+    const r = await fetch("/api/stats");
+    const d = await r.json();
+    document.getElementById("rate").textContent =
+      (d.rows_per_sec || 0).toLocaleString() + " ev/sec";
+    document.getElementById("total").textContent =
+      (d.total_events || 0).toLocaleString() + " total events";
+    document.getElementById("alerts").textContent =
+      `${(d.active_alerts || 0).toLocaleString()} active / ` +
+      `${(d.total_alerts || 0).toLocaleString()} total alerts`;
+    document.getElementById("hdr-active").textContent =
+      (d.active_alerts || 0).toLocaleString();
+    document.getElementById("hdr-total").textContent =
+      (d.total_events || 0).toLocaleString();
+  } catch (e) { /* keep polling */ }
+}
+
+openStream();
+refreshStats();
+setInterval(refreshStats, {{.StatsRefreshMs}});
+
+// --- Storage gauge ------------------------------------------------------
+const STORAGE_TARGET_BPS = 34_722_222; // 3 TB / day
+
+function humanBytes(n) {
+  if (n === undefined || n === null || isNaN(n)) return "—";
+  const units = ["B", "KB", "MB", "GB", "TB", "PB"];
+  let i = 0, v = n;
+  while (v >= 1024 && i < units.length - 1) { v /= 1024; i++; }
+  return v.toFixed(v >= 100 ? 0 : v >= 10 ? 1 : 2) + " " + units[i];
+}
+function humanRate(bps) {
+  if (bps === undefined || bps === null || isNaN(bps)) return "—";
+  return humanBytes(bps) + "/s";
+}
+function pctOfTarget(bps) {
+  if (bps === undefined || bps === null || isNaN(bps)) return null;
+  return (bps / STORAGE_TARGET_BPS) * 100;
+}
+function pctClass(p) {
+  if (p === null) return "";
+  if (p >= 95) return "good";
+  if (p >= 60) return "warn";
+  return "low";
+}
+function setPct(elId, bps) {
+  const el = document.getElementById(elId);
+  if (!el) return;
+  const p = pctOfTarget(bps);
+  if (p === null) { el.textContent = ""; el.className = "pct"; return; }
+  el.textContent = p.toFixed(1) + "%";
+  el.className = "pct " + pctClass(p);
+}
+
+async function refreshStorage() {
+  let d;
+  try {
+    const r = await fetch("/api/storage");
+    d = await r.json();
+  } catch (e) { return; }
+
+  // Gauge: % of target driven by the 1-minute rate. Cap at 110% visually
+  // so we don't run past the arc.
+  const r1 = d.rate_1m_bps;
+  const pct1 = pctOfTarget(r1);
+  const fill = document.getElementById("gauge-fill");
+  if (pct1 === null) {
+    fill.setAttribute("stroke-dasharray", "0 100");
+    document.getElementById("gauge-mbps").textContent = "—";
+    document.getElementById("gauge-caption").textContent = "collecting samples…";
+  } else {
+    const clamped = Math.max(0, Math.min(pct1, 110));
+    fill.setAttribute("stroke-dasharray", clamped.toFixed(2) + " " + (100 - clamped).toFixed(2));
+    fill.classList.remove("warn", "good");
+    if (pct1 >= 95)      fill.classList.add("good");
+    else if (pct1 >= 60) fill.classList.add("warn");
+    const mbps = r1 / 1_000_000;
+    document.getElementById("gauge-mbps").textContent = mbps.toFixed(mbps >= 100 ? 0 : 1);
+    document.getElementById("gauge-caption").textContent =
+      pct1.toFixed(1) + "% of target · " + humanRate(r1);
+  }
+
+  // Table
+  document.getElementById("st-total").textContent = humanBytes(d.latest_bytes);
+  document.getElementById("st-1m").textContent  = humanRate(d.rate_1m_bps);
+  document.getElementById("st-5m").textContent  = humanRate(d.rate_5m_bps);
+  document.getElementById("st-15m").textContent = humanRate(d.rate_15m_bps);
+  setPct("st-1m-pct",  d.rate_1m_bps);
+  setPct("st-5m-pct",  d.rate_5m_bps);
+  setPct("st-15m-pct", d.rate_15m_bps);
+
+  // Daily projection from the freshest (1m) rate
+  const proj = document.getElementById("st-projection");
+  if (d.rate_1m_bps === undefined || d.rate_1m_bps === null) {
+    proj.textContent = "—";
+  } else {
+    const perDay = d.rate_1m_bps * 86400;
+    proj.textContent = humanBytes(perDay) + " / day";
+  }
+}
+refreshStorage();
+setInterval(refreshStorage, {{.StorageRefreshMs}});
+</script>
+</body>
+</html>
diff --git a/homeguard-iot/out/server b/homeguard-iot/out/server
new file mode 100755
index 0000000..0957fbb
Binary files /dev/null and b/homeguard-iot/out/server differ
diff --git a/homeguard-iot/out/simulator b/homeguard-iot/out/simulator
new file mode 100755
index 0000000..37f5f53
Binary files /dev/null and b/homeguard-iot/out/simulator differ
diff --git a/homeguard-iot/run.sh b/homeguard-iot/run.sh
new file mode 100755
index 0000000..56f01b9
--- /dev/null
+++ b/homeguard-iot/run.sh
@@ -0,0 +1,29 @@
+#!/bin/bash
+
+# Using a random port which isn't likely to be taken by someone else
+export DATABASE_URL="postgresql://postgres:postgres@localhost:26257/postgres?sslmode=require"
+
+# Simulator
+export HG_STORAGE_SAMPLER_INTERVAL="5s"
+export HG_INGEST_BATCH="50000"
+export HG_INGESTORS="10"
+
+# Server
+export HG_SSE_INTERVAL="1s"
+export HG_ALERTS_REFRESH="5s"
+export HG_DRILLDOWN_REFRESH="5s"
+export HG_STATS_REFRESH="1s"
+export HG_STORAGE_REFRESH="2s"
+
+# Small run, for a little laptop:
+#nohup ./out/simulator -households=30000 -devices-per-household=10 -rate=2500 -hz=10 -writers=4 >> simulator.log 2>&1 </dev/null &
+
+# Large run, for a server with 192 cores, 384 GB RAM, lots of disk space
+nohup ./out/simulator -households=200000 -devices-per-household=12 -rate=1750000 -hz=20 -writers=64 >> simulator.log 2>&1 </dev/null &
+
+# Server startup
+nohup ./out/server -addr=:18080 >> server.log 2>&1 </dev/null &
+
+# Tail the logs:
+# tail -50f simulator.log
+
diff --git a/homeguard-iot/stop.sh b/homeguard-iot/stop.sh
new file mode 100755
index 0000000..562bd9e
--- /dev/null
+++ b/homeguard-iot/stop.sh
@@ -0,0 +1,4 @@
+#!/bin/bash
+
+kill $( ps | egrep '(server|simulator)' | awk '{print $1}' )
+
diff --git a/homeguard-iot/view_of_app_on_192_cores.jpg b/homeguard-iot/view_of_app_on_192_cores.jpg
new file mode 100644
index 0000000..07e141a
Binary files /dev/null and b/homeguard-iot/view_of_app_on_192_cores.jpg differ
diff --git a/homeguard-iot/view_of_app_on_MacBook_Air.jpg b/homeguard-iot/view_of_app_on_MacBook_Air.jpg
new file mode 100644
index 0000000..8509ac8
Binary files /dev/null and b/homeguard-iot/view_of_app_on_MacBook_Air.jpg differ