REST API

Mounted under /api/v1 on the configured API bind. JSON in, JSON out. No authentication in v1 — bind to loopback or front it with a reverse proxy you trust.

OpenAPI 3.1 document at GET /api/openapi.json; Swagger UI at GET /docs.

All responses use Content-Type: application/json; charset=utf-8.

Response headers

POST /api/v1/targets (201) sets Location: /api/v1/targets/{id} so clients can follow up without re-deriving the path.
Cache-Control is stamped on every /api/v1/* response:
- mutations (POST / PATCH / DELETE) → no-store
- /api/v1/dashboard/summary → private, max-age=5 (matches the server-side cache)
- all other reads → private, max-age=10

Endpoints

Method	Path	Purpose
`POST`	`/api/v1/targets`	create one target
`POST`	`/api/v1/targets/bulk`	bulk-create up to 10,000 targets
`POST`	`/api/v1/targets/bulk-action`	enable / disable / delete / tag-add / tag-remove on many ids
`POST`	`/api/v1/targets/test`	run a one-shot check against a `CheckSpec` without persisting
`POST`	`/api/v1/targets/{id}/check-now`	run an immediate check using the target’s stored credentials
`GET`	`/api/v1/targets`	list targets (`limit`, `offset`, `tag`, `enabled`, `q`) — paginated
`GET`	`/api/v1/targets/{id}`	get one target
`PATCH`	`/api/v1/targets/{id}`	update name, check spec, interval, enabled, tags
`DELETE`	`/api/v1/targets/{id}`	delete a target
`GET`	`/api/v1/targets/{id}/results`	recent check results (`from`, `to`, `limit`, `offset`, `region`) — paginated
`GET`	`/api/v1/targets/{id}/latency`	bucketed latency series (`from`, `to`, `region`) — server-side quantiles + per-phase means
`GET`	`/api/v1/targets/{id}/latency/by-region`	per-region latency series (`from`, `to`) — one series per region, for overlay charts
`GET`	`/api/v1/targets/{id}/uptime`	uptime summary over a range (`from`, `to`, `region`)
`GET`	`/api/v1/targets/{id}/regions`	list the regions a monitor probes from
`PUT`	`/api/v1/targets/{id}/regions`	set the regions a monitor probes from
`GET`	`/api/v1/regions`	list the enabled probe-region catalog (`id`, `name`, `location`)
`GET`	`/api/v1/targets/{id}/incidents`	coalesced incident periods (`from`, `to`, `ongoing_only`) — paginated
`POST`	`/api/v1/targets/{id}/shares`	mint a read-only share link; returns the share (token included)
`GET`	`/api/v1/targets/{id}/shares`	list a monitor’s live share links (token included, re-copyable)
`DELETE`	`/api/v1/targets/{id}/shares/{share_id}`	revoke a share link
`GET`	`/api/v1/tags`	tag inventory with target counts (`q` prefix) — paginated
`GET`	`/api/v1/dashboard/summary`	per-org rollup (5-second in-process cache, keyed by `OrgId`)
`GET`	`/healthz`	liveness — always 200 once the process is up
`GET`	`/readyz`	readiness — pings the target store; 503 if unreachable
`GET`	`/api/openapi.json`	OpenAPI 3.1 document
`GET`	`/docs`	Swagger UI

Instance-admin and agent surfaces

Two surfaces sit outside /api/v1 with their own auth, used only for multi-region deployments:

/operator/* — instance-admin regions + agents CRUD, gated by a static bearer secret (UPTIMEPAGE_OPERATOR__ADMIN_TOKEN); 404s when unset.
/api/agent/* — the pull/ingest endpoints an agent uses, authenticated by its sm_agent_… token (not a tenant api_token).

Both are documented in Multi-region probes.

Operator endpoints (maintenance + incident narration)

These mutate the public surface; they live under the same auth boundary as /api/v1/targets. Operator workflow + validation rules in Public status page.

Method	Path	Purpose
`POST`	`/api/v1/maintenance`	schedule a maintenance window
`GET`	`/api/v1/maintenance`	list windows (`status=active\|upcoming\|past\|all`, paginated)
`GET`	`/api/v1/maintenance/{id}`	get one window
`PATCH`	`/api/v1/maintenance/{id}`	edit title / description / time range / components (rejected after `ends_at`)
`DELETE`	`/api/v1/maintenance/{id}`	cancel a window
`PATCH`	`/api/v1/incidents/{id}`	update narration: `public_title`, `public_description`, `severity` (JSON `null` clears, omit to leave alone)
`POST`	`/api/v1/incidents/{id}/updates`	append a status update — `phase` ∈ `investigating`/`identified`/`monitoring`/`resolved`/`postmortem`, `message` ≤ 2 000 chars

Operator endpoints (status pages)

An org owns one or more public status pages, each with its own slug, branding, and curated set of monitors. Reads are open to any active member; every mutation is owner-only. Scoped to the caller’s active org (a foreign page id is 404). Adding a monitor already on the page returns 409 COMPONENT_ALREADY_ON_PAGE — edit it with PATCH. Model + caps in Per-org status pages.

Method	Path	Purpose
`GET`	`/api/v1/status-pages`	list this org’s pages
`POST`	`/api/v1/status-pages`	create a page (capped at `max_status_pages`; slug globally unique)
`GET`	`/api/v1/status-pages/{id}`	one page + its live URL and logo URL
`PATCH`	`/api/v1/status-pages/{id}`	rename, change slug, publish/unpublish, edit branding
`DELETE`	`/api/v1/status-pages/{id}`	delete the page
`GET`	`/api/v1/status-pages/{id}/components`	the monitors curated onto the page
`POST`	`/api/v1/status-pages/{id}/components`	add a monitor (distinct-target cap `max_public_components`)
`PATCH`	`/api/v1/status-pages/{id}/components/{target_id}`	per-page `public_name` / `public_description` / `public_group` (JSON `null` clears)
`DELETE`	`/api/v1/status-pages/{id}/components/{target_id}`	remove a monitor from the page
`POST`	`/api/v1/status-pages/{id}/components/reorder`	set component order
`POST`	`/api/v1/status-pages/{id}/logo`	upload a logo (multipart)
`DELETE`	`/api/v1/status-pages/{id}/logo`	remove the logo

Public status endpoints

Unauthenticated; mounted at /api/public/v1/* and bypassed at Caddy via the @public matcher (see Deployment). Each response carries Cache-Control: public, max-age=10, stale-while-revalidate=30. A monitor not curated onto the page being served is invisible on every public surface — direct lookups return 404 and it never appears in any list. Wire types literally cannot serialise sensitive target fields (url, headers, basic_auth, bearer_token).

Method	Path	Purpose
`GET`	`/status`	server-rendered HTML status page (`?fragment=1` returns the dynamic region only)
`GET`	`/status/incidents/{id}`	per-incident detail page
`GET`	`/api/public/v1/status`	the same data as `/status` in JSON
`GET`	`/api/public/v1/components/{id}/history`	per-component 90-day history (`days` query, default 90, max 90)
`GET`	`/api/public/v1/incidents`	recent public incidents (paginated)
`GET`	`/api/public/v1/incidents/{id}`	one public incident with its update timeline
`GET`	`/api/public/v1/incidents.rss`	RSS 2.0 feed of recent incidents
`GET`	`/api/public/v1/maintenance`	active + upcoming maintenance windows
`GET`	`/api/public/v1/badge.svg`	embeddable SVG status badge (overall, or `?component={id}`)

See Public status page for the operator workflow and the per-page component fields (public_name, public_description, public_group, sort_order) that drive what’s published.

A share link is a capability URL that renders one monitor’s full read-only detail view to anyone who has it, no account. Managing share links — mint, list, revoke — is a monitor action gated on member-level targets:write (not owner-only); listing returns the live token so a read-only caller can’t harvest working public links. Scoped to the caller’s active org (a foreign monitor id is 404). expires_at is optional; omit it for a link that never expires. The public surface those tokens unlock is documented in Share links.

Method	Path	Purpose
`POST`	`/api/v1/targets/{id}/shares`	mint a share; body `{ "label"?, "expires_at"? }`, returns the `MonitorShare`
`GET`	`/api/v1/targets/{id}/shares`	list live (non-revoked) shares
`DELETE`	`/api/v1/targets/{id}/shares/{share_id}`	revoke immediately — the link 404s on its next request

Both POST and GET return the token; build the link as /m/{token} (prepend your origin). The token stays re-copyable — it is stored encrypted at rest (the app KEK, same as basic_auth/bearer_token); the public resolve path matches on a separate hash, so a hot link never triggers a decrypt. token is null only when a row was sealed under a KEK that is no longer configured. Two plan caps apply (columns on plans, overridable per-org via plan_overrides): max_share_links_per_monitor (active links on one monitor) and max_shared_monitors (distinct monitors in the org that have any link). The free plan is 1 and 2. Exceeding either is 422 QUOTA_EXCEEDED (the body names the quota). A label longer than 80 characters is 400 SHARE_LABEL_INVALID; an expires_at in the past is 400 INVALID_EXPIRY.

Check specs

Tagged enum, type discriminator.

HTTP

{
  "type": "http",
  "url": "https://example.com/healthz",
  "method": "GET",
  "timeout": 5000,                              // ms, total request budget
  "follow_redirects": false,
  "max_redirects": 0,
  "expected_status": { "kind": "exact", "value": 200 },
  "expected_body_contains": null,               // optional substring match
  "headers": {},
  "body": null,
  "verify_tls": true,
  "basic_auth": null,                           // ["user", "pass"] or null
  "bearer_token": null
}

Credential redaction

GET, POST, PATCH, and bulk responses replace populated basic_auth / bearer_token fields with the sentinel "***". A null field stays null, so clients can distinguish “auth is configured” from “no auth”. When you PATCH a target’s check, you must re-supply the real credential — a body that contains "***" is rejected with 400 Bad Request. If you only need to change other fields (name, tags, enabled, interval), omit check from the PATCH body. Encryption at rest is gated on security.credentials_kek_base64; the redaction behavior applies in either mode.

expected_status variants:

{ "kind": "exact", "value": 200 }
{ "kind": "range", "value": { "min": 200, "max": 299 } }
{ "kind": "one_of", "value": [200, 204] }

Rate-limited responses

A response with 429 Too Many Requests or 503 Service Unavailable is recorded as degraded, not down — the upstream is telling us “I’m here, back off.” The error field carries rate-limited <code> (Retry-After: <value>) when the header is present so operators can size the polling interval against what the upstream actually wants. A check that explicitly accepts 429 / 503 via expected_status is honored first and stays up.

Some third-party APIs rate-limit by source IP regardless. GitHub’s unauthenticated REST API is the canonical case: 60 req/h per IP, 5 000 req/h with a token in the Authorization header. Poll those endpoints at ≥ 300 s, or attach the token via a header in this spec.

Per-host throttle

The worker side caps the number of concurrent checks one tenant can fan at the same (host, port) so a burst of monitors against one upstream doesn’t look like a probe. When the cap is reached, the over-cap check is recorded as degraded with error="throttled: host concurrency cap" and no alert fires — the upstream is fine, the back-pressure is operator-side. The cap is per-tenant: one customer’s burst never starves another customer’s monitor of the same host. Default cap is two in-flight per (org, host, port); tune via checker.per_host_max_inflight. RDAP queries (domain expiry) carry their own per-TLD cap via checker.rdap_max_inflight.

TCP

{ "type": "tcp", "host": "db.internal", "port": 5432, "timeout": 2000 }

TLS certificate expiry

{
  "type": "tls_cert",
  "host": "example.com",
  "port": 443,
  "server_name": null,         // optional SNI override; defaults to `host`
  "warn_days": 14,
  "critical_days": 7,
  "timeout": 5000
}

Opens a TCP connection, performs a TLS handshake against the host (accepting any presented chain so that expired or self-signed certs can still be inspected), and parses the leaf certificate’s notAfter. Status mapping:

days_remaining < 0 (expired) → down
days_remaining < critical_days → down
days_remaining < warn_days → degraded
otherwise → up

error carries a JSON document with days_remaining, not_after, subject_common_name, issuer_common_name. A handshake failure (plain-TCP host, network error) returns error status with the underlying message. warn_days must be strictly greater than critical_days. Floor is interval >= 3600 (enforced); default for a new monitor is 86400 (daily).

Domain expiration

{
  "type": "domain_expiry",
  "domain": "example.com",
  "warn_days": 30,
  "critical_days": 7,
  "timeout": 10000
}

Queries the IANA RDAP bootstrap registry to find the authoritative RDAP server for the domain’s TLD, then fetches /domain/<domain> and reads the events[?eventAction == "expiration"] entry. Status mapping is the same as TLS cert: < critical_days → down, < warn_days → degraded, else up. Non-up results carry a JSON error body with domain, days_remaining, expiration_date, and (when present) registrar.

The bootstrap registry is fetched lazily on the first lookup and cached for the lifetime of the process. The SSRF guard does not apply — the check’s network destination is an IANA-published RDAP server, not the user-supplied domain. Floor is interval >= 3600 (enforced); default for a new monitor is 86400 (daily). RDAP servers rate-limit clients — keep this near daily, not hourly. warn_days must be strictly greater than critical_days.

Target payload

{
  "name": "internal-api",
  "check": { /* check spec */ },
  "interval": 60,             // seconds between ticks; effective floor is
                              // max(plan.min_check_interval_secs, kind_min).
                              // kind_min is 10 for http/tcp/dns and 3600 for
                              // tls_cert/domain_expiry. Plan-free min = 60.
                              // 10 is the absolute DB CHECK hard floor.
  "enabled": true,
  "tags": ["prod", "tier1"],
  "alerts": { /* optional, see below */ }
}

Server returns the full Target including id (UUIDv7), created_at, updated_at, and write_source.

write_source is a read-only field recording where the resource was last written from: ui, api, or terraform (decided server-side from the request, never the body — sending it is ignored). It also appears on notification channels and maintenance windows, and drives the “managed by” badge in the web UI. A write through any endpoint restamps it, so it reflects the most recent author.

Alert config

alerts is an optional array of channel bindings. Each binding is just a reference to a notification channel (see Notification channels); the firing policy lives on the monitor itself. An empty/omitted array disables channel alerting for that target (incidents still open and show on status pages).

"alerts": [
  { "channel_id": "0192a1ce-0000-7000-8000-000000000001" },
  { "channel_id": "0192a1ce-0000-7000-8000-000000000002" }
],
"alert_confirmations": 3,
"notify_recovery": true,
"renotify_interval_secs": 3600,
"region_policy": "majority"

channel_id — id of a notification channel owned by the same org. A binding to an unknown or another tenant’s channel is rejected.
alert_confirmations — consecutive failing checks before an incident opens (and the same number of passing checks before it closes, which damps flapping). Default 2, must be >= 1.
notify_recovery — when true (default), the recovery is announced to the monitor’s channels. When false, recovery is silent.
renotify_interval_secs — seconds between reminder notifications while an outage stays unacknowledged. 0 disables reminders; otherwise must be >= 60. Default 3600. Acknowledging or resolving the incident stops the reminders.
region_policy — how many probe regions must agree the target is down before an incident opens: "any", "majority" (default), "all", or { "count": N }.

Notifications are driven by the incident engine: one notification per incident open (then reminders per renotify_interval_secs), one on recovery. Failed deliveries retry on exponential backoff and dead-letter after the attempt cap; per-incident delivery state is visible at GET /api/v1/incidents/{id}/notifications.

Alert validation errors

POST and PATCH return 400 Bad Request (INVALID_ALERT_CONFIG) for:

a duplicate channel_id in the array
notification channel <id> does not exist — unknown id, or one owned by another org
alert_confirmations must be >= 1
renotify_interval_secs must be 0 (off) or at least 60

A region_policy of { "count": N } where N is 0 or exceeds the available regions is 422 INVALID_REGION_POLICY.

Validation errors

POST and PUT return 400 Bad Request for:

Unsupported URL scheme (url scheme '...' not allowed — only http and https)
Missing URL host, empty TCP host, or TCP/TLS port 0
tls_cert warn_days must be > critical_days
domain_expiry domain must contain a TLD label (no dot in domain)
domain_expiry warn_days must be > critical_days
SSRF guard — target address ... is in a blocked range. Triggered when the URL or TCP host is an IP literal that resolves to loopback / private / link-local / reserved space (see Configuration → security.allow_private_targets). Hostname literals are checked again at connect time after DNS resolution, so DNS rebinding cannot bypass the guard.
Redaction sentinel — basic_auth contains redaction sentinel — re-supply the real credential or the equivalent for bearer_token. Rejected to prevent a GET → PATCH round-trip from silently overwriting the stored credential with "***".
TLS verification + credentials — verify_tls = false cannot be combined with basic_auth or bearer_token over https. When verification is disabled any host presenting a forged certificate can collect the stored credential on every check interval. Set verify_tls = true (recommended) or remove the credential from the target.

Notification channels

Per-org delivery destinations that targets bind to via their alerts array. Org scoping is implicit in the caller’s authenticated context — one tenant can never read, mutate, or test another’s channels.

Method	Path	Purpose
`POST`	`/api/v1/notification-channels`	Create a channel (201 + `Location`)
`GET`	`/api/v1/notification-channels`	List the org’s channels
`GET`	`/api/v1/notification-channels/{id}`	Get one
`PATCH`	`/api/v1/notification-channels/{id}`	Partial update
`DELETE`	`/api/v1/notification-channels/{id}`	Delete (204); also removes the channel’s alert bindings from every monitor
`POST`	`/api/v1/notification-channels/test`	Test an unsaved transport config
`POST`	`/api/v1/notification-channels/{id}/test`	Send a synthetic test alert through a saved channel
`POST`	`/api/v1/notification-channels/{id}/resend-verification`	Resend the verification mail for an unverified email channel

{
  "name": "Ops Slack",
  "enabled": true,
  "config": { "type": "slack", "webhook_url": "https://hooks.slack.com/services/T/B/XXXX" }
}

config is type-tagged. Supported transports:

slack — { "type": "slack", "webhook_url": "https://…" } (incoming webhook; posts { "text": "…" })
discord — { "type": "discord", "webhook_url": "https://discord.com/api/webhooks/…" } (channel webhook; posts { "content": "…" } with ?wait=true so delivery failures surface synchronously; text capped at 2000 chars)
msteams — { "type": "msteams", "webhook_url": "https://….logic.azure.com/…" } (Teams Workflows webhook; posts an Adaptive Card. Retired O365 connector URLs are not accepted)
google_chat — { "type": "google_chat", "webhook_url": "https://chat.googleapis.com/v1/spaces/…" } (space webhook; posts { "text": "…" }, capped at 4096 chars)
webhook — { "type": "webhook", "url": "https://…", "headers": { … }, "secret": "…" } (POSTs the alert JSON; optional custom headers; optional signing secret, see below). The escape hatch: no host restrictions, for services the named kinds don’t cover
telegram — { "type": "telegram", "bot_token": "…", "chat_id": "…" } (bring-your-own bot)
telegram_app — { "type": "telegram_app", "chat_id": "…", "chat_title": "…" } — linked through the platform’s central bot. Not creatable from request bodies: a POST/PATCH/test carrying this kind returns 422 CHANNEL_KIND_MANAGED (the chat id rides the operator bot’s credentials, so accepting one would let any caller page an arbitrary chat). Channels of this kind are created only by the link-code flow below.
whatsapp — { "type": "whatsapp", "access_token": "…", "phone_number_id": "…", "to": "…", "template_name": "…", "language_code": "en" } (Business Cloud API; language_code optional, default en)
whatsapp_app — { "type": "whatsapp_app", "phone": "…", "profile_name": "…" } — linked through the platform’s WhatsApp number. Not creatable from request bodies (422 CHANNEL_KIND_MANAGED, same rationale as telegram_app); created only by the WhatsApp link-code flow below.
pagerduty — { "type": "pagerduty", "routing_key": "…" } (the 32-character Events API v2 integration key of a PagerDuty service). The only transport that drives the destination’s own incident lifecycle: opens/reopens/escalations send trigger and resolution sends resolve, all correlated by dedup_key = the incident id, so one uptimepage incident maps to exactly one PagerDuty alert that opens and closes with it. Severity maps Critical→critical, Major→error, Minor→warning. A test send fires a trigger+resolve pair on a throwaway dedup key and never leaves an open PagerDuty incident
ntfy — { "type": "ntfy", "server_url": "https://ntfy.sh", "topic": "…", "access_token": "tk_…" } (JSON publish to the server root; server_url optional, defaults to ntfy.sh, must be the bare server root; access_token optional, sent as a Bearer token). High-urgency opens publish at priority 4, the rest at 3; resolves tag white_check_mark, opens rotating_light. On ntfy.sh an unprotected topic’s name is its only access control
pushover — { "type": "pushover", "token": "…", "user": "…", "device": "…" } (30-character application token and user/group key, both treated as secrets; device optional). High-urgency alerts go out at priority 1 (bypasses quiet hours), low at 0, resolves at −1 (no sound). Emergency priority 2 is not used
sms — { "type": "sms", "provider": "twilio", "to": "+15551234567", "from": "…", … } — bring-your-own SMS gateway; one text message per alert, body trimmed to a few segments to bound per-segment cost. to is E.164; from is an E.164 number or sender id. The provider-specific credentials are: twilio → account_sid + auth_token; telnyx → api_key (+ optional messaging_profile_id); vonage → api_key + api_secret; plivo → auth_id + auth_token; sinch → service_plan_id + api_token + region (us/eu/au/br/ca, default us). Only the gateway secret is treated as a secret (Twilio/Plivo auth_token, Telnyx api_key, Vonage api_secret, Sinch api_token); account identifiers stay visible
email — { "type": "email", "to": "oncall@example.com" } — one lowercase address per channel, delivered through the platform’s transactional sender. Verification-gated: the channel is created unverified and a mail with a single-use 24 h link is sent to the address; until the link is confirmed every delivery (incident page or test send) fails with email address not verified. Replacing the config resets the gate and re-sends the mail. POST /api/v1/notification-channels/{id}/resend-verification re-sends it (capped per channel and per org per day — 422 CHANNEL_VERIFICATION_LIMIT; on a non-email channel — 422 CHANNEL_NOT_VERIFIABLE); a test against an unverified or unsaved email config is 422 CHANNEL_UNVERIFIED.

Webhook signing. When a webhook channel carries a secret (≥ 16 characters), every delivery is signed: the request includes X-Uptimepage-Timestamp (unix seconds) and X-Uptimepage-Signature: sha256=<hex>, where the hex is HMAC-SHA256(secret, "{timestamp}.{body}") over the exact bytes sent. Receivers should recompute the digest and reject stale timestamps (e.g. older than 5 minutes) to block replays. Channels without a secret deliver unsigned.

WhatsApp templates. Create a one-parameter utility template (body {{1}}) in the WhatsApp Business Manager and set template_name (plus language_code, which must match the template’s exact language — en and en_US are distinct). The alert text is sent as that single parameter, collapsed to one line. A template is required: WhatsApp accepts free-form text only within 24 hours of the recipient’s last message, and out-of-window sends are accepted by the API yet dropped asynchronously — a silent-loss mode an alerting channel must not have.

Behaviour:

Secrets sealed at rest with the credentials KEK; never echoed back. Every read path masks secret-bearing fields with *** (the webhook URL is masked whole — it can carry a token; header names and chat_id are kept so the UI stays useful).
Redaction-sentinel guard: submitting a config that still contains *** returns 400 REDACTION_SENTINEL. Omit config on PATCH to keep the stored secret unchanged.
Validation (400): every webhook URL must be https; the provider-branded kinds are additionally host-pinned (discord → discord.com/discordapp.com with an /api/webhooks/ path, msteams → *.logic.azure.com/*.powerplatform.com, google_chat → chat.googleapis.com) and a URL elsewhere is rejected with a hint to use the generic webhook kind; telegram requires non-empty bot_token and chat_id; whatsapp requires access_token, a numeric phone_number_id, an international-format to, and a template_name (lowercase/digits/underscore); email requires a lowercase single-address to; pagerduty requires a 32-char alphanumeric routing_key; ntfy requires an https root-only server_url and a 1–64 char topic (letters/digits/_/-); pushover requires 30-char alphanumeric token and user; sms requires an E.164 to, a from, and the selected provider’s credentials (Twilio account_sid is AC + 32 hex; Plivo auth_id and Sinch service_plan_id are alphanumeric; Sinch region is one of us/eu/au/br/ca); channel name is required and ≤ 100 chars.
Destination deny-list: the customer-controlled outbound URL (slack/discord/msteams/google_chat/webhook/ntfy’s server_url) is checked against the platform’s abuse deny-list on create, update, and both test endpoints — a match is rejected (ABUSE_BLOCKED / DOMAIN_DENYLISTED). telegram/whatsapp/email/pagerduty/pushover/sms deliver to fixed vendor endpoints.
Quota: capped per org by the plan’s max_notification_channels (atomic, advisory-locked). A duplicate name within the org is 422 CHANNEL_NAME_TAKEN; the cap is 422 CHANNEL_QUOTA_EXCEEDED.
Test sends deliver one clearly-labelled synthetic alert. The per-channel form tests the stored config (works on a disabled channel too); the collection-level POST …/test takes { "config": { … } } in the body, validates it exactly as create would, and persists nothing — the UI uses it for “test now” before a channel is saved. A transport failure is 422 CHANNEL_TEST_FAILED. Both count against the test_now rate-limit bucket.
Platform disables: when a linked Telegram chat unlinks from its side (the bot is removed, or the chat sends /stop), every channel linked to that chat is disabled with a disabled_reason the UI shows. Re-enabling the channel clears the note.

Telegram one-tap linking

Deployments running the central bot expose a link-code flow (absent — 404 TELEGRAM_LINK_NOT_FOUND — otherwise):

POST /api/v1/notification-channels/telegram-link (channels:write) with an optional { "name": "…" } hint mints a single-use code (15-minute expiry, capped outstanding codes per org → 422 TELEGRAM_LINK_LIMIT). The response carries the raw code (shown once, only its hash is stored), a deep_link (t.me/<bot>?start=<code>, private chat) and a group_deep_link (?startgroup=<code>, picks a group). The same code works for either destination.
Sending the code to the bot (tap Start, or /link <code> in a group) creates the telegram_app channel for the minting org. The org is resolved only from the code — never from the Telegram payload.
GET /api/v1/notification-channels/telegram-link/{id} (channels:read) polls the code: pending, consumed (with channel_id), or expired.
Unlink = delete the channel; deleting the last channel linked to a group also walks the bot out of that group. From the chat side, /stop or removing the bot disables the channel (see platform disables above).

WhatsApp one-tap linking

Deployments with the operator WhatsApp number enabled expose the same flow (absent — 404 WHATSAPP_LINK_NOT_FOUND — otherwise):

POST /api/v1/notification-channels/whatsapp-link (channels:write) with an optional { "name": "…" } hint mints a single-use code (15-minute expiry, capped per org → 422 WHATSAPP_LINK_LIMIT). The response carries the raw code and a deep_link (wa.me/<number>?text=<code>) that opens WhatsApp with the code prefilled.
Sending the prefilled message creates the whatsapp_app channel for the minting org, bound to the sender’s number. The org is resolved only from the code — never from the webhook payload.
GET /api/v1/notification-channels/whatsapp-link/{id} (channels:read) polls the code: pending, consumed (with channel_id), or expired.
Unlink = delete the channel; from the phone side, sending stop disables every channel bound to the number (platform disable, reason shown in the UI).

Delegation links

The person who owns the Slack workspace / Telegram group / inbox often isn’t the person configuring monitors — a delegation link hands off just the connect step.

POST /api/v1/notification-channels/delegate (channels:write) with optional { "name": "…", "kind": "…" } hints mints a single-use /c/<code> URL (7-day expiry, capped outstanding links per org → 422 DELEGATE_LINK_LIMIT; unknown kind → 400 DELEGATE_KIND_INVALID). Only the code’s hash is stored.
GET /c/<code> is public and chrome-less: it offers exactly the connect-capable transports of the deployment — the telegram one-tap link + QR (the delegation code doubles as the t.me start payload), “add to Slack” / “add to Discord” when the operator OAuth apps are configured, and a manual webhook/address form. The link can create one channel in the inviting org and read nothing; expired, revoked, and spent codes all render the same 404 page. Every delegated create lands in the org audit log.
GET /api/v1/notification-channels/delegate (channels:read) lists the org’s links (pending / consumed / expired); DELETE /api/v1/notification-channels/delegate/{id} (channels:write) revokes an unconsumed one (revoked links read as expired).

Rate limiting

/api/v1/* is rate-limited per authenticated subject — by (org, category) and by (user, category), whichever trips first — with the per-minute budgets taken from the org’s plan. Categories: api_writes (POST/PATCH/DELETE), api_reads (GET/HEAD/OPTIONS), bulk_ops (/bulk*), test_now (/test), check_now (/check-now). Exceeding a budget returns 429 Too Many Requests with a Retry-After header (seconds until the next token) and code: RATE_LIMITED. /healthz and /readyz are never throttled. Unauthenticated and per-IP limiting is the reverse proxy’s job (see Deployment). Full model: Quotas & rate limits.

CORS

Disabled by default. When api.cors.enabled = true, /api/v1/* answers preflight OPTIONS with Access-Control-Allow-Origin (matching allowed_origins or * when allow_any_origin = true), Access-Control-Allow-Methods (the configured list), and Access-Control-Allow-Headers: content-type. /healthz and /readyz carry no CORS headers regardless.

Error envelope

Every 4xx and 5xx response uses one wire shape:

{
  "error": {
    "code": "INVALID_URL_SCHEME",
    "message": "url scheme 'ftp' not allowed",
    "field": "check.url",
    "details": null,
    "trace_id": null
  }
}

code is stable, machine-readable, UPPER_SNAKE_CASE. Never repurposed once published.
field is a JSON pointer to the offending input for 400s; null for non-field errors.
details carries optional structured context (e.g., { "range": "127.0.0.0/8" } for SSRF rejections).
trace_id is the W3C traceparent when tracing is enabled.

Common codes: INVALID_URL_SCHEME, INVALID_URL_FORMAT, SSRF_BLOCKED, INVALID_INTERVAL, INVALID_TIMEOUT, INVALID_TCP_PORT, INVALID_TCP_HOST, INVALID_STATUS_RANGE, INVALID_TLS_CERT_PARAMS, INVALID_DOMAIN_PARAMS, INVALID_TLS_CRED_COMBO, INVALID_ALERT_CONFIG, REDACTION_SENTINEL, BULK_EMPTY, BULK_TOO_LARGE, BAD_TIME_RANGE, TARGET_NOT_FOUND, CHANNEL_NOT_FOUND, CHANNEL_NAME_TAKEN, CHANNEL_NAME_INVALID, CHANNEL_QUOTA_EXCEEDED, INVALID_CHANNEL_CONFIG, CHANNEL_TEST_FAILED, CIRCUIT_OPEN, DEPENDENCY_DOWN, INTERNAL.

Quota, rate-limit and abuse codes

Code	HTTP	Meaning
`QUOTA_EXCEEDED`	422	A plan quota would be exceeded. `details` carries `quota` (e.g. `max_targets`, `max_members`, `max_public_components`), `current`, `limit`, `plan`.
`MIN_CHECK_INTERVAL`	422	Requested check interval is below the effective floor (`max(plan.min_check_interval_secs, kind_min)`), where `kind_min` is 3600 for `tls_cert` / `domain_expiry` and 10 for `http` / `tcp` / `dns`. Enforced on create, bulk, and PATCH.
`INVITATIONS_LIMIT`	409	The org is at its pending-invitation cap.
`RATE_LIMITED`	429	A per-minute rate budget was exceeded. `Retry-After` (seconds) is set; `details.scope` names the tier, e.g. `per_org_api_writes`.
`ABUSE_BLOCKED`	400	Target blocked by abuse protection. `details.reason` explains.
`URL_PATTERN_BLOCKED`	400	Target URL matched an abuse pattern (recon path).
`DOMAIN_DENYLISTED`	400	Target domain (or a parent) is on the deny-list.

See Quotas & rate limits for the quota model, the per-minute categories, and the deny-list policy.

Pagination envelope

Every list endpoint returns:

{ "items": [ /* ... */ ], "total": 1240, "limit": 50, "offset": 0 }

limit defaults to 50 for /targets and /tags, 1000 for /results, 100 for /incidents. limit is silently capped server-side (10,000 for results, 1,000 for incidents/tags). total reflects rows matching the filters, ignoring limit/offset.

Results query

GET /api/v1/targets/{id}/results?from=2026-05-12T00:00:00Z&to=2026-05-12T23:59:59Z&limit=100&offset=0

from / to default to the last 24 h; to must be strictly greater than from (400 BAD_TIME_RANGE otherwise).
Returns a PageEnvelope of CheckResult ordered by timestamp DESC.

Latency series

GET /api/v1/targets/{id}/latency?from=…&to=…

Pre-bucketed quantiles and per-phase means read straight from the per-minute rollup — powers the monitor-detail latency line and phase-breakdown area charts. The server divides the range into ~60 slices (floored to the 60-second rollup grain), so any range returns a comparably dense series and the cost stays O(buckets), not O(samples). Switching range re-scales the buckets.

from / to default to the last 24 h; to must be strictly greater than from (400 BAD_TIME_RANGE).

{
  "bucket_seconds": 1440,
  "buckets": [
    {
      "t": 1747137600000,      // unix-ms at bucket start (JS new Date(t))
      "p50": 120, "p95": 180, "p99": 240,
      "avg": 130,              // mean total; breakdown chart derives "processing" = avg − (dns+connect+tls+ttfb)
      "dns": 12, "connect": 20, "tls": 35, "ttfb": 60,  // mean per-phase ms; 0 for kinds that skip the phase
      "samples": 24            // 0 marks a gap the chart leaves unconnected
    }
  ]
}

bucket_seconds is always a multiple of 60 (1h→60, 24h→1440, 7d→10080, 30d→43200).

Region filter

results, latency, and uptime accept an optional region= query parameter to scope the read to one probe region; omit it for an all-regions view. Region ids are the slugs registered via the operator surface. See Multi-region probes.

Per-region latency series

GET /api/v1/targets/{id}/latency/by-region?from=…&to=…

Same bucketing and cost as /latency, but split by region so each can be overlaid as its own line — powers the monitor-detail overlay chart. One entry per region that has samples in the range; each region’s buckets use the same shape as /latency.

{
  "bucket_seconds": 1440,
  "regions": [
    { "region": "default",  "buckets": [ /* LatencyBucket… */ ] },
    { "region": "eu-west",  "buckets": [ /* LatencyBucket… */ ] }
  ]
}

Uptime query

GET /api/v1/targets/{id}/uptime?from=…&to=…

{ "total": 8640, "up": 8635, "down": 0, "degraded": 0, "error": 5, "uptime_pct": 99.94 }

Incidents query

GET /api/v1/targets/{id}/incidents?from=…&to=…&ongoing_only=false&limit=100&offset=0

Returns coalesced down / error periods. A contiguous run of bad statuses becomes one incident; an up result between two bad runs splits them. Ongoing incidents return ended_at: null and duration_secs: null.

{
  "items": [
    {
      "id": "01h7m8z4n6v0e1m7v7y6x8x8x8",
      "target_id": "01h7m...",
      "started_at": "2026-05-13T11:30:00.000Z",
      "ended_at":   "2026-05-13T11:35:00.000Z",
      "status":     "down",
      "duration_secs": 300,
      "check_count": 5,
      "error_sample": "connection refused"
    }
  ],
  "total": 1, "limit": 100, "offset": 0
}

Tags inventory

GET /api/v1/tags?q=prod&limit=100

Returns every tag currently in use across the caller’s targets (enabled or disabled), with target count, sorted by descending count then alphabetical. q is a prefix filter for autocomplete. Scoped to the active org — in SaaS mode another org’s tags are invisible.

{ "items": [ { "name": "prod", "count": 12 }, { "name": "staging", "count": 4 } ],
  "total": 2, "limit": 100, "offset": 0 }

Dashboard summary

GET /api/v1/dashboard/summary — per-org rollup cached in-process for 5 seconds (keyed by OrgId, so two tenants never share an entry).

{
  "targets":        { "total": 42, "enabled": 40, "disabled": 2 },
  "current_status": { "up": 38, "down": 1, "degraded": 1, "error": 0, "unknown": 2 },
  "last_24h":       { "checks_total": 50400, "checks_up": 50360, "uptime_pct": 99.92, "incidents": 3 },
  "system":         { "in_flight_checks": 5, "result_queue_depth": 12, "dropped_results_last_5m": 0, "circuit_breakers_open": 0 }
}

On-demand operations

POST /api/v1/targets/test — runs one check against a raw CheckSpec, no persistence. Same SSRF / URL-scheme / port validation as POST /targets. Returns TestResponse { result, matched_expectations, warnings }.
POST /api/v1/targets/{id}/check-now — runs one check against an existing target using its stored credentials, dispatched to an agent in the target’s region. Result is persisted. Returns 503 PROBE_UNAVAILABLE if no agent is currently serving the region.
POST /api/v1/targets/bulk-action — apply one action atomically to up to 10,000 ids. Partial failure allowed; the response lists succeeded and failed separately, with per-id code + message.

{
  "ids": ["01h7m...", "01h7n..."],
  "action": { "type": "disable" }
  // alternatives: { "type": "enable" }, { "type": "delete" },
  //   { "type": "tag_add",    "tags": ["frozen"] },
  //   { "type": "tag_remove", "tags": ["frozen"] }
}

Idempotency

POST /api/v1/targets/bulk and POST /api/v1/targets/bulk-action accept an optional Idempotency-Key header. The server stores the response for 24 hours keyed by (header value, body hash). A retry with the same key and body returns the original response without re-executing. A retry with the same key but a different body executes normally — the body hash is part of the cache key. The cache is in-process; entries are lost on restart.

POST /api/v1/targets/bulk-action HTTP/1.1
Idempotency-Key: 01h7m8z4n6v0e1m7v7y6x8x8x8
Content-Type: application/json

{ "ids": ["..."], "action": { "type": "disable" } }

Keyboard shortcuts

uptimepage