Structured response from first alert to resolution
Vantaj opens, tracks, and resolves incidents automatically as your monitors change state. Every outage gets a timeline, a duration, and a record — so your team responds faster and learns from every event.
Automatic incident creation
The moment a monitor fails, an incident is opened with a timestamp, affected service, and severity level. No manual triage required.
Auto-resolution
When a monitor recovers, the incident closes automatically and its duration is calculated for you, so no one has to remember to mark it resolved.
Escalating alert policies
Define multi-step escalation policies: notify on-call immediately, page the team lead after 10 minutes, and trigger a webhook after 30 minutes. Every step and delay is configurable.
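To make the idea concrete, here is a minimal sketch of how a three-step policy like the one above could be modeled. It is an illustration only: `EscalationStep`, `POLICY`, `due_steps`, and the action labels are hypothetical names, not Vantaj's actual configuration format or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EscalationStep:
    delay_minutes: int  # minutes after the incident opens
    action: str         # hypothetical label for what the step does


# Hypothetical policy mirroring the example: immediate, 10 minutes, 30 minutes.
POLICY = [
    EscalationStep(0, "notify_on_call"),
    EscalationStep(10, "page_team_lead"),
    EscalationStep(30, "trigger_webhook"),
]


def due_steps(policy, minutes_open):
    """Return the actions whose delay has elapsed for a still-open incident."""
    ordered = sorted(policy, key=lambda step: step.delay_minutes)
    return [step.action for step in ordered if step.delay_minutes <= minutes_open]
```

For an incident that has been open for 12 minutes, `due_steps(POLICY, 12)` returns the first two actions; the webhook step is not yet due.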
Severity levels
Incidents are classified as Critical, Major, or Minor based on your policy configuration. Route and escalate differently depending on severity.
MTTR tracking
Mean Time to Resolution is calculated automatically across all incidents. Understand your team's response performance over time.
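As a rough illustration of the metric, MTTR is the average of the resolution durations of closed incidents. The sketch below assumes each incident is a `(opened_at, resolved_at)` pair and that still-open incidents are excluded; both the function name and that data shape are assumptions for the example, not a description of how Vantaj computes it internally.

```python
from datetime import datetime, timedelta


def mean_time_to_resolution(incidents):
    """Average duration of resolved incidents, or None if there are none.

    `incidents` is a list of (opened_at, resolved_at) datetime pairs;
    resolved_at is None while an incident is still open (assumed shape).
    """
    durations = [end - start for start, end in incidents if end is not None]
    if not durations:
        return None
    return sum(durations, timedelta()) / len(durations)
```

Two incidents lasting 10 and 30 minutes give an MTTR of 20 minutes under this definition.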
Full incident history
Every incident is stored with its complete timeline — start, escalation steps, recovery, and total duration. A permanent record for postmortems and SLA reporting.
Incident lifecycle
| Stage | What happens |
|---|---|
| Detected | Monitor fails; multi-region verification confirms the outage is real |
| Incident opened | Incident record created with severity, affected service, and start timestamp |
| First alert | Immediate notification to configured channels: Slack, email, webhook, and more |
| Escalation | If unresolved after your configured delay, escalation steps fire to reach broader teams |
| Resolved | Monitor recovers; incident closes, duration calculated, recovery notification sent |
| Reported | Incident logged in history for MTTR analysis, SLA reporting, and postmortems |
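The stages in the table can be read as a small state machine. The sketch below is one possible reading, assuming that an incident may resolve either straight after the first alert or after one or more escalation steps; the `Stage` enum, `TRANSITIONS` map, and `advance` function are hypothetical and only illustrate the flow.

```python
from enum import Enum


class Stage(Enum):
    DETECTED = "Detected"
    OPENED = "Incident opened"
    FIRST_ALERT = "First alert"
    ESCALATION = "Escalation"
    RESOLVED = "Resolved"
    REPORTED = "Reported"


# Allowed next stages (an assumption drawn from the table's ordering).
TRANSITIONS = {
    Stage.DETECTED: {Stage.OPENED},
    Stage.OPENED: {Stage.FIRST_ALERT},
    Stage.FIRST_ALERT: {Stage.ESCALATION, Stage.RESOLVED},
    Stage.ESCALATION: {Stage.ESCALATION, Stage.RESOLVED},
    Stage.RESOLVED: {Stage.REPORTED},
    Stage.REPORTED: set(),
}


def advance(current, nxt):
    """Move to the next stage, rejecting transitions the table does not allow."""
    if nxt not in TRANSITIONS[current]:
        raise ValueError(f"cannot go from {current.value} to {nxt.value}")
    return nxt
```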
How it works
Define who gets notified and when
Set up escalation steps: immediate notification to your on-call channel, then broader alerts after a configurable delay if the incident isn't resolved.
Connect policies to services
Assign alert policies to individual monitors or groups. Different services can have different escalation paths — critical infrastructure can page an entire team while minor services send a single Slack message.
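One way to picture policy assignment is a lookup that checks the individual monitor first and falls back to its group. The monitor names, group names, policy names, and the monitor-before-group precedence below are all assumptions made for this sketch; the page does not specify how conflicts between the two levels are resolved.

```python
# Hypothetical assignments: one policy set directly on a monitor, one on a group.
MONITOR_POLICIES = {"checkout-api": "page-whole-team"}
GROUP_POLICIES = {"internal-tools": "single-slack-message"}
MONITOR_GROUPS = {"checkout-api": "core", "wiki": "internal-tools"}


def policy_for(monitor):
    """Return the policy name for a monitor, or None if nothing is assigned.

    Assumed precedence: a policy on the monitor itself wins over its group's.
    """
    if monitor in MONITOR_POLICIES:
        return MONITOR_POLICIES[monitor]
    group = MONITOR_GROUPS.get(monitor)
    return GROUP_POLICIES.get(group)
```

Here `policy_for("checkout-api")` resolves to the team-wide paging policy, while `policy_for("wiki")` inherits the single Slack message policy from its group.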
No manual work during outages
When a monitor fails, the incident opens, the timeline starts, and alerts fire — all automatically. Your team can focus entirely on resolution, not incident coordination.
Learn from every outage
After resolution, review the incident timeline, MTTR, and escalation log. Use the data to tighten alert policies, identify flaky services, and reduce future response times.
Frequently asked questions
Can I manually open or close incidents?
How are severity levels determined?
Does Vantaj send recovery notifications?
How long is incident history retained?
Can I post manual updates to an incident?
Ready to set up Incident Management?
Up and running in under a minute. Free for up to 20 monitors, no credit card required.
Start for free