10 Ways to Use Vantaj to Monitor Your Stack (Beyond Basic Uptime)
Most teams set up one uptime check and forget about it. Here are 10 specific ways to use Vantaj's monitoring features - HTTP checks, DNS monitoring, heartbeats, SSL, and status pages - to catch real problems before your users do.
Most teams use uptime monitoring the same way: paste in the homepage URL, pick a 5-minute check interval, and move on. That catches the obvious outage - when the site goes down completely - but misses most of the ways production systems actually fail.
Here are 10 specific ways to use Vantaj that go beyond basic uptime checks, with the setup steps and the reasoning behind each.
1. Monitor Health Endpoints, Not the Homepage
What most teams do: Monitor https://yourapp.com and accept a 200 response as healthy.
The problem: Load balancers return 200 with an error page when the app server crashes. CDNs return 200 with cached content from hours ago. Your homepage can show 200 while your database, payment service, and authentication system are all failing.
The fix: Build a /health endpoint that checks your critical dependencies and monitor that instead.
// What your /health endpoint should return
{
"status": "ok",
"database": "ok",
"cache": "ok",
"queue": "ok"
}
In Vantaj, set up your HTTP monitor with a keyword assertion to verify the response body contains "status":"ok". If your database goes down and the endpoint returns "status":"degraded", the monitor fires even though the HTTP status code is still 200.
Setup: Add monitor → enter your /health URL → under "Response validation" → add keyword "status":"ok" → save.
2. Monitor Every Environment, Not Just Production
Staging and development environments break differently from production. A broken staging environment means developers can't test features, blocking your release pipeline. A misconfigured preview deployment can block code review.
Add monitors for:
staging.yourapp.com- set a 5-minute interval (not worth waking anyone up for staging, but worth knowing)- Preview deployment URLs - if your deployment platform generates stable preview URLs (Vercel, Render, Railway), monitor them
- Internal admin panels - admins notice outages last; monitoring catches them first
Use alert policy routing to keep staging alerts out of the on-call rotation. Route staging failures to Slack only, not SMS. Route production failures to SMS + Slack. Vantaj lets you attach different alert policies to different monitors.
3. Track DNS Records for Security and Change Detection
DNS changes are invisible until they cause problems. An attacker who gains access to your registrar can change your A records to point at a phishing server - and unless you're watching the DNS, you won't know until users report it. The average detection time for DNS hijacking without monitoring is over four hours.
Less dramatically: an engineer makes a DNS change during a migration, gets the CNAME wrong, and forgets to revert it. The error propagates silently.
Vantaj's DNS monitoring checks your configured records on each interval and alerts immediately when any value changes. Set it up for:
- A record - alerts if your domain resolves to a different IP
- MX records - alerts if your email routing changes
- CNAME records - alerts if subdomains are redirected unexpectedly
- Nameservers - alerts if your DNS is moved to a different provider without your knowledge
Setup: Add monitor → select "DNS" type → enter your domain → configure the record type and expected value → save.
Each DNS monitor checks one record type. Add separate monitors for A, MX, and your most critical CNAMEs.
4. Monitor Third-Party API Dependencies
Your service can be perfectly healthy while your users see errors - because an API you depend on is down.
Payment processors, authentication providers, email services, AI APIs, shipping calculators, address validation services - these are all outside your control, but their failures become your outages. When Stripe's API goes down, your checkout fails. When your email provider's sending API returns 500, your transactional emails stop.
Add HTTP monitors for the public health or status endpoints of your critical dependencies:
https://api.stripe.com/v1(returns 200 for authenticated requests)https://status.stripe.com/api/v2/summary.json- returns current component status- Your authentication provider's token endpoint
- Your email provider's API endpoint
When a third-party dependency starts failing, you know before your users complain and before you spend 30 minutes debugging what you assume is your own code.
5. Monitor Cron Jobs with Heartbeats
Cron jobs fail silently. A failed database backup, a stalled queue processor, a report generator that stopped running - none of these produce an HTTP error or trigger an uptime alert. You find out weeks later when the downstream effects become visible.
Heartbeat monitoring flips the model: instead of Vantaj checking whether your endpoint responds, your cron job pings Vantaj after it completes. If the ping doesn't arrive within the expected window, Vantaj alerts you.
Add one curl call to the end of your cron script:
# Your existing cron job
pg_dump mydb | gzip > backup.sql.gz
aws s3 cp backup.sql.gz s3://my-backups/
# Add this at the end
curl -s "https://app.vantaj.co/api/hb/your-token" > /dev/null
If the backup fails and exits early, the ping doesn't fire. Vantaj alerts you that the heartbeat is missed.
Setup: Add monitor → select "Heartbeat" type → set expected interval (match your cron schedule) → set grace period (how long to wait before alerting) → copy the ping URL → add to your script.
Use heartbeats for:
- Database backups
- Report generation jobs
- Queue processors that should run continuously
- Data sync pipelines
- Log rotation jobs
6. Set Up SSL Certificate Monitoring with Meaningful Lead Times
Let's Encrypt certificates expire every 90 days. Commercial certificates expire annually or biannually. Auto-renewal processes fail more often than most teams expect - misconfigured Certbot cron jobs, changed domain ownership, certificate authority rate limits, or expired authorization credentials all cause silent renewal failures.
You don't know the renewal failed until the certificate actually expires and your site starts throwing browser security warnings.
Vantaj monitors SSL certificates and alerts you at configurable lead times before expiry. Set alerts at 30 days and 7 days. The 30-day alert gives you enough time to investigate and fix a stuck auto-renewal. The 7-day alert means something has gone wrong with the fix and needs immediate attention.
Setup: Add monitor → select "SSL" type → enter your domain → configure alert thresholds (30 days, 7 days) → save.
Monitor SSL for:
- Your primary domain
- Any subdomain serving HTTPS content
- API endpoints on custom domains
- Third-party tools on your domain (status pages, docs sites, help centers)
7. Monitor Domain Expiry Before It Becomes a Crisis
A domain that expires takes your entire website, all email on that domain, every subdomain, and every API endpoint offline simultaneously. Domain squatters register expired domains within seconds of availability. Recovery can take days or weeks, and that's if you're lucky - sometimes you can't recover the domain at all.
The failures are mundane: a corporate credit card expires, a renewal notification goes to a former employee's email, someone disabled auto-renewal "temporarily" and forgot to re-enable it.
Vantaj checks domain expiry and alerts 60 days and 30 days before the registration ends. 60 days is enough time to investigate a billing problem, update a credit card, or transfer a registrar. 30 days means the situation needs immediate action.
Setup: Add monitor → select "Domain Expiry" type → enter your domain → save.
Monitor every domain you own - not just your primary one. Legacy domains from old projects, domain variants you registered defensively, and marketing campaign domains all need the same coverage.
8. Monitor TCP Ports for Non-HTTP Services
Not everything runs on HTTP. Your PostgreSQL database, Redis cache, SMTP mail server, custom TCP services, and game servers all listen on specific ports. An HTTP uptime check won't tell you if port 5432 is closed.
TCP port monitoring checks whether a port is open and accepting connections, without requiring HTTP. Use it for:
| Service | Port to monitor |
|---|---|
| PostgreSQL | 5432 |
| MySQL / MariaDB | 3306 |
| Redis | 6379 |
| SMTP (mail sending) | 25 or 587 |
| IMAP (mail receiving) | 993 |
| Custom TCP service | Your port |
Note on database monitoring: Don't expose database ports to the public internet to enable external monitoring. Vantaj's TCP checks run from external probe servers. If your database is (correctly) not accessible from the public internet, use a health check endpoint on your application that queries the database and returns a status - then monitor that endpoint with keyword assertion instead.
For publicly accessible services (mail servers, custom TCP endpoints), TCP monitoring catches port closures, firewall misconfigurations, and service crashes.
9. Use Your Status Page as a Communication Tool, Not a Vanity Page
A public status page that's never updated is worse than no status page. It tells users "everything is operational" while they're experiencing errors, which damages trust faster than the outage itself.
Vantaj's status page connects directly to your monitors. When a monitor fails, the corresponding status page component updates automatically. When the monitor recovers, the component updates back to operational. You don't need to log in and manually update anything during an incident - when you're most likely distracted.
Set up your status page to reflect your actual service architecture:
| Component | Maps to monitor |
|---|---|
| Website | HTTP monitor on yourapp.com/health |
| API | HTTP monitor on api.yourapp.com/health |
| Authentication | HTTP monitor on your auth endpoint |
| Email delivery | HTTP monitor on your email provider's API |
| Payment processing | HTTP monitor on your payment provider's health endpoint |
Share your status page URL in your product, documentation, and email footer. When something breaks, subscribers get notifications before they file support tickets.
Setup: Status Pages → Create → add components → map each component to an existing monitor → publish → set custom domain → share the URL.
10. Build a Multi-Region Latency Baseline
Response time from a server in Virginia doesn't tell you what users in Singapore experience. A caching misconfiguration might leave one region serving everything from origin while others hit the edge. A BGP routing change might add 200ms for users in Europe overnight.
Vantaj runs every check from multiple probe regions and shows you per-region response times. Use this data to:
Detect geographic performance regressions. If your Frankfurt probe shows 800ms response times while Virginia shows 120ms, something is wrong with European routing or your CDN configuration for that region.
Validate CDN configuration. After setting up Cloudflare, Fastly, or CloudFront, watch the per-region latency in your monitor dashboard. If one region is significantly slower than others, that region may be bypassing the cache or routing to a distant origin.
Catch regional outages before they're reported. The June 6, 2026 GitHub outage affected only EU infrastructure for 43 minutes. A team with a Frankfurt probe would have seen the failures immediately. A team with only a US probe might not have noticed until European developers started reporting problems.
Setup: On any HTTP monitor → view the monitor detail → the response time chart shows per-region latency. The multi-region check runs automatically on all plans.
The Full Coverage Checklist
| What you're monitoring | Vantaj feature | Alert condition |
|---|---|---|
| Service health | HTTP monitor with keyword assertion | Body doesn't contain "status":"ok" |
| SSL certificate | SSL monitor | Expires in < 30 days |
| Domain registration | Domain expiry monitor | Expires in < 60 days |
| DNS record changes | DNS monitor | Any record change |
| Cron job completion | Heartbeat | Missed ping within expected window |
| Third-party API health | HTTP monitor | Non-200 response |
| TCP port availability | TCP monitor | Port refused or timed out |
| Status communication | Status page | Automatic via monitor state |
Start with the HTTP health endpoint and SSL monitoring - those are the two changes that catch the most incidents most teams are currently missing. Add heartbeat monitoring for any scheduled job that processes important data. Build outward from there.
Each monitor takes under two minutes to set up. The 30 minutes you spend doing this is worth more than the next 30 minutes of any other reliability work.