How to Monitor Website Availability from Multiple Countries
Single-region monitoring misses regional outages and fires false positives from network path failures. This guide explains how multi-country website monitoring works and how to set it up.
If your monitoring tool checks your website from one location in Virginia and there is a routing problem between Virginia and your users in Tokyo, your monitoring stays green while your Japanese users see errors.
Multi-country monitoring catches regional outages your single-probe setup will never see. It also eliminates most false positive alerts by requiring agreement across multiple independent vantage points before firing.
This guide explains how multi-country monitoring works and how to set it up.
Why Single-Region Monitoring Fails
A monitoring check from one location answers a narrow question: can this specific probe reach your server right now?
It does not answer: can your users in Germany, Australia, or Brazil reach your service right now?
Three failure modes single-region monitoring misses:
Regional CDN failures. CDN providers like Cloudflare and Fastly have regional edge nodes. When a European edge fails, European users get errors. Your US-based probe still hits a healthy US edge and shows green.
Anycast routing problems. Many services use anycast DNS or load balancers that route each user to the nearest available region. A probe in one region may route to a healthy node while users in another region route to a degraded one.
Transient path failures. Network paths between two specific locations fail temporarily without affecting other paths. A blip between your probe and server looks like an outage to a single-region check but is invisible to users on different network paths.
The practical result: your monitoring misses real regional outages and generates false alarms from path issues that affect nobody.
How Multi-Country Monitoring Works
Multi-country monitoring runs the same check from several geographically distributed probe locations simultaneously.
The key detail is what happens when results disagree.
Any-fail alerting - alerts when any single probe reports a failure - makes false positives worse, not better. Every probe has its own network path, each with its own chance of a transient failure.
Consensus-based alerting - alerts only when multiple probes independently confirm a failure - is what you want. When all probes report failure, your service is genuinely down globally. When one probe fails and others pass, a network path had a bad moment.
Check cycle example:
US-East: ✅ 200 OK - 142ms
EU-West: ✅ 200 OK - 198ms
AP-South: ❌ Timeout
Result: 1 of 3 probes failed
Action: No alert. Log as path issue. Re-check next interval.
---
Next cycle:
US-East: ❌ 503 Error
EU-West: ❌ 503 Error
AP-South: ❌ 503 Error
Result: 3 of 3 probes failed
Action: Alert fired. Incident opened.
Consensus eliminates the noise from transient network events. When your phone buzzes, every probe on every continent agrees your service is down.
Setting Up Multi-Country Monitoring in Vantaj
Vantaj runs checks from 10 global probe regions and applies multi-region consensus by default. You do not configure consensus as a feature - every monitor uses it.
Step 1: Add your monitor
- Log in to Vantaj
- Click Add Monitor
- Select HTTP/HTTPS
- Enter your URL:
https://yourdomain.com - Set check interval: 1 minute for production services
- Set expected status code:
200
Step 2: Configure check regions
Vantaj checks from all active probe regions by default. The current regions include:
- US East (Virginia)
- US West (Oregon)
- EU West (Ireland)
- EU Central (Frankfurt)
- Asia Pacific (Singapore)
- Asia Pacific (Tokyo)
- Asia Pacific (Sydney)
- South America (São Paulo)
- Middle East (Bahrain)
- Africa (Cape Town)
Select the regions that matter for your users. For a global SaaS, use all available regions. For a service with a regional user base, use the regions where your users are plus a few others for cross-validation.
Step 3: Set your alert policy
- Go to Alert Policies > New Policy
- Add your notification channels (Slack, email, SMS, webhook)
- Set escalation timing: notify secondary contact if no acknowledgment within 5 minutes
- Assign the policy to your monitor
Step 4: Add keyword validation
A status code check confirms the server responded. A keyword check confirms it responded correctly.
In monitor settings, add a keyword check: enter a string that appears in a healthy response and not in error pages.
For an API health endpoint:
{"status":"ok"}
For a website:
<!-- your expected page title or a unique element -->
This catches cases where the server returns 200 with an error page - a common failure mode for load balancers and CDNs.
What to Monitor from Multiple Countries
Not every endpoint needs global coverage. Prioritize by user impact.
Monitor globally
- Your homepage or main landing page
- Login and authentication endpoints
- API health check endpoints
- Checkout and payment flows
- Any endpoint in your SLA
Monitor regionally
- CDN performance check (add a monitor per major user region if you serve users globally and use regional CDN edges)
- Third-party vendor endpoints that have regional failover
Skip multi-country for
- Internal admin panels (users are on your network)
- Staging environments
- Batch processing endpoints not user-facing
Reading Multi-Region Results
When a monitor reports an incident, check which regions confirmed the failure.
| Pattern | What it means |
|---|---|
| All regions failing | Global outage - your service is down |
| One region failing, others passing | Regional issue or network path problem |
| Two of five regions failing | Possible regional CDN or routing failure |
| Intermittent failures across regions | Overloaded origin, connection pool issues |
| One region consistently slower | Regional latency problem or edge node degradation |
Response time data across regions also reveals performance problems before they become availability problems. If your Tokyo probe reports 1,800ms while US probes report 200ms, users in Japan are experiencing degraded service even though the check passes.
Interpreting Regional Latency Data
Multi-country monitoring gives you latency by region. Track these thresholds:
| Latency | User experience |
|---|---|
| Under 200ms | Fast |
| 200 - 500ms | Acceptable |
| 500ms - 1s | Noticeable delay |
| 1s - 3s | Users start abandoning |
| Over 3s | Significant churn risk |
If a region consistently shows high latency, your users there are experiencing a different product than your internal testing shows. Common fixes:
- Add a CDN edge presence in that region
- Move static assets to a closer object storage bucket
- Use a regional database read replica to cut cross-continental query latency
Setting Up Regional Alerting
You may want different alert policies for different failure patterns.
Global outage policy (all regions failing): wake someone up via SMS and phone call immediately.
Regional failure policy (subset of regions failing): notify the on-call channel in Slack with lower urgency.
In Vantaj, you can configure separate alert policies and assign them to different monitors, or use escalation tiers within one policy.
Testing Your Multi-Region Setup
Verify the monitoring works before you rely on it.
Test 1: Global detection
Temporarily point a test monitor at a URL that returns a 503:
https://httpstat.us/503
Confirm all regions report the failure and an alert fires.
Test 2: Regional check
Confirm the probe list shows checks arriving from the expected regions. In Vantaj, the incident detail view shows which regions confirmed the failure.
Test 3: Recovery notification
Delete the test monitor and confirm a recovery notification arrives through your configured channels.
Quick Checklist
- Monitors configured with at least 3 probe regions
- Consensus-based alerting enabled (not any-fail)
- Production endpoints checked at 1-minute intervals
- Keyword validation set for critical monitors
- Regional latency baseline documented for each endpoint
- Alert policy covers both global and regional failure patterns
- Full pipeline tested end-to-end