GitHub Outages in 2026: A Month-by-Month Analysis

GitHub is the world's largest code hosting platform, running services that 100 million developers depend on daily. When it goes down, CI/CD pipelines stall, deployments block, and teams lose access to code. Understanding when and why it fails - with real data, not vague status summaries - helps engineering teams build better contingency plans.

This analysis covers every public GitHub incident from May 27 through June 26, 2026, sourced directly from githubstatus.com. All durations, error rates, and root causes are taken from GitHub's own incident postmortems.

Incident Summary: May 27 – June 26, 2026

GitHub reported 25 incidents over this 30-day period. That averages to nearly one incident per calendar day - though most were narrow in scope (Copilot-specific or single-service), and several resolved in under 15 minutes.

Date	Incident	Duration	Root Cause
May 27	Git operations, PRs, Issues, API	69 min	Analytics component CPU saturation (cascade)
May 28	Multiple services elevated errors	9 min	Partial auth service deployment, rolled back
Jun 1	OpenAI models disruption	Not detailed	Upstream AI provider
Jun 1	Some GitHub services	Not detailed	Not detailed
Jun 4	Webhook APIs and UI degraded	Not detailed	Not detailed
Jun 5	Auth/API (0.11% wrong 404s) + Slack/Teams	70 min	Authorization component bug with user tokens
Jun 6	EU region: Codeload and Package Registry	43 min	Network circuit migration disrupted EU PoP
Jun 8	GitHub.com, REST API, GraphQL, Webhooks	5-12 min	Transient infrastructure capacity, self-resolved
Jun 8	Copilot Code Review failing	Not detailed	Not detailed
Jun 11	Webhooks delayed	~160 min	Not detailed in postmortem
Jun 12	EU region disruption	Linked to Jun 6	Network migration (same root cause)
Jun 12	Code Scanning and Billing delays	Not detailed	Not detailed
Jun 15	Feature flag service failure (analytics)	44 min	Feature flag client transient error, no retry
Jun 16	Pull Requests and Issues (signed-out)	55 min	Upstream model provider (Opus 4.8)
Jun 17	Copilot availability	Not detailed	Not detailed
Jun 18	Auth/API (9% sporadic 401s, +800ms latency)	80 min	memcached misconfiguration during rollout
Jun 18	Feature flags service elevated errors	Linked to Jun 15	Same feature flag service issue
Jun 19	Webhooks incident	Not detailed	Not detailed
Jun 19	Copilot next edit suggestions	Not detailed	Not detailed
Jun 23	Copilot next edit suggestions elevated errors	Not detailed	Not detailed
Jun 24	Some GitHub services	Not detailed	Not detailed
Jun 25	Webhooks latency increased	Not detailed	Not detailed
Jun 25	Webhooks, PRs, Actions, Issues degradation	Resolved 18:27 UTC	Not fully detailed

The Five Most Significant Incidents

1. May 27 - Git Operations Cascade (69 minutes)

Impact: 3.5% of HTTPS pushes failed. 0.2% of SSH pushes failed. Pull Requests, Issues, GraphQL API degraded.

Root cause: An internal analytics component generated unexpectedly high load, saturating CPU on the underlying infrastructure. Services that depended on Git operations began failing as a cascade.

Resolution: GitHub stopped the offending analytics component. Services recovered shortly after.

What went wrong: An internal background system - not directly user-facing - created enough load to degrade core user-facing services. The analytics component lacked resource limits or circuit breakers that would have contained its impact.

GitHub noted in the postmortem: "We are taking steps to add resource limits and kill switches."

2. May 28 - Partial Deployment Triggers Multi-Service Errors (9 minutes)

Impact: 10% of GitHub Actions runs failed to queue or encountered errors. Web experience, REST API, and Git operations all affected.

Root cause: A change partially deployed to an authentication service caused dependent services to fail. The partial rollout state - neither the old version nor the new one fully applied - was the failure mode.

Resolution: GitHub rolled back the change. Recovery was fast because the rollback was straightforward.

What went wrong: The deployment validation process didn't catch that a partial deployment would produce an inconsistent state that downstream services couldn't handle.

GitHub noted: "We are expanding test coverage and improving our deployment validation process."

This is a common pattern in large distributed systems: safe to deploy fully, unsafe to deploy partially.

3. June 5 - Authorization Bug Deletes Slack/Teams Subscriptions (70 minutes)

Impact: 0.11% of authenticated REST API requests returned incorrect "not found" responses. 12% of organizations with active Slack and Teams channel subscriptions had some subscriptions removed. 2% of all channel subscriptions deleted.

Root cause: A change to an internal authorization component introduced a bug that failed to correctly resolve user-to-server token access for organization-owned repositories. The Slack and Teams integrations interpreted the transient "not found" responses as permanent loss of access and deleted the subscriptions.

Resolution: GitHub reverted the authorization component change.

What went wrong: The authorization bug itself was one failure. But the bigger failure mode was the integrations treating a transient error as permanent. When the API returned 404, the Slack integration assumed the repository was gone and removed the subscription - irreversibly. Recovering deleted subscriptions required users to manually re-add them.

This illustrates a dangerous API consumer pattern: treating any "not found" as permanent action-required, rather than distinguishing between transient and durable errors.

4. June 18 - memcached Misconfiguration Causes 9% Auth Failures (80 minutes)

Impact: ~9% of API requests returned sporadic 401 errors. ~800ms of additional latency on affected requests. Users experienced intermittent "logged out" behavior.

Root cause: A memcached proxy service rollout to GitHub's internal API infrastructure caused the authentication service to pick up an incorrect memcached host configuration. When authentication lookups went to the wrong host, they failed - intermittently, not consistently, which made the issue harder to diagnose.

Resolution: GitHub deployed a configuration change to memcached to use the correct host.

What went wrong: Configuration changes to infrastructure components that authentication depends on require validation before rollout. A canary deployment or pre-rollout config verification step would have caught the incorrect host before production traffic hit it.

GitHub noted plans: "We plan to migrate our authentication system to prevent similar issues."

At 80 minutes, this was the longest duration incident in the period covered by detailed postmortems.

5. June 6 - EU Network Migration Disrupts Package Registry (43 minutes)

Impact: 0.95% average Codeload error rate. 9.2% average Package Registry error rate. Peak Package Registry errors reached 27%. Affected users whose traffic routed through European infrastructure.

Root cause: A planned network circuit migration disrupted connectivity at one of GitHub's European Points of Presence. The traffic-shifting process "did not operate as expected," leaving some production traffic routed through the affected site.

Resolution: Traffic shifted away from the affected PoP.

What went wrong: Planned maintenance caused an unplanned outage. The traffic-shifting procedure had a failure mode that the team hadn't fully anticipated. Package Registry errors hit 27% at peak - significant for teams doing package installs in CI pipelines routed through EU infrastructure.

Recurring Failure Patterns

Across the 25 incidents in this period, four patterns account for most of the impact.

Pattern 1: Webhooks (5 incidents)

Webhooks degraded or failed on June 4, June 11, June 19, and June 25 (twice). No single postmortem in this dataset explains what causes GitHub's webhook delivery to fail repeatedly. The frequency suggests either fragile infrastructure or a shared dependency that's hit by multiple different upstream issues.

For teams that depend on webhooks for CI/CD triggers, deployment notifications, or workflow automations, GitHub webhook failures are a significant operational risk. Having a secondary delivery mechanism or monitoring for missed webhook events is worth the investment.

Pattern 2: Copilot AI Services (6 incidents)

Copilot-specific incidents appeared on June 1, June 8, June 17, June 19, June 23, and affected June 16's model disruption. GitHub Copilot depends on external AI model providers (OpenAI, Anthropic), which introduces a dependency layer outside GitHub's direct control.

These incidents are largely independent of core GitHub services. If Copilot completions fail, PRs and Issues continue working normally. But for teams where Copilot is integrated into developer workflows, the frequency of AI model disruptions is notable.

Pattern 3: Deployment-Triggered Failures

Two of the five detailed incidents trace directly to a deployment or rollout: the May 28 partial authentication deployment and the June 18 memcached rollout.

Both could have been caught earlier with stricter pre-deployment validation. Both resolved quickly once identified. Both caused disproportionate impact relative to the change being made - the May 28 incident affected 10% of Actions runs from a single configuration change.

Pattern 4: Auth and API Instability

The June 5 authorization bug and June 18 memcached issue both affected authentication. Auth is a foundational dependency - when it degrades intermittently, every service that requires authentication sees errors. The 80-minute duration of June 18 and the subscription deletion side effect of June 5 make these the highest-impact incident types in this dataset.

Incident Frequency by Affected Service

Service	Incidents (May 27 – Jun 26)
Webhooks	5
Copilot / AI features	6
API / Auth	4
Core GitHub services (PRs, Issues, Git)	3
EU / Regional	2
Other (Code Scanning, Billing)	2

Uptime Estimates

GitHub doesn't publish an overall uptime percentage on their status page. Based on the detailed postmortem durations available:

Incident	Duration
May 27 Git cascade	69 min
May 28 Auth deployment	9 min
Jun 5 Auth/API/Slack	70 min
Jun 6 EU network	43 min
Jun 8 GitHub.com/API	5-12 min
Jun 11 Webhooks	~160 min
Jun 15 Feature flags	44 min
Jun 18 Auth/API memcached	80 min
Total (documented)	~500 min over 30 days

500 minutes of documented degradation over 30 days (43,200 minutes) represents roughly 98.8% availability for the services specifically affected during those windows - not accounting for the many incidents without detailed duration data.

This aligns with GitHub's informal track record of 99.x% availability, with occasional multi-hour events and frequent short-lived degradations.

What This Means for Teams That Depend on GitHub

Don't build pipelines with a single webhook trigger. Webhooks are GitHub's most unreliable service based on this dataset - five incidents in one month. If a missed webhook blocks a deployment or notification, build a polling fallback.

Model AI feature dependency separately. Copilot, Code Review AI, and AI-powered features depend on upstream model providers that GitHub doesn't control. Design workflows that degrade gracefully when Copilot is unavailable.

Monitor your integration points. The June 5 incident deleted Slack/Teams subscriptions silently. If your GitHub Slack integration had stopped posting notifications, your team might not have noticed for hours. Monitor the output of your GitHub integrations, not just GitHub's status page.

Watch for EU-specific issues. Two incidents in this period specifically affected European infrastructure. If your team routes CI/CD through EU GitHub infrastructure, regional monitoring that checks from inside Europe gives earlier signal than a US-based check.

Watch the GitHub Status API. GitHub publishes machine-readable status at api.githubstatus.com/v2/summary.json. Monitor that endpoint programmatically or subscribe to status page notifications so you get the first alert, not the second-hand report from a developer who noticed their PR wasn't building.

All incident data sourced from githubstatus.com and GitHub's published postmortems. Durations and error rates are taken verbatim from GitHub's own incident reports. This analysis covers the 30-day window available in the public incident feed at time of writing (June 26, 2026).