Buildkite Status

All Systems Operational

About This Site

Status updates for Buildkite’s services and components. You can also follow @buildkitestatus on Twitter for updates.

Uptime over the past 90 days. View historical uptime.

Web Operational

90 days ago

99.98 % uptime

Today

Agent API Operational

90 days ago

100.0 % uptime

Today

REST API Operational

90 days ago

100.0 % uptime

Today

Job Queue Operational

90 days ago

99.93 % uptime

Today

Notifications Operational

90 days ago

99.88 % uptime

Today

GitHub Commit Status Notifications Operational

Email Notifications Operational

Slack Notifications Operational

Webhook Notifications Operational

90 days ago

99.88 % uptime

Today

Hosted Agents Operational

90 days ago

100.0 % uptime

Today

Hosted Agents Operational

90 days ago

99.98 % uptime

Today

MacOS Operational

90 days ago

99.98 % uptime

Today

Linux (ARM64) Operational

90 days ago

99.99 % uptime

Today

Linux (AMD64) Operational

90 days ago

99.98 % uptime

Today

Package Registries Operational

90 days ago

100.0 % uptime

Today

Web Operational

90 days ago

100.0 % uptime

Today

Package Managers - API Operational

90 days ago

100.0 % uptime

Today

Rest API Operational

90 days ago

100.0 % uptime

Today

Test Engine Operational

90 days ago

99.93 % uptime

Today

Web Operational

90 days ago

99.93 % uptime

Today

Ingestion Operational

90 days ago

99.93 % uptime

Today

REST API Operational

90 days ago

99.93 % uptime

Today

SCM Integrations Operational

90 days ago

100.0 % uptime

Today

SCM Providers Operational

GitHub Operational

GitHub API Requests Operational

GitHub Webhooks Operational

Atlassian Bitbucket SSH Operational

Atlassian Bitbucket Website and API Operational

Atlassian Bitbucket Git via HTTPS Operational

Third Party Services Operational

AWS ec2-us-east-1 Operational

AWS elasticache-us-east-1 Operational

AWS elb-us-east-1 Operational

AWS rds-us-east-1 Operational

Docs Operational

Operational

Degraded Performance

Partial Outage

Major Outage

Maintenance

System Metrics Month Week Day

Web Response Time

Fetching

REST API Response Time

Fetching

Agent Job Dispatch

Fetching

Agent API Response Time

Fetching

Past Incidents

Mar 13, 2026

No incidents reported today.

Mar 12, 2026

No incidents reported.

Mar 11, 2026

Increased queue times on hosted agents

Resolved - This incident has been resolved.
Mar 11, 21:14 UTC

Monitoring - We identified increased demand affecting hosted agent queue times. We have added additional capacity and are seeing recovery of hosted agent queue times.
Mar 11, 20:44 UTC

Investigating - We are investigating reports of elevated queue times with hosted agents.
Mar 11, 19:50 UTC

Mar 10, 2026

Increased error rates from Test Plan API

Resolved - Our mitigations have resolved the elevated latency and likelihood of suboptimal fallback test plans. We have also identified and fixed a blind-spot in our automated alerting, which was previously unable to detect this scenario as an issue. Work continues this week to resolve the underlying performance issue by restructuring how the relevant data is ingested and accessed.
Mar 10, 09:34 UTC

Monitoring - We have implemented several mitigation and continue working on fixing the underlying cause. Our team is actively monitoring the situation to ensure the stability. We will provide further updates as we make progress on resolving this issue.
Mar 10, 02:25 UTC

Investigating - We've observed periodic test splitting plan timing out and falling back to non-intelligent splitting. Performance appears to be back to normal as of an hour ago. We are continuing to investigate the root cause and solve the underlying issue.
Mar 10, 01:21 UTC

Mar 9, 2026

No incidents reported.

Mar 8, 2026

No incidents reported.

Mar 7, 2026

Elevated ingestion latency for Test Engine

Resolved - Processing of test execution ingestion data has successfully caught up.
Mar 7, 01:05 UTC

Monitoring - We've identified the issue and the system is currently processing the backlog of test executions
Mar 7, 00:56 UTC

Investigating - We are investigating the elevated latency issue for Test Engine. Processing the backlog of test executions is taking longer than expected, so elevated ingestion latency remains.
Mar 7, 00:21 UTC

Mar 6, 2026

Slow artifact uploads

Resolved - With artifact upload latency continuing to be stable, we are resolving this incident.
Mar 6, 10:23 UTC

Monitoring - Latency for artifact uploads has remained at normal levels for some time now, and we now have a mitigation in place for a common source of load going forward. We are continuing to monitor.
Mar 6, 08:02 UTC

Investigating - We're investigating slow artifact uploads. This is isolated to artifacts, dispatch remains unaffected.
Mar 5, 22:14 UTC

Hosted Agents: Job start latency for a small subset of customers

Resolved - Buildkite Hosted Agents experienced degraded start-time performance due to a network partition issue in the Hosted Agents control plane. A small subset of customers may have seen delayed job starts during 04:40-04:50 UTC and 05:06-05:16 UTC. The issue has been resolved and we are monitoring to confirm stability.
Mar 6, 04:30 UTC

Mar 5, 2026

Mar 4, 2026

Elevated ingestion latency for Test Engine

Resolved - Processing of test execution ingestion data has successfully caught up.
Mar 4, 12:35 UTC

Update - Processing the backlog of test executions is taking longer than expected, so elevated ingestion latency remains.
We continue to monitor progress.
We have also identified database schema improvements we can make that will increase throughput to avoid this scenario in future.
Mar 4, 08:00 UTC

Monitoring - We've identified and fixed the root cause and now monitoring the issue. Test execution may be delayed for up to 30 minutes.
Mar 4, 03:59 UTC

Latency issues

Resolved - We have completed the provisioning of additional capacity mentioned in our last update, and error rates and response times have returned to normal.

This incident is now resolved.
Mar 4, 05:24 UTC

Update - We continue to observe high latency on isolated infrastructure serving Agent API endpoints for a subset of customers. We are provisioning additional capacity to address this latency, and have informed impacted customers.
Mar 4, 03:29 UTC

Update - We've seen a small number of unrelated issues, each affecting a subset of customers. Most impact is resolved, but we are continuing to monitor impact for a small number of remaining customers. We are in touch with those customers directly.
Mar 4, 01:06 UTC

Monitoring - We've made some changes to address the issue and are seeing signs of recovery. We continue to monitor the situation.
Mar 4, 00:11 UTC

Update - We continue to experience high latency on some services. We're continuing to identify root causes.
Mar 3, 23:21 UTC

Update - We're still experiencing latency issues for agent api and job dispatch. We continue to investigate and identify the root cause.
Mar 3, 22:41 UTC

Investigating - We're seeing elevated job dispatch latency and Agent API latency across multiple shards. We're investigating.
Mar 3, 21:51 UTC

Mar 3, 2026

Mar 2, 2026

No incidents reported.

Mar 1, 2026

No incidents reported.

Feb 28, 2026

No incidents reported.

Feb 27, 2026

Increased dispatch latency

Postmortem - Read details
Mar 4, 02:35 UTC

Resolved - We have seen a full recovery of service, and have a good understanding of the underlying cause. We will publish a post-incident review next week.
Feb 27, 02:30 UTC

Monitoring - We've seen recovery for the remaining subset of customers. We will continue to monitor.
Feb 27, 00:57 UTC

Investigating - We're seeing ongoing latency impact across for a subset of customers. Some customers are seeing signs of improvement, but we are continuing to investigate the issue.
Feb 26, 23:41 UTC

Monitoring - We're seeing signs of recovery and will continue to monitor.
Feb 26, 19:36 UTC

Identified - Some customers are experiencing increased latency for jobs being assigned to agents. We have identified the cause and are working on mitigations.
Feb 26, 19:10 UTC

All Systems Operational

About This Site

Related

Past Incidents