Coinbase reviews the May outage incident: AWS cascading failure exposes architectural risks

By: rootdata|2026/06/02 04:45:00
0
Share
copy

Coinbase released a retrospective report on the large-scale service interruption event on May 7, 2026.

The outage lasted approximately 8 hours, with full recovery taking about 12 hours. During this time, trading, deposits, withdrawals, and most core services were unavailable or severely degraded. Coinbase stated that the outage was caused by multiple cooling units failing simultaneously in the cooling system of a data center in one availability zone (use1-az4) in the AWS us-east-1 region, triggering cabinet thermal protection shutdowns, which led to EC2 instances and EBS volumes going offline, affecting multiple internet services.

During the recovery process, the Coinbase trading matching engine lost quorum due to the cluster architecture deployed in a single AWS data center losing most nodes. It required urgent code adjustments and the reconstruction of a new node group to restore operation, gradually restarting market trading during the recovery.

Additionally, the AWS-managed Kafka (MSK) service experienced control plane failures, preventing the automatic re-election of partition leaders, further blocking quotes, fees, and some settlement and data flow systems, which expanded the overall impact.

After manual partition migration in collaboration with the AWS engineering team, the system gradually returned to normal. Coinbase stated that this incident exposed its shortcomings in cross-availability zone automatic switching capabilities and disaster recovery for managed middleware. The company will upgrade its cross-region hot backup architecture, strengthen regular failure drills, and migrate the Kafka system from dual availability zones to a three availability zone deployment, while also working with AWS to advance root cause fixes and improvements.

-- Price

--

You may also like

Tokenized US stocks are not the "liquidity killer" of the crypto market

"As garbage coins are gradually eliminated, the protocols, infrastructure, and financial products that can truly create value have the opportunity to obtain a more reasonable valuation."

Why do I still have confidence in ETH?

As stablecoins and RWAs accelerate on-chain, Ethereum's role as a global value settlement layer has only just begun, and the market will eventually reprice ETH.

CRCL surges and plummets, COIN follows with a dive: The real battle for interests behind the CLARITY Act

The leak of the CLARITY bill draft has triggered a plunge in Circle and Coinbase, directly hitting the core provision of the stablecoin "ban on interest," revealing the deep political and economic game in Washington's strict prevention of stablecoins evolving into on-chain savings accounts and the c...

What Is TradFi and Why Is Everyone Talking About It in 2026?

Gold is rallying, SpaceX is heading for a historic IPO, and oil remains highly volatile. Discover why TradFi is back in focus and how crypto traders can access these opportunities with USDT. Put another way, TradFi Is Having Its Biggest Moment Ever, and Crypto Traders Are Perfectly Positioned

From Poland to Paris: A Look Back at WEEX's Global Community Journey in May 2026

Follow WEEX's global journey across Poland, Barcelona, Dubai, Milan and Paris. Explore Bitcoin Pizza Day, LALIGA VIP experiences, Web3 networking events, trading education and more from an action-packed May.

WEEX WXT Eco Carnival: How to Join WXT Events and Plan Trading Tasks

The WEEX WXT Eco Carnival is an ecosystem campaign built around WEEX Token (WXT), designed for users interested in platform tokens, spot trading, futures trading, deposit tasks, and referral rewards.

Contents

Popular coins

Latest Crypto News

Read more
iconiconiconiconiconiconicon
Customer Support:@weikecs
Business Cooperation:@weikecs
Quant Trading & MM:bd@weex.com
VIP Program:support@weex.com