Catchpoint

Incident Review: Another Week, Another AWS Outage

The following is an analysis of the Amazon Web Services incident on 12/15/2021. It may be the holiday season for most of us, but for AWS it appears to be Groundhog Day, Bill Murray style. For the second week in a row, the company reported an outage, this time affecting its US-West-2 region in Oregon and US-West-1 in Northern California.

A Year of WebPageTest and Catchpoint: A Q&A with Mehdi Daoudi and Jeena James

As WebPageTest and Catchpoint celebrate one year of partnership, Jeena James, General Manager, WebPageTest, sat down for a Q&A with Mehdi Daoudi, CEO and co-founder, Catchpoint, to look back at the key milestones from the last year and ahead to what's next. Hope you enjoy!

WebPageTest and Catchpoint: Our Year Building and Growing with the Community

WebPageTest recently completed a year as part of the Catchpoint family (yes, we acquired a company during the pandemic). In the past twelve months, we have built an entire WebPageTest team to power the developer experience around web performance. We’ve also launched initial premium experiences on the platform. Our developer community continues to contribute to the beloved open-source version, as well as share best practices with other users.

Why you Need WiFi Observability in the Era of Work From Anywhere

“Work from anywhere” is now common. With so many companies dependent on a distributed workforce, IT teams need to be able to quickly diagnose and troubleshoot WiFi problems, often while working remotely themselves. Remote workers, in turn, need consistent WiFi to do their jobs.

Incident Review - AWS Outages Crash Major Online Services - Including Amazon

The following is an analysis of the Amazon Web Services incident on 12/07/2021. Millions of users were affected by an Amazon Web Services outage that took down major online services such as Amazon, Amazon Prime, Amazon Alexa, Venmo, Disney+, Instacart, Roku, Kindle, and multiple online gaming sites. The outage, which originated in the US-EAST-1 region on Dec. 7, 2021, is still ongoing at the time of blog publication.

Incident Review - Google Cloud Outage has Widespread Downstream Impact

Outages on the Internet always catch you by surprise, whether you are the end user or the Head of SRE or DevOps trying to keep a clear mind while you execute your incident playbook. As people in charge of ensuring reliable services for our customers, our normal experience of outages involves surfing a deluge of fire alarms and video calls as we work to solve the problem as quickly as we can. We often forget, therefore, what an outage means to the end user.

Risky Business: Implementing a Redundant Networking and Multi-CDN Monitoring Strategy

Last month, we partnered with AWS to put together a webinar on the importance of implementing a comprehensive redundant networking and multi-CDN monitoring strategy. You can replay the event in full here. In this article, we’ll recap the key takeaways covered by the panel of experts, which included Leo Vasiliou, Director of Product Marketing at Catchpoint, and Steve Campbell, our Chief Strategy Officer.

Incident Review - Rolling Comcast Outage Disrupts Work from Home for Millions of Users Across the U.S.

The rolling Comcast outage on Monday, November 8th, and Tuesday, November 9th, knocked customers offline across the U.S. The first wave took place Monday evening in the San Francisco Bay area. The second, which had a wider geographic impact, occurred Tuesday morning, primarily affecting broad swathes of the Midwest, Southeast, and East Coast.