Operations | Monitoring | ITSM | DevOps | Cloud

%term

Amazon Cognito outage: How StatusGator notified customers 30 minutes before Amazon did

On December 12, 2024, Amazon Cognito experienced a significant outage in the US-EAST-1 (N. Virginia) region, impacting authentication for numerous applications. This operational issue, caused by a configuration change deployment, led to widespread “TooManyRequestsException” errors for several hours. Many Amazon Cognito users were left scrambling to figure out why their application was down, why users could authenticate, and how to get back up and running.

What is API Monitoring? How It Works, Benefits, & Best Practices

API Monitoring is the process of continuously observing and testing APIs to ensure they perform as expected, maintain uptime, and deliver the desired functionality. This includes tracking metrics such as API availability, uptime, latency, and response times. Whether you’re dealing with a REST API, a web API, or a microservices architecture, it’s important to understand that monitoring is essential for detecting issues before they impact end-users.

12 Ways We Sleighed Innovation This Year

As we wrap up an incredible year, it’s the perfect time to celebrate Cribl’s progress and innovation in 2024! This year brought many exciting features designed to solve real-world problems and make life easier for our customers. In the spirit of reflection and festivity, I’ll highlight twelve game-changing product features, releases, and enhancements— each a testament to listening, learning, and delivering value to you, our users.

Proactive Protection with Exposure Management

Every organization faces risk. What separates the vulnerable from the well-protected isn’t whether you have exposure — it’s how you manage it. Gaining visibility over your organization’s attack surface lets you prioritize which threats you address based on the degree of risk they pose. That’s when you can put together an exposure management strategy that empowers you to gauge your vulnerabilities and maintain the security posture appropriate to your business.

AI Log Analysis - Shaping the Future of Observability

As digital applications and infrastructures grow increasingly complex, managing and understanding log data has become increasingly vital in achieving practical observability, enabling organizations to detect, diagnose, and prevent issues across their systems. However, traditional log analysis methods often struggle with the volume and complexities of modern log data in cloud-native environments.

Full-Stack Observability with OpenTelemetry and DX Operational Observability

DX Operational Observability (DX O2) from Broadcom supports ingestion and retention of OpenTelemetry (OTel) data. Teams who have instrumented applications with OpenTelemetry SDKs and APIs can now ingest telemetry into DX O2 using the OpenTelemetry Collector, a core component of OpenTelemetry, and the OTel Collector Exporter, which is now available through early access in DX O2.