
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Challenges in Monitoring Applications That Use OAuth

OAuth (Open Authorization) has become a critical component in enabling secure third-party access to APIs, which makes it one of the most widely adopted authorization protocols for modern applications. From letting users sign in to apps with their Google or Facebook accounts to enabling third-party service integrations, OAuth simplifies granting access to resources without compromising security.

What are Kubernetes audit logs and how to monitor them?

Security and compliance: Many industries, especially those governed by regulations like HIPAA, the PCI DSS, or the GDPR, require detailed logs for compliance and to trace security incidents. Troubleshooting and forensic analysis: If something goes wrong—whether due to accidental configuration changes or malicious activity—having detailed logs helps diagnose the root cause and quickly remediate it.
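
As a rough illustration of the forensic use case above, the following sketch filters an audit log for events worth a closer look. It assumes the API server writes one JSON-encoded `audit.k8s.io/v1` event per line (the log backend's format); the choice of "interesting" events (state-changing verbs and any access to Secrets) and the function names are assumptions for this example, not something the article prescribes.

```python
import json
from typing import Iterable, Iterator

# Verbs that change cluster state; reads such as get/list/watch are skipped.
MUTATING_VERBS = {"create", "update", "patch", "delete", "deletecollection"}


def suspicious_events(lines: Iterable[str]) -> Iterator[dict]:
    """Yield audit events that mutate state or touch Secrets.

    Assumes one JSON-encoded audit.k8s.io/v1 Event per line.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or non-JSON lines
        if not isinstance(event, dict):
            continue
        # A request is logged at several stages; keep only completed ones.
        if event.get("stage") != "ResponseComplete":
            continue
        resource = (event.get("objectRef") or {}).get("resource")
        if event.get("verb") in MUTATING_VERBS or resource == "secrets":
            yield event


def summarize(event: dict) -> str:
    """Render who did what to which object, plus the response code."""
    user = (event.get("user") or {}).get("username", "unknown")
    ref = event.get("objectRef") or {}
    code = (event.get("responseStatus") or {}).get("code")
    return (
        f"{user} {event.get('verb')} "
        f"{ref.get('resource')}/{ref.get('name')} "
        f"in {ref.get('namespace')} -> {code}"
    )
```

In practice the lines would come from the audit log file configured on the API server, and the summaries would be forwarded to whatever alerting or log platform is in use.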

Using Amazon RDS for high availability: How monitoring ensures reliable failover

Database downtime can lead to significant disruptions, revenue loss, and frustrated users. Amazon Relational Database Service (RDS) provides a managed database solution with high availability and automated failover to minimize such risks. However, continuous monitoring is crucial to ensuring reliable failover and minimizing downtime by detecting potential issues before they impact operations.

Managing Multiple Service Instances with a Systemd Generator

When working with systemd services in Linux, you might encounter situations where multiple instances of a service need to be managed dynamically. When I had to develop a solution to monitor multiple Kubernetes clusters with Icinga for Kubernetes, I ran into exactly this challenge.
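
To make the idea concrete, here is a minimal sketch of what a generator for this situation might do. systemd runs generators early at boot and on daemon reload, passing three output directories (normal, early, late); units or symlinks written there are picked up as if they were installed. The config directory, the `*.env` naming scheme, and the template unit path below are all hypothetical, and real generators are usually small shell or compiled programs, so Python is used here only for readability.

```python
import sys
from pathlib import Path
from typing import List

# Hypothetical locations, not taken from the article.
CONFIG_DIR = Path("/etc/icinga-kubernetes")
TEMPLATE_UNIT = Path("/usr/lib/systemd/system/icinga-kubernetes@.service")


def generate(config_dir: Path, normal_dir: Path, template_unit: Path) -> List[str]:
    """Enable one template instance per *.env file found in config_dir."""
    created: List[str] = []
    if not config_dir.is_dir():
        return created
    wants_dir = normal_dir / "multi-user.target.wants"
    # "name@.service" splits into "name" and ".service".
    prefix, suffix = template_unit.name.split("@", 1)
    for env_file in sorted(config_dir.glob("*.env")):
        # prod.env -> icinga-kubernetes@prod.service
        instance = f"{prefix}@{env_file.stem}{suffix}"
        wants_dir.mkdir(parents=True, exist_ok=True)
        link = wants_dir / instance
        if not link.exists() and not link.is_symlink():
            # Same layout "systemctl enable name@prod" would produce.
            link.symlink_to(template_unit)
        created.append(instance)
    return created


if __name__ == "__main__":
    # systemd calls generators with exactly three directory arguments;
    # only the first (normal priority) is used in this sketch.
    if len(sys.argv) == 4:
        generate(CONFIG_DIR, Path(sys.argv[1]), TEMPLATE_UNIT)
```

With this layout, adding or removing a config file and running `systemctl daemon-reload` would add or remove the corresponding instance without editing any unit files by hand.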

Why Context Matters: Mastering Serverless App Monitoring

Hi there, and welcome to the second video in this series on observing AWS serverless applications with Datadog. In this video, you’ll learn how important it is to add custom business context to the telemetry you send to Datadog and how you can use that inside APM to quickly diagnose and debug issues. You’ll walk away with an understanding of the importance of distributed tracing, as well as how you can add specific business context to the telemetry you send.

Netdata vs. Prometheus: Which Monitoring Tool is Right for You?

Netdata's founder Costa Tsaousis built Netdata with performance and efficiency in mind. The result? 8x less RAM usage, 30x less disk I/O, 40x more data retention, 40x more data stored, and up to 22x faster queries—all thanks to our innovative tiered storage system, enabling ultra-efficient long-term queries.

State of DevOps: 2024 DORA Report Insights with Google

Enjoy this exclusive webinar with Ben Good from Google as we explore the findings in the 2024 State of DevOps report. For over a decade, the DORA report has provided critical insights into the capabilities and practices that fuel high-performing technology organizations. This report highlights the significant impact of AI on software development, explores platform engineering’s promises and challenges, and emphasizes user-centricity and stable priorities for organizational success.

GTmetrix Alternatives: The Best Tools for Website Performance Testing

GTmetrix used to be the go-to tool for checking website speed, but let’s be honest: paying for one-off synthetic tests isn’t worth it. If you’re still relying on synthetic testing alone, you’re missing a big part of the web performance picture. If you care about Core Web Vitals, SEO performance, and user experience, you need more than just lab data. The good news? There are better (and free) alternatives like PageSpeed Insights and WebPageTest for synthetic testing.

How to Implement OpenTelemetry in NestJS

Modern applications are becoming increasingly complex, and debugging distributed systems can feel like searching for a needle in a haystack. This is where OpenTelemetry (OTel) comes in. If you're using NestJS, integrating OpenTelemetry can provide deep insights into your application's behavior, helping you track performance, troubleshoot issues, and understand service interactions.