Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Generate metrics from your high-volume logs with Datadog Observability Pipelines

Logs are a rich source of information, providing you with the minute details you need to troubleshoot a specific issue or perform extensive historical analysis. But with billions of logs being generated from your infrastructure every day, it isn’t practical to sift through them all to derive actionable insights. Firewall, CDN, network activity, and load balancer logs are especially high volume, requiring storage solutions that can be expensive and difficult to scale.

The Role of External Service Monitoring in SRE Practices

Modern businesses rely on a variety of external services to support their operations, including APIs, cloud platforms, CDNs, payment gateways, and more. Whether it's pulling data from an external API, using a cloud service for storage, or integrating a third-party tool for analytics, these services help achieve many business objectives. Given their criticality, it’s important to have a reliable mechanism for monitoring external services.

The role of AI in Kubernetes monitoring

In a dynamic environment like Kubernetes, where manual tracking is impossible, AI-powered monitoring tools, such as Site24x7, surf through enormous amounts of data, detecting irregularities, predicting vulnerabilities, and alerting the user about a possible outage that is about to happen if the resource is not handled.

Best Practices for Mainframe Modernization with MQ Infrastructure

Mainframe systems may be the workhorses of many enterprises, but let’s face it, modernization is long overdue for most organizations. With decades-old infrastructure running mission-critical workloads, updating these systems isn’t just about keeping up with the times—it’s about ensuring that your business remains agile, competitive, and efficient. And a big part of this journey? MQ infrastructure. They form the backbone of communication between mainframes and newer technologies.

The Rising Role of Slack in Incident Management

Why is Slack becoming so popular in incident management? Slack is one of the most popular communication tools used in companies. If you're part of a remote team, your team is probably on Slack or something similar like MS Teams. Although IM tools lack the communication nuances that are taken for granted in face to face interactions, they provide many other advantages.