
The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

Increase visibility into network incidents using moovingon.ai and Datadog

moovingon.ai is a platform that consolidates alerts, incidents, audits, runbooks, and other resources for 24/7 network operations center (NOC) engineering teams. These teams often have to work collaboratively to maintain uptime for mission-critical cloud infrastructure and applications and need specialized resources to facilitate investigations in the event of an issue.

New private status ingestion integrations: Meraki, Neat Pulse, AT&T

Managing the reliability and uptime of critical services is a cornerstone of smooth business operations. While public cloud status pages provide general updates, they often fall short of reflecting the true status of your specific hosted tenants. Enter private status ingestion, a powerful feature available exclusively on our Enterprise plan.

What is Performance Engineering?

Performance engineering transforms how organizations build and optimize software systems. System delays and performance issues directly impact revenue, user satisfaction, and business success. This guide covers performance engineering fundamentals, implementation approaches, and advanced strategies for building high-performing systems.

Troubleshooting SD-WAN with Kentik Journeys AI

Discover how Kentik Journeys simplifies SD-WAN troubleshooting with the power of AI. In this video, we walk through identifying and resolving a network issue impacting a business application that uses a Postgres database. See how Kentik's conversational interface streamlines iterative network analysis, offering real-time insights into traffic patterns, device metrics, and routing behaviors. Learn how Kentik Journeys empowers teams to diagnose root causes quickly and collaborate effectively.

The Year in Internet Analysis: 2024

Join Doug Madory, Kentik's Director of Internet Analysis, for an in-depth look at "The Year in Internet Analysis: 2024." This webinar replay explores key developments in BGP security, RPKI ROV adoption, and the evolving landscape of routing security. Discover insights into major submarine cable incidents, including their impacts and recovery, as well as an overview of Kentik's new Cloud Latency Map tool. Doug shares his expert perspectives on Internet trends, resilience, and what lies ahead in 2025.

Cloud Status Third-Party Monitoring Gets Upgraded!

At Uptime.com, we’re committed to helping you monitor and manage the uptime and reliability of your websites and critical infrastructure. Based on your feedback, we’ve enhanced Cloud Status to deliver even more powerful insights into third-party dependencies and improve your experience. Here’s what’s new and what’s coming next!

Opslogix explores: How to bridge the gap between SCOM and Grafana with a SCOM Prometheus Exporter

As an observability architect, I have seen firsthand the power and importance of a robust monitoring solution. For infrastructure monitoring, System Center Operations Manager (SCOM) stands tall. It is widely adopted and excels at monitoring the health and performance of infrastructure. However, as the need for advanced observability grows, such as tracking application logs and tracing code paths, SCOM's capabilities can fall short.