Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

ITOps vs. DevOps: what's the difference?

Titles within an organization evolve nearly as fast as the technology itself. For a long time, the title of DevOps was considered a literal interpretation of “Development” and “Operations” – a catch-all term for hybrid roles encapsulating everything from on-prem, cloud, and hybrid infrastructures, to code execution and lifecycle management. Sounds like a lot? It is.

ML-Powered Assistance for Adaptive Thresholding in ITSI

Adaptive thresholding in Splunk IT Service Intelligence (ITSI) is a useful capability for key performance indicator (KPI) monitoring. It allows thresholds to be updated at a regular interval depending on how the values of KPIs change over time. Adaptive thresholding has many parameters through which users can customize its behavior, including time policies, algorithms and thresholds.

From Solution to Startup

Before Datadog was a widely adopted SaaS platform, it was a tool developed to solve our founders’ own monitoring needs. As technology-oriented people, we often build solutions for our own problems, then discover those problems are widespread. But how do you know when your solution should be something more? In this panel session, we’ll talk with tech startup founders to hear their stories and advice for turning tools into businesses.

Integrate Monitoring as Code into your Software Development Lifecycle

Learn how the new Checkly features (code exporter and activity log) enable you to integrate Monitoring as Code into your Software Development Lifecycle. Define and debug your monitoring resources during development, test your preview deployments and start monitoring productions with ease.

Unleash Microsoft Call Quality Dashboard Insights

Finding answers when someone has a Teams performance issue is clunky and time-consuming for IT teams. The Microsoft Call Quality Dashboard (CQD) has a wealth of data, but there’s so MUCH data that it can be hard to find the answers quickly to optimize Microsoft Teams performance.

Troubleshooting Cloud Application Performance: A Guide to Effective Cloud Monitoring

The scalability, flexibility, and cost-effectiveness of cloud-based applications are well known, but they’re not immune to performance issues. We’ve got some of the best practices for ensuring effective application performance in the cloud.

From On-call to Non-call: Resolving Incidents Before They Even Happen

Artificial intelligence has captured the attention of the world, with tools like ChatGPT and large language models (LLMs) driving the conversation. But you don’t need to wait for the future or new features powered by LLMs to start working smarter—the tech industry has been investing in intelligent, automated tools for years and they’re ready for production now. In this talk, you’ll learn how the engineering teams at Toyota Connected use tools like Datadog Watchdog, Anomaly Detection, and Workflows to make our lives easier and keep our platform stable.