Operations | Monitoring | ITSM | DevOps | Cloud

It's Never Different This Time: LLM Reliability Without the Hype with Julien Simon

In this episode, Julien Simon, longtime voice in the open-source ML world, reminds us that even in the era of GenAI, reliability fundamentals haven’t changed. Julien breaks down why calling “the same model” from different providers can produce wildly different results, how deployment choices introduce hidden variability, and why reliability teams need to think of LLM systems as distributed systems.

ML inference in PHP by example: leverage ONNX and Transformers on Symfony

This blog is based on a presentation by Guillaume Moigneu at the Symfony 2024 conference. Machine learning and AI are no longer limited to Python and Node.js. PHP developers can now run AI models directly in their applications using modern tools and libraries. This guide shows you how to implement machine learning inference in PHP using ONNX and Transformers.

Node.js Performance Monitoring Guide

Node.js applications power millions of APIs, microservices, and real-time systems. But without proper monitoring, performance issues, memory leaks, and errors can go undetected until they impact users. This guide explains how to monitor Node.js applications in production, what metrics to track, and which tools deliver the best results.

Agents of IT podcast - Ep. 6 - What's real agentic AI and what's just hype?

Sean Heuer and Ari Stowe break down “agent washing,” governance, and what it really means for AI to take action instead of just chatting. In this clip from Agents of IT, they share practical ways to spot the difference between chatbots, scripted automations, and true agentic systems that can plan, reason, and execute autonomously. Watch the full episode to hear their perspective on.

Explore Cloud Instance Pricing and Performance with Datadog Instance Explorer

Meet Datadog Instance Explorer — a way to explore, compare, and monitor cloud instance pricing and performance across AWS, Azure, and Google Cloud in one place. In this quick overview, you’ll learn how to: Start exploring your instance options today and make smarter, data-driven infrastructure decisions.

Build a multi-agent AI system using CrewAI, Gemini, and CircleCI

Multi-agent AI systems are trending in the software development industry right now. These systems consist of a group of individual agents that collaborate to achieve a desired goal. They mimic real world teams and departments in how they are organized. In multi-agent AI systems, each agent is assigned a task that is required to achieve a final output.

Grafana 12.3 release: Interactive learning experiences, new and improved logs visualizations, and more

Grafana 12.3 is here, delivering new features for interactive learning, deeper insights into logging data, and so much more. Overall, a big theme in the latest minor release is to make data exploration easier, faster, and more customizable. Grafana 12.3: Download now! Below are just some of the highlights from Grafana 12.3. If you want to explore all the latest updates, please refer to the changelog or our What’s New documentation, and be sure to check out the TL;DR video below.

Grafana Data Visualization Update: Panel Time Settings & Time Comparison in 12.3

The new panel time settings drawer gives you greater control over time ranges and shifts at the panel level without editing the dashboard. The time comparison feature, in particular, was a request from the community, and allows you to easily perform time-based (for example, month-over-month) comparative analyses in a single view. This eliminates the need to duplicate panels or dashboards to perform trend tracking and performance benchmarking.