Operations | Monitoring | ITSM | DevOps | Cloud

Agentic AI Is Here - Are You Keeping Up?

Artificial intelligence (AI) has arrived in the workplace, powering everything from personalized experiences to automation to predictive analytics, all in service of better decision-making. No longer a buzzword tossed around in boardroom brainstorming or futuristic planning sessions, AI is a present-day reality reshaping how businesses operate. Generative AI kicked off the revolution, and its rapid adoption is changing how humans create and work.

Don't Let Agentic AI Become the Next Windows Paperclip

Microsoft’s recent trials of Copilot Vision are paving the way for Agentic AI, a proactive and context-aware assistant that can enhance productivity by intelligently responding to user needs. By having visibility into what you’re working on, such AI can anticipate tasks, offer relevant suggestions, and reduce the friction of daily workflows. However, history has shown us that AI assistance, if not executed correctly, can become more of a nuisance than an asset.

Getting started with InfluxDB dashboards

InfluxDB is a powerful open-source time-series database widely used for monitoring system performance, IoT metrics, and application telemetry. With SquaredUp's InfluxDB plugin, you can effortlessly visualize and monitor your InfluxDB data, gaining real-time insights into your metrics alongside your other tools and services. This guide will walk you through connecting InfluxDB with SquaredUp, creating dashboards, setting up monitoring, and sharing your visualizations.

This Month in Datadog - March 2025

On the March episode of This Month in Datadog, Jeremy Garcia (VP of Technical Community and Open Source) covers Attacker Clustering, Auto Test Retries, and new Observability Pipelines features, including keyword dictionaries and several integrations. Later in the episode, Jinwu Liu (Product Manager) spotlights Reference Tables, which is now generally available, and Yash Kumar (Product Lead, Cloud SIEM) shows how these tables can be used to add context to detection rules in Cloud SIEM.

Agentic AIOps use cases: How AIOps protects your revenue and reduces risk

Real problems need real solutions. We’ve all heard the same lofty claims about AI in IT operations: “Reduce alert noise” and “Detect anomalies.” While these sound great on paper, they often fall flat when critical systems fail during peak buying seasons or a major security threat goes undetected.

Benchmarking Kotlin Coroutines performance with CircleCI

A benchmark is a standard of comparison used to assess something. In everyday life, for example, when we want to buy a new cellphone and want to know which model is faster, we can look at a speed test (a benchmark) that measures how fast each cellphone opens applications or runs games. From there, we can compare the phones based on the numbers produced.

Ensuring your AI systems can scale to meet demand

The amount of traffic handled by AI systems can’t be overstated. Over half of all organizations in India, the UAE, Singapore, and China use AI, and traffic from generative AI sources has jumped by 1,200% since July 2024. While demand for AI-powered workloads is steadily increasing overall, traffic to individual AI providers is much more unpredictable. User demand spikes and wanes unexpectedly but, as with any service, users expect you to always be available and responsive.

When Readiness Really Matters: How Seasonal Spikes Become the Catalyst for Long-Term Discipline

Every engineering leader knows the stress of an upcoming seasonal spike. Whether it’s tax season, open enrollment, or Black Friday, there’s always that moment when someone asks, “Are we actually ready?” It’s usually followed by a scramble: auditing services, chasing down owners, updating spreadsheets, running perf tests, checking alerting thresholds, verifying infra configs. Much of it is manual, fragmented, and slightly different every time. It’s exhausting.

PagerDuty Pricing Breakdown 2025 (And How To Save 85%)

This in-depth analysis examines PagerDuty’s pricing structure for 2025, going far beyond the advertised rates to uncover the true total cost of ownership. We break down the additional fees, essential add-ons, implementation timelines, and ongoing maintenance costs that most organizations discover only after committing.

OpsGenie Shutdown: What You Need to Know and Your Next Steps

Atlassian recently dropped a bombshell: OpsGenie is shutting down. If you’re an OpsGenie user, this news probably hit hard. After investing time setting up your alerts, configuring on-call schedules, and training your team on OpsGenie, you’re now faced with finding and migrating to a new incident management solution. We understand the frustration and uncertainty you’re feeling right now. The reactions on Hacker News show you’re not alone in this challenge. Take a deep breath.