Operations | Monitoring | ITSM | DevOps | Cloud

Getting Started with Home Assistant Webhooks & Writing to InfluxDB

If you’re already running or are familiar with Home Assistant, you’ve likely worked with integrations, maybe a few automations, and possibly MQTT as a way to wire devices together. But webhooks add another layer of flexibility that lets you level up your smart home into a fully-customized, intelligent network. Instead of relying on built-in integrations and being confined to the same local network, you can let external devices and services push events directly into Home Assistant.

AWS outage takes down more than 150 cloud services

On May 7th and 8th, 2026, Amazon Web Services (AWS) experienced an outage affecting Amazon Elastic Compute Cloud (EC2) in the dreaded US East 1 region. The original region of AWS located in Northern Virginia, us-east-1 or just “US East” as it is known, has been the subject of some of the internet’s most high profile and destructive outages and remains Amazon’s least reliable region.

Collective IQ Business: meet the artificial intelligence that transforms IT management

The employee digital experience (DEX) is no longer just a concept; it has become a concrete discipline supported by specialized tools. At the center of this transformation is Collective IQ, Almaden’s DEX solution, available in the Essential and Business editions. The Business edition includes AlmaAI a family of generative AI capabilities that take IT management to a new level.

The Journey to Production AI: Five Steps for SRE and Platform Teams

In a recent webinar, The Journey to Production AI, Andre Elizondo walked through what separates a working agent demo from an agent worth trusting on a 2 a.m. page. Live polls during the session put numbers behind a pattern most platform teams already feel. ‍ ‍ Most teams are early. The ones who are further along did not get there by shipping a flashier demo. They got there by treating production AI as a platform problem.

How Modern Ops Lost Their Bearings

Modern operations carry a quiet contradiction. Organizations have never had more data, more dashboards, or more instrumentation, yet teams increasingly struggle to gain a reliable sense of what the environment is actually doing. The problem is not the absence of information. It is the absence of bearings. This drift did not happen suddenly. It accumulated across years of transformation.

A Runnable Reference Architecture for Battery Energy Storage Systems on InfluxDB 3

A battery is a complex electrochemical system where safety and revenue are decided in milliseconds. Cell temperatures, voltages, and state of charge change in real-time; dispatch decisions and thermal alarms must fire in real-time. Anything in between—your data pipeline, your historian, your alerting layer—has to disappear into the background.

Diagnose and resolve database performance issues faster with Database Investigator

When your database performance degrades, diagnosing the root cause is rarely quick or straightforward. Your existing tools might surface metrics like CPU utilization, wait events, and query duration, but then leave you to correlate the data and identify what went wrong. Worse, what first appears to be the root cause can often just be a downstream effect of multiple interrelated issues.

How Much Is A Petabyte? Local vs Cloud Storage Costs

Today, roughly 94% of enterprises are projected to use some form of cloud service, with 82% adopting hybrid models that combine cloud and on-premises infrastructure. By 2030, around 60% of all corporate data is expected to be stored in the cloud, highlighting a major shift away from traditional storage systems toward distributed, on-demand infrastructure. With such a huge amount of data, companies need a scalable, affordable, and secure method to store 1 PB.

The zero-trust agent: why your AI needs a sandbox, not a blank check

Key takeaway: Granting AI agents unrestricted access to cloud infrastructure is an unacceptable security risk. Upsun provides a "zero-trust" framework by utilizing isolated, production-perfect preview environments that allow AI to be productive without the risk of a hallucinated production outage.

Zero-Code OpenTelemetry for Vert.x

Drop a JAR on the JVM. Get distributed tracing, RxJava context propagation, log-trace correlation, and Vert.x internal metrics. No code changes. No Maven dependency. Java 8–21. Inside the design of last9/vertx-opentelemetry v2.3.4. Prathamesh works as an evangelist at Last9, runs SRE stories - where SRE and DevOps folks share their stories, and maintains o11y.wiki - a glossary of all terms related to observability.