Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Cloud monitoring, security and related technologies.

The Outage Anxiety Test: Can You Answer These 3 Questions In Under 10 Minutes?

On Oct. 20, the Internet woke up and seemingly chose violence. For more than 12 hours, Amazon Web Services (AWS) went down. From banking platforms to hospital communications to mobile ordering apps, digital services came to a screeching halt. The cause? Two programs are trying to write a DNS entry simultaneously, failing, and leaving the entry blank. Thus began the incredibly costly failure cascade.

AI And Sustainability: Measuring The Impact Of The Generative AI Boom

Before 2022, Alex Hanna worked on Google’s Ethical AI team. Today, she’s the director of research at the Distributed AI Research Institute, a transition sparked by Google’s handling of a paper exposing AI’s growing environmental footprint. So, how bad is it, really? That depends on who you ask. Take Jesse Dodge, a senior research analyst at the Allen Institute for AI. Jesse told NPR that a single ChatGPT query can use as much electricity as keeping a light bulb on for 20 minutes.

AWS & Splunk: Accelerating Innovation Through Partnership

Discover how AWS and Splunk are pushing the boundaries of innovation to empower your security, observability, and cloud transformation journey. This video highlights our joint commitment to driving digital resilience through unified visibility, faster threat detection, and seamless integration across AWS services.

Are You Missing the Easiest Azure Discount in Your Stack?

If you’re using Microsoft Defender for Cloud, you’re probably overpaying. There’s a commitment-based pricing model that can save you up to 22% annually. But Azure won’t recommend it, and third-party tools ignore it. This blog breaks down how Defender Commit Units (DCUs) work, why they’re a blind spot, and what you need to do about it.

40 Best Cloud Network Monitoring Tools of 2026 for All Platforms and Giants like AWS, Google, Azure, IBM, and Oracle

Cloud network monitoring software is a type of software designed to monitor and manage the performance, availability, and security of networks and network devices in cloud environments. These tools use various techniques to gather information about network traffic, bandwidth utilization, application performance, and other metrics related to network health and availability.

Enterprise data centre security solutions: scaling securely for growth and resilience

Securing a data centre requires multiple layers of protection. Physical access controls, surveillance, and network safeguards reinforce one another to prevent disruption. As estates expand and workloads increase, those measures have to scale. If they don’t, gaps appear in both resilience and compliance. A data centre security solution must therefore protect infrastructure day to day while adapting to future requirements. Pulsant delivers this through an integrated framework.

Webinar Recap: 3 Cost Allocation Mistakes FinOps Teams Can Avoid

In a webinar hosted by CloudZero on Oct. 30, 2025, Larry Advey, Director of Cloud Platform and FinOps and a respected voice in the FinOps community, joined Umesh Rao to deliver a practical session on cloud cost allocation. The session, titled Three Allocation Mistakes Most FinOps Teams Make, unpacked hard-earned lessons and offered a guided tour of CloudZero’s new Dimension Studio.

AWS Fargate Alternatives: Comparing Serverless Container Options

Imagine you have an API service composed of multiple microservices. Traffic fluctuates — sometimes light, sometimes spiking. Without Fargate, you’d have to manage EC2 instances, autoscaling, patching, and more. With Fargate, you define each microservice as a task, setting the CPU/memory, container image, network rules, and AWS schedules, and then run them as needed. The result: faster deployment, lower ops overhead, and smooth scaling.

Store and search logs at petabyte scale in your own infrastructure with Datadog CloudPrem

As AI workloads and cloud-native applications expand, organizations are generating more log data than ever. Each service, container, and model inference produces continuous telemetry that must be stored, secured, and analyzed. As telemetry grows more complex, teams must balance full visibility with new retention and residency needs.