DevOps

The latest news and information on DevOps, CI/CD, automation, and related technologies.

Our customers aren't just numbers - they're a priority

At incident.io, “We care about our customers” isn’t just a talking point. It’s a core part of how we operate. Whether it’s a big feature request or a small bug fix, we’ve been intentional about making sure that customers always feel heard and seen—no matter the ask. But it’s not just that.

How to test your systems for scalability and redundancy with Fault Injection

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Do you know if your services can tolerate losing a node? What about an entire availability zone? Or a region? Large-scale outages aren’t unheard of. When you’re running critical services, it’s vital that those services can keep running even if an AZ or region fails. In addition to failing over, these services also need to scale quickly so traffic shifts don’t overwhelm your systems. How do you prove that a service is both scalable and redundant? The answer is with Fault Injection.

The role of secure data storage in fueling AI innovation

Artificial intelligence is the most exciting technology revolution of recent years. Nvidia, Intel, AMD and others continue to produce ever-faster GPUs, enabling larger models and higher throughput in decision-making processes. Outside of the immediate AI hype, one area remains somewhat overlooked: AI needs data.

Decentralized Monitoring Explained

Users often find themselves puzzled by the concepts of decentralized or distributed monitoring. This confusion is likely due to many monitoring systems claiming distributed capabilities, making it challenging to discern how Netdata stands out. To grasp the distinction, we must delve into the evolution of monitoring systems. When the first monitoring systems were created, about 20-25 years ago, they were built as SNMP collectors.

Managing Multiple Kubernetes Clusters: What, Why and How

Kubernetes is a powerful tool for deploying and managing containerized applications. However, managing its clusters is a critical but challenging task for many organizations. Today, we will discuss the benefits of managing multiple clusters and the challenges they present, and offer a quick technical guide to managing these clusters efficiently.

Colo Rental Rates are Rising: Are You Keeping Track of Your Power Utilization?

It’s getting more expensive to rent space in colocation data centers. According to CBRE, colocation rates are up 18.6% year-over-year to a record $163.44 per kW/month due to limited supply and strong demand.

[Figure: Average asking rental rate with Y-o-Y % change for primary markets. Rental rates are quoted asking rates for 250-500 kW at N+1/Tier III requirements. Image source: CBRE Research, CBRE Data Center Solutions, H2 2023.]

Building an Internal Developer Platform for 20k Engineers on a Single Tenant with Will Stewart

Explore the world of Internal Developer Platforms (IDPs) with Will Stewart, co-founder of Northflank. Gain insights into developer experience, security, and scalability, drawing from Will's extensive expertise. Learn how IDPs empower engineering teams for enhanced productivity and innovation in modern software development environments. Dive into this comprehensive overview to unlock the potential of IDPs and optimize your organization's workflow.

Facing the Future of SBOMs: Are You Ready to Overcome Their Biggest Challenges?

In this session at Navigate North America 24, Cortez Frazier Jr. from Fossa delves into the critical world of Software Bill of Materials (SBOMs). As regulatory demands increase and the call for software component transparency becomes louder, mastering SBOMs is essential. Cortez unpacks the complexities of creating, managing, and distributing SBOMs, offering actionable solutions to streamline the process.

Azure Management Platform - Turbo360

Turbo360 (formerly Serverless360) is an advanced cloud management platform that delivers significant Azure cost savings and infrastructure monitoring for complex Azure environments. It has helped customers achieve annual savings of up to 30% through advanced cost monitoring, granular analysis, and optimization insights, and has reduced incident resolution time by 80% through holistic infrastructure monitoring across multiple Azure resources with business context.