Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

New Feature Friday: Understand & Improve Your DORA Performance with Cortex

This week on New Feature Friday, we’re highlighting two new releases that make it easier than ever to understand and improve your DORA performance: DORA Academy Course A guided learning experience that shows you how to use DORA Metrics and Cortex together to drive better engineering outcomes—without the data chaos. DORA Operational Readiness Scorecard An out-of-the-box template that benchmarks each service against DORA standards, giving teams an instant snapshot of where they stand and where to focus.

Enhanced Environment Compliance with Environment Policies

We’re excited to announce an important enhancement to Kosli that will improve how environment compliance is managed across your organization. Starting with our next release, all compliance evaluation for Kosli environments will be consolidated through our powerful Environment Policies feature.

Searching Certificate Transparency Logs (Part 3)

Clickhouse is an incredible database. Here at Certkit, we’ve long worked in the world of “No SQL” databases like Elasticsearch precisely for their ability to query large amounts of data. But for every database, there’s an amount of data that’s “Too big”. Too big to query quickly or too big to store affordably. Clickhouse manages to thread the needle by efficiently storing truly ridiculous amounts of data while still providing impressive query performance.

Cloud Efficiency Masterclass: 6 Data-Driven Ways To Reduce Costs And Scale

Discover the basics of cloud efficiency as well as six advanced data-driven strategies you can use to make your cloud environment more efficient. With incredibly complex cloud architecture — that may even include Kubernetes and multi-tenant infrastructure — organizations are finding it hard to measure and monitor the performance and cost of their cloud environments.

4 Golden Signals of System Reliability: A Practical Guide for Your Team

Modern systems produce endless streams of metrics. CPU usage, request volume, cache hit rates, node counts, queue depth, the list keeps growing. With this much data, it’s easy for teams to get lost in dashboards without knowing what actually matters. That’s why DevOps and SRE teams rely on the 4 Golden Signals of System Reliability. They provide the simplest and clearest way to understand user experience and system health.

Incident Management vs Change Management: Key Differences Explained

The Incident Management vs. Change Management are two such moments that highlight a core difference teams face every day. One is a reaction to failure. The other is a planned improvement. That’s the heart of incident management vs. change management. Both keep systems reliable, and both help teams move faster without breaking things. Let’s explore how they differ and how they work together.