Latest News

AIOps for SAP: From Ground to Cloud

Jul 29, 2025 By Avantra Team In Avantra

Anyone working in the SAP market in 2025 is aware of two big topics: migration to cloud-based ERP and the end of many long-used tools for managing SAP operations including Focused Run, Landscape Manager and Solution Manager. Both are impossible to ignore. Cloud-based ERP presents a new era of business software possibilities, and with it the opportunities and complexities of migration, transformation, and leveraging the elastic capacity and scalability of cloud-based designs. But right behind it, the question becomes "how are we going to run and manage this?"

Read Post

Avantra

Read more about AIOps for SAP: From Ground to Cloud

Disposable Code Is Here to Stay, but Durable Code Is What Runs the World

Jul 29, 2025 By Charity Majors In Honeycomb

Every day I seem to run into yet another post with someone solemnly opining that “writing code has never been the hardest part of software engineering. And hey, that’s smashing. As an engineer from the ops/infra/SRE side of the house, I feel like I’ve been saying this my whole career. (Is there anything more satisfying than being proven right in public? Not in my book.) So, which is it?

Read Post

Honeycomb

Read more about Disposable Code Is Here to Stay, but Durable Code Is What Runs the World

Why Your Loki Metrics Are Disappearing (And How to Fix It)

Jul 29, 2025 By Faiz Shaikh In Last9

Grafana Loki is up and running, log ingestion looks healthy, and dashboards are rendering without issues. But when you query logs from a few weeks ago, the data's missing. This is a recurring problem for many teams using Loki in production: while the system handles short-term log visibility well, it often lacks the retention guarantees developers expect for historical analysis and incident review.

Read Post

Last9

Read more about Why Your Loki Metrics Are Disappearing (And How to Fix It)

New in OTel: Auto-Instrument Your Apps with the OTel Injector

Jul 29, 2025 By Anjali Udasi In Last9

As distributed systems scale, maintaining manual instrumentation across services quickly becomes unsustainable. The OTel Injector addresses this by automatically attaching OpenTelemetry instrumentation to applications, no code changes needed. This blog covers how the OTel Injector works, how it integrates with Linux environments, and how to set it up for consistent telemetry across your stack.

Read Post

Last9

Read more about New in OTel: Auto-Instrument Your Apps with the OTel Injector

Hands-On with Continuous Observability

Jul 29, 2025 By Johan Kraft (PhD) In Percepio

Ask any embedded developer about their worst debugging experience, and chances are you’ll hear stories of unreproducible bugs, late-night watchdog resets, or CI test failures with no trace. Traditional tools often leave us blind at the exact moment we need insight.

Read Post

Percepio

Read more about Hands-On with Continuous Observability

Building an Effective Post-Mortem Culture: A Step-by-Step Guide

Jul 29, 2025 By Nuno Tomas In isDown

Post-mortems are the cornerstone of continuous improvement in incident management. When done right, they transform failures into learning opportunities and prevent future outages. Yet many teams struggle to build a culture where post-mortems are valued rather than feared.

Read Post

isDown

Read more about Building an Effective Post-Mortem Culture: A Step-by-Step Guide

From Alert to Answer in Seconds: Accelerating Incident Response in Dynatrace

Jul 29, 2025 By Mezmo In Mezmo

It is 12PM and you just start eating lunch when your phone starts buzzing. A storm of different monitoring and system-level alerts start stacking up on your phone and slack. The incident response "war room" opens and downtime communications are being drafted to customers. Your team is under pressure to find the root cause, but you are immediately hit with roadblocks.

Read Post

Mezmo

Read more about From Alert to Answer in Seconds: Accelerating Incident Response in Dynatrace

Incident IQ integration is here!

Jul 29, 2025 By Colin Bartlett In StatusGator

We’re excited to launch one of our most highly requested integrations: StatusGator now connects directly with Incident IQ. This powerful new integration bridges the gap between real-time service monitoring and your internal support workflow. Now, whenever someone reports an outage on your public StatusGator page, a ticket is automatically created in Incident IQ—ensuring your IT team can respond quickly and efficiently.

Read Post

StatusGator

Read more about Incident IQ integration is here!

Evals are just tests, so why aren't engineers writing them?

Jul 29, 2025 By Eli Hooten In Sentry

You’ve shipped an AI feature. Prompts are tuned, models wired up, everything looks solid in local testing. But in production, things fall apart—responses are inconsistent, quality drops, weird edge cases appear out of nowhere. You set up evals to improve quality and consistency. You use Langfuse, Braintrust, Promptfoo—whatever fits. You start running your evals, tracking regressions, fixing issues, and confidence goes up as a result. Things improve.

Read Post

Sentry

Read more about Evals are just tests, so why aren't engineers writing them?

Multi Factor Authentication for Synthetic Monitoring for AVD

Jul 29, 2025 By SatheeshKumar S In eG Innovations

Today, I’ll cover some of the basics of monitoring Multi-Factor Authentication and why ensuring MFA is implemented is essential, particularly in environments where remote access is possible. I’ll cover some recent, specific case studies where a lack of MFA has led to security breaches and the mechanisms the bad actors used.

Read Post

eG Innovations

Read more about Multi Factor Authentication for Synthetic Monitoring for AVD

Operations | Monitoring | ITSM | DevOps | Cloud

AIOps for SAP: From Ground to Cloud

Disposable Code Is Here to Stay, but Durable Code Is What Runs the World

Why Your Loki Metrics Are Disappearing (And How to Fix It)

New in OTel: Auto-Instrument Your Apps with the OTel Injector

Hands-On with Continuous Observability

Building an Effective Post-Mortem Culture: A Step-by-Step Guide

From Alert to Answer in Seconds: Accelerating Incident Response in Dynatrace

Incident IQ integration is here!

Evals are just tests, so why aren't engineers writing them?

Multi Factor Authentication for Synthetic Monitoring for AVD

Monthly Archive

Follow Us