%term

AI Observability is Coming...

Apr 15, 2026 By Grafana In Grafana

Thanks for watching!

View Video

Grafana

Read more about AI Observability is Coming...

Site Reliability Engineering (SRE) 101: Everything You Need to Know | Harness Blog

Apr 15, 2026 By Eric Minick In Harness

A single second of latency can cost e-commerce sites millions in revenue, while just minutes of downtime trigger customer churn that takes months to recover. Modern users expect instant responses and seamless experiences, making reliability a competitive feature that directly impacts business outcomes. Site Reliability Engineering treats operations as a software problem rather than a manual discipline. SRE applies engineering principles to achieve measurable reliability through automation.

Read Post

Harness

Read more about Site Reliability Engineering (SRE) 101: Everything You Need to Know | Harness Blog

What's New in InfluxDB 3 Explorer 1.7: Table Management, Data Import, Transforms, and More

Apr 15, 2026 By Daniel Campbell In InfluxData

InfluxDB 3 Explorer 1.7 is a step forward for anyone who wants to manage their time series data without constantly switching between the UI and a terminal. This release adds table-level schema management, the ability to import data from other InfluxDB instances, and a new Transform Data section to reshape your data, all within the Explorer UI.

Read Post

InfluxData

Read more about What's New in InfluxDB 3 Explorer 1.7: Table Management, Data Import, Transforms, and More

Ephemeral Leaks and Automated BGP Route Leak Detection

Apr 15, 2026 By Doug Madory In Kentik

Many BGP route leaks reported by automated detection systems are actually brief, low-impact artifacts of normal BGP convergence. Doug Madory examines examples from Cloudflare Radar, Routeviews, and Jared Mauch’s long-running leak detector to show how these “ephemeral leaks” arise, why they usually don’t disrupt traffic, and why they still matter for routing security.

Read Post

Kentik

Read more about Ephemeral Leaks and Automated BGP Route Leak Detection

AI vs. Hype: Redefining Engineering Excellence with Ron Miller

Apr 15, 2026 By Harness In Harness

In this episode of "ShipTalk: Engineering Excellence," host Thomas Dockstader sits down with Ron Miller, editor at Fast Forward, to discuss the real-world impact of AI on software development. They dive deep into the maturity of AI-driven code, the rise of the "citizen developer," and why traditional writing and communication skills are becoming the new must-have for modern engineers.

View Video

Harness

AI
DevOps

Read more about AI vs. Hype: Redefining Engineering Excellence with Ron Miller

N+1 Detection in AppSignal's OpenTelemetry Trace Timeline

Apr 15, 2026 By Karen Patteri de Souza In AppSignal

N+1 query problems are one of the most common, and quietly damaging, performance issues in production applications. One extra query per record feels harmless in development. At scale, it becomes the reason your response times degrade and your database buckles under load. Today, AppSignal adds N+1 detection to its OpenTelemetry support. When we identify the pattern in a trace, we collapse the repetitive spans directly in the timeline, making the problem immediately visible in the trace itself.

Read Post

AppSignal

Read more about N+1 Detection in AppSignal's OpenTelemetry Trace Timeline

Beyond the pull request: why code review is not infrastructure validation

Apr 15, 2026 By Upsun In Upsun

Code review and infrastructure validation are distinct problems. While AI can review syntax, only an active, data-complete environment can validate system-wide state. Upsun provides the unified configuration file needed to turn "looks good to me" into verified production-readiness.

Read Post

Upsun

Read more about Beyond the pull request: why code review is not infrastructure validation

Beyond the Prompt: AI Agent Design Patterns and the New Governance Gap

Apr 15, 2026 By Alister Baroi In Tigera

If you are treating Large Language Models (LLMs) like simple question-and-answer machines, you are leaving their most transformative potential on the table. The industry has officially shifted from zero-shot prompting to structured AI agent design patterns and agentic workflows where AI iteratively reasons, uses external tools, and collaborates to solve complex engineering problems.

Read Post

Tigera

Read more about Beyond the Prompt: AI Agent Design Patterns and the New Governance Gap

Grave improvements: Native crash postmortems via Android tombstones

Apr 15, 2026 By Mischan Toosarani-Hausberger In Sentry

Native crashes on Android have always been harder to debug than they should be. The platform has its own crash reporter (debuggerd) that captures the crashing thread, every other running thread, register state, and memory maps into a file called a tombstone. Tombstones have been a part of Android for a long time; in fact, they’ve been there in one form or another since Android's first commit.

Read Post

Sentry

Read more about Grave improvements: Native crash postmortems via Android tombstones

What Is an AI SRE? And Why Do They Need Live Runtime Evidence?

Apr 15, 2026 By Lightrun Team In Lightrun

AI SREs are autonomous systems that handle incident triage, root cause analysis, and remediation by correlating logs, metrics, traces, and code signals. However, as they rely on pre-configured telemetry, the critical execution details of a specific failure, such as variable state and code paths, can often be missed. As a result, they either force users into manual redeploy loops or make inferences from partial data, diagnosing issues using probability rather than proof.

Read Post

Lightrun

Read more about What Is an AI SRE? And Why Do They Need Live Runtime Evidence?

Operations | Monitoring | ITSM | DevOps | Cloud

AI Observability is Coming...

Site Reliability Engineering (SRE) 101: Everything You Need to Know | Harness Blog

What's New in InfluxDB 3 Explorer 1.7: Table Management, Data Import, Transforms, and More

Ephemeral Leaks and Automated BGP Route Leak Detection

AI vs. Hype: Redefining Engineering Excellence with Ron Miller

N+1 Detection in AppSignal's OpenTelemetry Trace Timeline

Beyond the pull request: why code review is not infrastructure validation

Beyond the Prompt: AI Agent Design Patterns and the New Governance Gap

Grave improvements: Native crash postmortems via Android tombstones

What Is an AI SRE? And Why Do They Need Live Runtime Evidence?

Monthly Archive

Follow Us