%term

The latest News and Information on Service Reliability Engineering and related technologies.

Lightrun Launches Industry's First AI SRE With Live Dynamic Runtime Context

Feb 25, 2026 By Lightrun In Lightrun

Autonomously Remediates Software Issues, Generates Missing Runtime Evidence on Demand, and Validates Hypotheses Against Live Execution from Code to Production.

Read Post

Lightrun

Read more about Lightrun Launches Industry's First AI SRE With Live Dynamic Runtime Context

Best Incident Management Software for Engineering Teams (2026)

Feb 23, 2026 By Sahil Khan In Last9

Compare 9 incident management tools: PagerDuty, Opsgenie, Incident.io, Rootly, FireHydrant, BetterStack, Grafana OnCall, Squadcast, and Last9. Features, pricing, and which fits your team. Product Marketing Manager.

Read Post

Last9

Read more about Best Incident Management Software for Engineering Teams (2026)

AI SRE in Practice: Accelerating Engineer Onboarding with Contextual Expertise

Feb 22, 2026 By Itiel Shwartz In Komodor

Onboarding new engineers to complex Kubernetes environments is expensive. Junior engineers need to learn cluster architecture, understand organizational conventions, navigate internal documentation, and build relationships with senior team members who can answer questions. The process takes weeks or months, and during that time, senior engineers spend significant time mentoring instead of working on complex problems.

Read Post

Komodor

Read more about AI SRE in Practice: Accelerating Engineer Onboarding with Contextual Expertise

Database Partitioning: Types, Strategies, and When to Use Each

Feb 22, 2026 By Prathamesh Sonpatki In Last9

How database partitioning works in PostgreSQL and MySQL. Range, list, and hash partitioning with SQL examples and guidance on when to partition vs shard. Prathamesh works as an evangelist at Last9, runs SRE stories - where SRE and DevOps folks share their stories, and maintains o11y.wiki - a glossary of all terms related to observability.

Read Post

Last9

Read more about Database Partitioning: Types, Strategies, and When to Use Each

Database Sharding: How It Works and When You Actually Need It

Feb 21, 2026 By Prathamesh Sonpatki In Last9

How database sharding works, common strategies (hash, range, directory), shard key selection, and the operational cost of running a sharded database in production. Prathamesh works as an evangelist at Last9, runs SRE stories - where SRE and DevOps folks share their stories, and maintains o11y.wiki - a glossary of all terms related to observability.

Read Post

Last9

Read more about Database Sharding: How It Works and When You Actually Need It

Database Performance Tuning: A Practical Guide for Production Systems

Feb 20, 2026 By Preeti Dewani In Last9

Tune PostgreSQL and MySQL for production with connection pooling, memory configuration, write path optimization, vacuum management, and lock contention fixes. Technical Product Manager at Last9.

Read Post

Last9

Read more about Database Performance Tuning: A Practical Guide for Production Systems

Traces Are Not Your Business Logic

Feb 19, 2026 By Mukta Aphale In Last9

Distributed traces track how your system processed a single request — not what your customers did over time. Confusing the two leads to poorly instrumented systems.

Read Post

Last9

Read more about Traces Are Not Your Business Logic

SQL Query Optimization: Techniques That Actually Improve Performance

Feb 19, 2026 By Sahil Khan In Last9

Find and fix slow SQL queries using execution plans, missing index detection, N+1 pattern fixes, and pagination strategies for PostgreSQL and MySQL. Product Marketing Manager.

Read Post

Last9

Read more about SQL Query Optimization: Techniques That Actually Improve Performance

Database Indexing: How It Works, Types, and When to Use It

Feb 18, 2026 By Faiz Shaikh In Last9

How database indexes work, when to use B-tree vs hash indexes, clustered vs non-clustered indexes, and how to tell if your indexes are actually helping.

Read Post

Last9

Read more about Database Indexing: How It Works, Types, and When to Use It

Code Is Cheap, Reliability Isn't: Owning Production in the AI era w/ Swizec Teller

Feb 16, 2026 By Rootly In Rootly

In this episode, Swizec Teller, author of the bestselling Scaling Fast, makes a bold claim: code is cheap, reliability is not. As AI coding tools accelerate feature development, the real competitive advantage shifts to operating systems reliably in production. We explore the hidden complexity of SRE work, the addictive nature of agentic coding, and why ownership — not automation — remains at the core of modern software engineering.

View Video