Latest News

Bob Lee - Lead DevOps Engineer at Twingate

Mar 15, 2024 By Shubham Srivastava In Zenduty

I was in sunny Austin this February, speaking at Civo Navigate 2024. The event was jam-packed with amazing talks, and it was great meeting so many people with long and fascinating careers in engineering and Site Reliability. I had the privilege of meeting Bob Lee, who currently leads DevOps at Twingate, a cloud-based service that provides secure remote access and is poised to replace VPNs.

Design Details: On-call

Mar 15, 2024 By Tom Petty In Incident.io

On your bedside table sits a piece of software designed to wake you up. It loves bothering you when something goes wrong and making it your responsibility to sort it out. Meet the new incident.io On-call app. We designed it this way: to be as interruptive as possible. Whether you’re watching telly, at the gym, or as mentioned, fast asleep, it’ll get you. Got called even though you’re in silent mode? Great! We’ve done our job properly.

Strategies for Scaling Systems Reliably by Bob Lee

Mar 15, 2024 By Shubham Srivastava In Zenduty

I was in sunny Austin this February, speaking at Civo Navigate 2024. The event was jam-packed with amazing talks, and it was great meeting so many people with long and fascinating careers in engineering and Site Reliability. I had the privilege of meeting Bob Lee, who currently leads DevOps at Twingate, a cloud-based service that provides secure remote access and is poised to replace VPNs.

ROI Demystified: A Deep Dive into What ROI Truly Means for Your Business

Mar 14, 2024 By Vishal Padghan In Squadcast

The term ROI (Return on Investment) often gets thrown around without a thorough understanding of its implications. Many see it merely as a financial metric, but in reality, ROI encompasses much more than monetary gains. In this comprehensive exploration, we delve into the true essence of ROI, its multifaceted nature, and how it impacts every aspect of your business strategy.

The Role of the SRE in the Incident Management Process

Mar 14, 2024 By Lee Atchison In Blameless

In modern businesses of every type, IT systems play a major role, and the Site Reliability Engineer (SRE) has become central to managing the effectiveness and reliability of the entire business. SREs are the bridge between the rapid deployment of software and systems and the stable operation of those systems in a production environment. They ensure that reliability and performance criteria are defined and met.

The engineering on-call experience: misconceptions, lessons learned, and how to prepare

Mar 14, 2024 By Grafana Labs Team In Grafana

The on-call experience is sometimes a dreaded one for software engineers. Those late-night alerts and frantic Slack messages, after all, don’t exactly sound pleasant. But what’s an on-call shift really like? Is that perception of constant fire-fighting and 3 AM wake-up calls actually realistic? Michael Mandrus and Owen Smallwood, both senior software engineers here at Grafana Labs, wanted to set the record straight.

From Deploy to Commit: Building the Ultimate Development Pipeline - A Comprehensive Guide

Mar 13, 2024 By Chitra Bisht In Squadcast

‘Manual deployment is (should be) a sin.’ Well, calling manual deployment a sin may sound strong, but consider this: building the ultimate development pipeline demands a focus on automation. Although the selection of a deployment method depends on the specific needs and requirements of a project or environment, can you really deny the power of automated deployment? There's a better way.

How AIOps improves IT service assurance and optimization

Mar 13, 2024 By BigPanda In BigPanda

ITOps and DevOps teams face many challenges. Their responsibilities are extensive, from navigating complex IT environments at scale to quickly addressing performance issues and minimizing downtime and outages. Enhancing your organization’s IT service assurance requires you to ensure the reliability, performance, and availability of IT services.

How to deal with alert fatigue head-on

Mar 13, 2024 By incident.io In Incident.io

Everyone experiences stress at work—thankfully, it’s a topic folks aren’t shying away from anymore. But for on-call engineers, alert fatigue is a phenomenon closer to home. Unfortunately, like stress, it can be just as insidious and drastically impact those it affects. First discussed in the context of hospital settings, this phrase later entered engineering circles.

How Squadcast's Snooze Incidents Promotes Focussed On Call Shifts

Mar 12, 2024 By Chitra Bisht In Squadcast

Dealing with a flood of incidents, each with varying degrees of urgency, can be a daily struggle for Incident Response teams. Suppose a low-priority alert pings while you're tackling a critical incident, pulling your focus away from the urgent issue. How do engineers ensure that high-severity issues take precedence? Don't they want to avoid being bombarded with notifications while addressing critical matters? They sure do.
