%term

The latest News and Information on Incident Management, On-Call, Incident Response and related technologies.

DevOps/SRE Model: Bursting the Developer's Bubble

Oct 7, 2020 By BigPanda In BigPanda

Welcome to The CTO Perspective – discussions on the most current issues in IT Operations. In this talk: Many organizations are transitioning toward a DevOps operational model, and experiencing challenges in the process. What does it takes to make the transition easier?

View Video

BigPanda

Read more about DevOps/SRE Model: Bursting the Developer's Bubble

How Our Latest Release Makes Your PagerDuty Experience Frictionless

Oct 7, 2020 By Ariel Russo In PagerDuty

In a world that’s always on, keeping services up and running isn’t just ideal—it’s mission-critical for all of PagerDuty’s customers. It’s not lost on us that serving as the central nervous system for digital operations at some of the world’s largest companies is no small job.

Read Post

PagerDuty

Read more about How Our Latest Release Makes Your PagerDuty Experience Frictionless

DevOps/SRE Model: Bursting the Developer's Bubble. Here's the CTO Perspective.

Oct 7, 2020 By Yoram Pollack In BigPanda

Many organizations are transitioning toward a DevOps operational model, where software developers are responsible for operating the applications they develop, instead of a centralized IT operations group. In this “CTO Perspective” interview we talk to BigPanda’s CTO Elik Eizenberg about the challenges in that transition, and what it takes to make it easier. Lean back and watch the interview, or if you prefer reading, take a few minutes to read the transcript.

Read Post

BigPanda

Read more about DevOps/SRE Model: Bursting the Developer's Bubble. Here's the CTO Perspective.

Alerts out of your database (SQL, Powershell, REST API)

Oct 7, 2020 By Derdack In Derdack

Whether it be on the administrative side of the house or in a production environment, the digital world is not slowing down. In fact, it is increasing by the second. Data is collected from a thousand different sources and often stored in the same number of places. Automating the collection, analyzing and augmentation of this data can be quite a cumbersome task and very time-consuming. Not to mention the loss in revenue when this is not done.

Read Post

Derdack

Read more about Alerts out of your database (SQL, Powershell, REST API)

How to Reduce MTTR With PagerDuty and Puppet's Relay

Oct 6, 2020 By Melissa Sussmann In PagerDuty

DevOps and SRE teams are under intense pressure to reduce the mean time to recovery (MTTR) when resolving incidents. With the proliferation of cloud services and the increasing complexity of DevOps toolchains, engineers today need to not only learn how to use these services, but also troubleshoot them when an incident is raised at 2 a.m. The problem is, many incident response processes are still manual today—cobbling together runbooks and ad hoc scripts and orchestrating people to respond.

Read Post

PagerDuty

Read more about How to Reduce MTTR With PagerDuty and Puppet's Relay

Modern IT Systems Have Outgrown Traditional Monitoring

Oct 6, 2020 By Will Cappelli In Moogsoft

Legacy monitoring tools fall short for SRE teams and DevOps pros tasked with maintaining uptime of key applications in modern, cloud-based IT systems. To have visibility and control over these environments, these teams must collect and analyze more granular, underlying system information — observability data. This article explains why the only way for SRE teams and DevOps pros to extract the necessary insights from this data is through the application of AI capabilities.

Read Post

Moogsoft

Read more about Modern IT Systems Have Outgrown Traditional Monitoring

The rise of 'Compliance-ops': Bridging the tech and compliance gap in iGaming

Oct 6, 2020 By David Sachs In Exigence

Kimberley Wadsworth gambled £36,000 in a fortnight, committing suicide shortly after the loss and leaving her mother homeless as a result. Kimberley Wadsworth started gambling in 2015, visiting brick-and-mortar shops and playing at online casinos. There was no one to promptly alert or save Kimberly from her dreadful destiny.

Read Post

Exigence

Read more about The rise of 'Compliance-ops': Bridging the tech and compliance gap in iGaming

BigPanda celebrates IT Operations teams around the world

Oct 5, 2020 By BigPanda In BigPanda

IT Ops, NOC, DevOps and SRE teams work 24x7x365 to make sure the rest of us can live our digital lives to the fullest.

View Video

BigPanda

Read more about BigPanda celebrates IT Operations teams around the world

Stuff Happens: How Slack and PagerDuty Work Together to Resolve Incidents Quickly

Oct 5, 2020 By Slack In PagerDuty

Like death and taxes, IT incidents are inevitable. Issues like server outages and broken code are common—and costly. A single hour of downtime costs businesses more than $300,000 on average, according to Gartner. That’s why a solid incident management strategy is a must for any organization. “People solve incidents, but we can’t do it alone,” says Ali Rayl, Slack’s vice president of customer experience.

Read Post

PagerDuty

Read more about Stuff Happens: How Slack and PagerDuty Work Together to Resolve Incidents Quickly

Why you need to stop the handover of that shared on-call duty phone

Oct 5, 2020 By Matt In SIGNL4

If you are still handing over a shared on-call duty phone or pager (sometimes called ‘operations phone’), it is time to rethink your process. The Covid19-induced new normal has a dramatic impact on our work live and social behavior. We work from home and that is especially true for the IT workforce. We meet with less people and limit our social network to relatives and close friends.

Read Post