Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on AIOps, alerting in complex systems and related technologies.

Episode 3: Mooving to... Stability: The Role of Catastrophic Failure in Software Design

In this episode of Mooving to… Stability: The Role of Catastrophic Failure in Software Design, we had the opportunity to chat with Jeff Atwood, yes that Jeff Atwood of, Coding Horror, Stack Overflow, and Discourse (Chief Happiness Officer). Jeff started writing 911 software in Boulder, Colorado for a small company, which was a crash-course in writing code for software that has real consequences. With this unique and deep perspective, B.J.

What Is Government Digital Transformation?

The U.S. federal government knows it has not kept pace with technology innovation. Recent legislation and a $1 billion modernization fund aim to bring the federal government up-to-date. What does government digital transformation mean, and what are federal IT leaders doing to modernize their agency’s IT?

How Many Tools Do ITOps Teams Need to Observe?

In the recent past, every enterprise has had to deal with an outage, leading to war rooms where ITOps teams are put on the spot. While they take on the burden of ensuring 100% uptime, it is often the tools they employ which don’t live up to their promises. Especially in the wake of the pandemic, with working norms being redefined, ITOps teams have been under even greater pressure to deliver. While they strive to be efficient and rely on cutting-edge technology, uptime is often elusive.

mooving To...Stability

Join seasoned veteran, Jeff Atwood (yes, that Jeff Atwood of Stack Overflow and Discourse) as he discusses the role of catastrophic failure in software design. Users of modern apps require as close to 100% uptime as possible, which also means they require quick results. When these expectations aren't met, we need to learn from them to create better design. But what if your fault tolerance design ends up being the cause of your issues? Sean Molloy, and BJ Maldonado talk with Jeff about how you can learn from failure to improve your software.

AIOps in 2022 and Beyond: A Conversation with Gartner

Modern digital businesses adopt AIOps tools to enable continuous insights across an IT stack. These insights tell the full story of what’s happening behind systems, allowing IT teams to achieve the operational efficiencies and high availability that lead to customer satisfaction. Old siloed monitoring disciplines provide data specific to performance of the digital experience, IT infrastructure, application or network.

Tips to implement AIOps the right way in 2022

A lot of things have changed in recent years. From the way of working to executing IT operations, the business strategies have changed overnight with arising advances like Machine Learning, Automation, and Artificial intelligence. The technologies have changed present-day applications and IT operations, and with AI and ML on board, IT industries operate more perplexing undertakings and resolve issues across complex infrastructures.

What is AIOps. 4 Types of AIOps Platforms. How to Effectively Navigate the AIOps Landscape.

AIOps or Artificial Intelligence for IT Operations refers to a set of technologies that augment human decisions with autonomous decisions driven by AI and machine learning that learn patterns, relationships from data. AIOps is the term originally coined by Gartner, and pictorially illustrated in the following way.

Can your AIOps platform do Log Noise Reduction in addition to Alert Noise Reduction? If not, it is time to re-evaluate your AIOps

One of the core value propositions of AIOps platforms is to increase IT efficiency & productivity by applying AI & ML techniques to perform Alert Noise Reduction. This in turn translates to direct cost reduction due to savings in IT man-hours. In this approach, the AIOps platform kind of becomes like a gatekeeper for all the IT alerts/events, and it can help effectively, reduce and correlate such events, so as to send meaningful incidents to NOC or Service Desk.