Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Tutorial: Elasticsearch Snapshot Lifecycle Management (SLM)

Let’s face it, nothing is perfect. The better we architect our systems, though, the more near-perfect they become. But even so, someday, something is likely to go wrong, despite our best effort. Part of preparing for the unexpected is regularly backing up our data to help us recover from eventual failures and this tutorial explains how to use the Elasticsearch Snapshot feature to automatically backup important data.

How Gremlin monitors its own Chaos Engineering service with Datadog

Reliable systems are vital to meeting customer expectations. Downtime not only hurts a company’s bottom line but can be detrimental to reputation. Our goal at Gremlin is to help enterprises build more reliable systems using Chaos Engineering. Whether your infrastructure is deployed on bare metal in a corporate-owned data center or as Kubernetes-orchestrated microservices in a public cloud, chaos experiments can help you find system weaknesses early, before they affect customers.

Sponsored Post

Introducing the ITOM podcast: Listen and learn how to avoid remote work roadblocks in an IT environment

In administrating all technology and application requirements within an organization, IT operations management (ITOM) is pretty complex, and tends to send IT admins scrambling for authentic and actionable insights across the internet. We’re taking matters into our own hands and launching our very own podcast series to provide you valuable information on ITOM, which you can choose to listen to at your leisure or on the go!

Improve Manageability of NetApp Infrastructure with AIOps-Powered IT Operations

In this interactive webinar, we’ll review how Maple Networks and OpsRamp are bringing AI and machine learning to drive down the cost and complexity of monitoring and managing NetApp infrastructure stacks, such as Flexpod, FAS, and HCI.

Eliminating the Multi-Cloud Noise with Razor Technology and OpsRamp

Razor Technology's vision is "to reinvent what it means to be an IT solutions provider through best-in-class technology, industry-leading expertise, and long-term partnerships built on mutual trust with our customers." They have a broad set of solutions, from Digital Transformation to Managed Cloud Services.

What is Azure VM Insights?

Microsoft recently announced general availability of Azure VM Insights, aka Azure Monitor for VM. This service is basically a set of features that allow you to monitor your VMs in more detail, from collecting the telemetry from your VM to displaying it meaningfully – all with a single click. I am satisfied with Azure VM Insights for the most part, but I also have some mixed feelings about it. Read on to find out why.

Through the Crisis - Philosophy at Work

Dr. Brennan Jacoby is a philosopher and the founder of Philosophy at Work, an organization that works with businesses seeking to improve their thinking skills, by leveraging the great philosophers and philosophical techniques. Here – as part of our Through the Crisis series on remote work during and beyond the COVID-19 lockdown – Dr. Jacoby discusses the new Virtues of Virtual report, Aristotle, and how the way we approach technology can help us get the best out of it…

Extending and Integrating the Monitoring System with Automation and Scripting

One of the hidden gems within eG Enterprise is the ability to perform remote actions and automated tasks using built-in functionality. In conversations with customers and community peers, I often get asked why we at eG Innovations don’t offer functionality in regard to adding custom scripts and a community database of shared scripts.

Introducing the Datadog IoT Agent

From smart thermostats and grocery store checkouts to public utility infrastructures and industrial manufacturing lines, the Internet of Things (IoT) is all around us—and growing larger every day. But with this rapid growth comes a number of operational challenges: IoT devices collect a large amount of data, and are often distributed across harsh, ever-changing environments.