The latest news and information on monitoring for websites, applications, APIs, infrastructure, and other technologies.

What the Google Outage Can Teach Us About Our Own Services

This week, the world stopped for a few hours as Google users experienced an outage on a massive scale. The outage affected all services that require Google account authentication, including the Google Cloud Platform (Cloud Console, Cloud Storage, BigQuery, etc.), Google Workspace (Gmail, Calendar, Docs, Drive, etc.), and Google Classroom. With so many platforms affected, this outage was far from going unnoticed by users.

Webinar: Building Serverless Flows with AWS Step Functions

AWS Step Functions is an emerging solution in the field of serverless architectures because it can coordinate many services. This webinar takes the "Right to be forgotten" (GDPR) workflow as an example for a hands-on AWS Step Functions tutorial covering use cases, difficulties, and solutions. Combining Cloudway's experience with Lumigo's monitoring and debugging solution, we provide a full framework to combine multiple subsystems into a single serverless flow and get the work done.

Webinar: How to Monitor Serverless Applications

The software we write does not always work as smoothly as we'd like. To know if something went wrong, find the root cause, and fix the problem, we need to monitor our system and get alerts whenever issues pop up. There are many useful tools and practices for non-serverless applications. As we adopt serverless architecture, can we continue to use the same approach? Unfortunately, the answer is no.

How to Enable Detection Rules via Elastic Security - Version 7.10

The detection engine brings automated threat detection to the Elastic Stack through the Security app in Kibana. As part of our belief in the power of open source, Elastic Security has open sourced all of its detection rules to work alongside the security community to stop threats at scale and arm every analyst. In this video, you’ll learn more about the detection engine and how to automate the protection of your data.

An Open Source Prometheus Tutorial for System and Docker Monitoring

Prometheus is an essential element in DevOps. It has one of the largest contributor communities in the open source world and is the favorite database for time-series metrics data. It is also part of Logz.io’s complete observability package, which integrates open source Grafana, Jaeger, and the ELK Stack into one platform. It’s a favorite for system and container monitoring. This tutorial will show you how Prometheus works with Docker and your local machine.

How to Set up an Integration with Slack

In this week’s tech tip, we are going to show you how to set up an integration with the instant messaging and collaboration app, Slack. These days, collaboration tools like Slack and Microsoft Teams are indispensable to the distributed workforce. In addition to being an instant messaging system that lets you bypass email, Slack has lots of add-ins for other workplace tools.

Pandora FMS Enterprise in High Availability

Pandora FMS is a proactive, advanced, and flexible monitoring tool that is easy to configure for each business. It adapts to the needs of servers, network devices, terminals, and anything else that needs to be monitored. This article applies exclusively to Pandora FMS Enterprise; what it describes cannot be used in Pandora FMS Community.

Elastic Cloud on Kubernetes is now a Red Hat OpenShift Certified Operator

We are delighted to announce that Elastic Cloud on Kubernetes (ECK), the official Elastic Operator, is now a Red Hat OpenShift Certified Operator. The operator makes it easier to deploy and automate Elasticsearch, Kibana, APM Server, Beats, and Enterprise Search in your OpenShift environment.