Operations | Monitoring | ITSM | DevOps | Cloud

Checkly Expands Monitoring Capabilities with Introduction of Heartbeat Checks

Checkly, the leading Monitoring as Code provider, expanded its platform's monitoring capabilities with the introduction of Heartbeat Checks, also known as CRON monitoring or dead man's switches. Also introduced today, Smart Retries is designed to reduce alert fatigue.

When generative AI helpdesks take control, will humans make the cut?

Let’s take a nostalgic trip down memory lane of traditional IT helpdesks. It’s reminiscent of waiting in long lines only to be told you’re in the wrong line. Or being serenaded by never-ending loops of elevator music? But, in a world where businesses are now laser-focused on customer success, these methods are antiquated and impractical. Don’t believe me? Ask any customer success manager.

Comparing Datadog and New Relic's support for OpenTelemetry data

OpenTelemetry is the future of Observability, APM, Monitoring, whatever you want to call ‘the process of knowing what our software is doing.’ It’s becoming common knowledge that your time is better spent gaining experience with an open, standardized system for telemetry than closed-source or otherwise proprietary standard. This truth is so universally acknowledged that all the big players in the market have made announcements of how they’re embracing OpenTelemetry.

Can Your Racks Support NVIDIA DGX H100 Systems?

AI is booming. The AI market is projected to grow 37.3% annually from 2023 to 2030. With so many organizations adopting or considering AI applications, data centers need to be ready to support the new demand. However, without the right tools and data, it is difficult to understand if your existing facilities have the capacity to support systems like the “gold standard for AI infrastructure,” the NVIDIA DGX H100.

Kubernetes Logging with Filebeat and Elasticsearch Part 2

In this tutorial, we will learn about configuring Filebeat to run as a DaemonSet in our Kubernetes cluster in order to ship logs to the Elasticsearch backend. We are using Filebeat instead of FluentD or FluentBit because it is an extremely lightweight utility and has a first-class support for Kubernetes. It is best for production-level setups. This blog post is the second in a two-part series. The first post runs through the deployment architecture for the nodes and deploying Kibana and ES-HQ.

Kubernetes Logging with Filebeat and Elasticsearch Part 1

This is the first post of a 2 part series where we will set up production-grade Kubernetes logging for applications deployed in the cluster and the cluster itself. We will be using Elasticsearch as the logging backend for this. The Elasticsearch setup will be extremely scalable and fault-tolerant. ‍

AWS KMS Use Cases, Features and Alternatives

A Key Management Service (KMS) is used to create and manage cryptographic keys and control their usage across various platforms and applications. If you are an AWS user, you must have heard of or used its managed Key Management Service called AWS KMS. This service allows users to manage keys across AWS services and hosted applications in a secure way.

How to Fix Source Map Upload Errors

A stack trace lacking your source code with all the variables and function names, is like putting together a jigsaw puzzle without a picture for reference. You have all these randomly shaped pieces but no way to know how they fit together. Unless you are fluent in computer, making sense of a JavaScript stack trace with minified code is going to make debugging very difficult. Thankfully, by uploading source maps to Sentry, you can map back to the original source code to make sense of what went wrong.