Operations | Monitoring | ITSM | DevOps | Cloud

Dos and Don'ts of Observability: Lessons Learned from RedMonk

On November 16, 2022, I sat down with analyst KellyAnn Fitzpatrick from RedMonk to discuss my favorite topic: observability. This time, we looked at observability in a context of what to do and what to avoid doing as you’re starting and going on an observability journey. Click the image below (or here) for a replay of the session. A machine-generated transcript is available at the end of the post.

Zero-Friction AWS Lambda Instrumentation with external extensions

If you’ve been in the software business for some time, you’ve probably noticed that creating software isn’t only about adding features. There are usually many different tasks involved. You have to test your system, fix bugs, and ensure it keeps working over its lifetime.

5 Ways to Ensure Success With Your Kubernetes Platform

Moving towards a Kubernetes platform might seem a simple move. You’ll ask your smartest engineers to get started. They will love a move towards cloud and container technology. However, if you want to realize maximum benefit as you start using a platform like Kubernetes, there is more to it.

Introducing Outlier Detection in Grafana Machine Learning for Grafana Cloud

Outlier Detection is now available as part of the Grafana Machine Learning toolkit in Grafana Cloud for Pro and Advanced users. With this feature, you can monitor a group of similar things, such as load-balanced pods in Kubernetes, and get alerted when some of them start behaving differently than their peers. There’s supposed to be a video here, but for some reason there isn’t. Either we entered the id wrong (oops!), or Vimeo is down.

Optimizing the AWS CloudWatch Log Process

Amazon’s native monitoring and management service AWS CloudWatch is great for basic monitoring and alerts. However, on its own, it may not be the best solution for analyzing log data at scale — especially if you need to analyze data outside of AWS. Many teams may find themselves restricted by retention issues and basic analytic features with Amazon CloudWatch logs for troubleshooting use cases.

SigNoz - Logs Performance Benchmark

Logs are an integral part of any system that helps you get information about the application state and how it handles its operations. The goal of this blog is to compare the commonly used logging solutions, i.e., ElasticSearch(ELK stack) and Loki(PLG stack), with SigNoz on three parameters: ingestion, query, and storage. Performance benchmarks are not easy to execute. Each tool has nuances, and the testing environments must aim to provide a level playing field for all tools.

How Monitoring Helps Avoid the Greatest Dangers to Your Website this Holiday Season

The holidays are here. It’s the happiest time of year but also the most dangerous time for your website. This season usually means sales and events, which bring in a surge of website traffic and strains to your systems. If you are not prepared for these changes, your website could pay the price and ultimately damage your business’s reputation and revenue. We want to avoid these catastrophes.

What is hyperconvergence?

Pandora FMS blog has a very clear purpose: for you to find out everything there is to know about the largest number of rare words related to computing, technology or monitoring, so you can show off among your peers (with whom the hell may you brag about this). Today it’s “Hyperconvergence“! It may sound like something about spacecrafts going into a state close to the speed of light or psychic-type Pokémon attack, but no, it’s something else!