The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

How Monitoring, Observability & Telemetry Come Together for Business Resilience

Mar 1, 2023 By Chrissy Kidd In Splunk

Systems going down because of an unforeseen incident? Got problems with your app or website? Is your audience missing out on products and services because your load times are too slow? Then monitoring and observability (and telemetry) should be of interest to you! In this long article, we’re covering everything! I’ll start with the concepts and how they work.

Reduce 60% of your Logging Volume, and Save 40% of your Logging Costs with Lightrun Log Optimizer

Mar 1, 2023 By Eran In Lightrun

As organizations adopt more of the FinOps Foundation practices and try to optimize their cloud-computing costs, engineering plays a vital role in reaching that maturity. Traditional troubleshooting of applications still relies heavily on static logs and legacy telemetry that developers added either when first writing their applications, or during a troubleshooting session where they lacked telemetry and needed to add more logs in an ad-hoc fashion.

How Synthetic Transaction Monitoring Provides Complete Site Visibility & Why Basic Monitoring is Not Enough

Mar 1, 2023 By Jonathan Franconi In uptime

We’ve all been in the situation before: it’s Friday at 5 PM and the only on-call engineer available to handle incidents is about to hit the slopes. Unfortunately, at that very moment, a customer reports to support that they are unable to access the company’s ecommerce website to complete a purchase. Internal monitoring systems seem quiet and services appear available on internal health dashboards.

Monitoring with Custom Metrics

Mar 1, 2023 By Javier Martínez In Sysdig

When kickstarting a monitoring project with Prometheus, you might realize that you get an initial set of out-of-the-box metrics with just Node Exporter and Kube State Metrics. But this will only get you so far, since you will just be performing black box monitoring. How can you go to the next level and observe what’s beyond? Custom metrics are an essential part of the day-to-day monitoring of cloud-native systems, as they provide an additional dimension to the business and app level.

Why is Icinga called Icinga?

Mar 1, 2023 By Feu Mourek In Icinga

It’s the year 2009, a nice weekend in late spring, and a small group of monitoring enthusiasts comes together to discuss how to move forward with the idea of forking Nagios. Plans were made to make it faster, easier, more scalable, and simply better. Of course, such a project has a lot of hurdles to clear, and the most important one was the name.

How Splunk Users can Maximize Investment with CloudFabrix Log Intelligence

Mar 1, 2023 By Srinivas Miriyala In Fabrix

The good people over at Splunk explain that the platform “removes the barriers between data and action, empowering observability, IT and security teams to ensure their organizations are secure, resilient and innovative.” Splunk is a unified security and observability platform that allows companies to go from visibility to action quickly and at scale.

How 3 Companies Implemented Distributed Tracing for Better Insight into Their Systems

Mar 1, 2023 By Rebecca Carter In Honeycomb

Distributed tracing enables you to monitor and observe requests as they flow through your distributed systems to understand whether these requests are behaving properly. You can compare tiny differences between multiple traces coming through your microservices-based applications every day to pinpoint areas that are affecting performance. As a result, debugging and troubleshooting are simpler and faster.

How Delivery Hero uses Kubecost and Datadog to manage Kubernetes costs in the cloud

Mar 1, 2023 By Guto Costa In Datadog

As the world’s leading local delivery platform, Delivery Hero brings groceries and household goods to customers in more than 70 countries. Their technology stack comprises over 200 services across 20 Kubernetes clusters running on Amazon EKS. This cloud-based, containerized infrastructure enabled them to scale their operation to support increasing demand as the volume of orders placed on their platform doubled during the pandemic.

Troubleshoot blocking queries with Datadog Database Monitoring

Mar 1, 2023 By Aaron Kaplan In Datadog

Blocked queries are one of the key issues faced by database analysts, engineers, and anyone managing database performance at scale. Blocking can be caused by inefficient query or database design as well as resource saturation, and can lead to increased latency, errors, and user frustration. Pinpointing root blockers—the underlying problematic queries that set off cascading locks on database resources—is key to troubleshooting and remediating database performance issues.

How to Achieve Full Stack Observability in Highly Distributed Environments Webinar

Mar 1, 2023 By WhatsUp Gold In WhatsUp Gold

Your modern IT infrastructure has become an increasingly complicated mix of on-premises, public and private cloud applications, devices and environments. Forward-thinking organizations are addressing this complexity by transitioning to a proactive “observability” approach for infrastructure management. This methodology produces and then applies actionable data to optimize and secure the entire network.
