
Replay Data From Object Storage for Long-term Incident Investigations

Psst, hey pal, would you like to buy a time machine? I am not talking about some H.G. Wells monstrosity where you somehow end up being chased by dinosaurs or become your own grandparent. But a time machine for your observability data. License costs and tool performance often keep organizations from ingesting all their data or require them to limit data retention time. Security incidents are often discovered long after these retention times are exhausted or require data that was never ingested.

How Metrics Behave in Honeycomb

Honeycomb has the ability to receive events from applications. These events can take the shape of Honeycomb wide events, OpenTelemetry trace spans, and OpenTelemetry metrics. Because Honeycomb’s backend is very flexible, these OpenTelemetry signals fit in just fine—but sometimes, they have a few quirks. Let’s dive into using metrics the Honeycomb way and cover a few optimizations.

Accelerating R&D in pharma with Elasticsearch, ESRE, LLMs, and LangChain - Part 1

A comprehensive guide to support faster drug innovation and discovery in the pharmaceutical industry with generative AI/LLMs, custom models, and the Elasticsearch Relevance Engine (ESRE). Faster drug discovery leading to promising drug candidates is the main objective of the pharmaceutical industry. To support that goal, the industry has to find better ways to use both public and proprietary data, quickly and safely.

Detecting Main Thread Issues in Mobile Applications

Mobile device users care about three things when it comes to good app performance: We’re going to look at how modern concurrency APIs can help with some of these. We recently shipped a new profiling feature to help you find the sources of main thread contention; specifically detecting issues with image and JSON decoding or regex matching. These point you to spots where you can immediately make improvements to your app’s UI performance.

Graphite Graphing and Monitoring tool

The Graphite graphing and monitoring tool is open-source software for monitoring time-series data, and it can be installed on any system, from cheap hardware to the cloud. Graphite collects time-series data from infrastructure, servers, networks, and applications, and then provides the Graphite graphing UI for analyzing the data. Graphite has been around since 2008 and has been under continuous development over the past 12 years.
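As a rough illustration of how an application might hand a data point to Graphite, the sketch below formats one metric in Carbon's plaintext protocol, which is one line per data point made up of a metric path, a value, and a Unix timestamp. The metric path, the host name, and the helper function names are hypothetical; port 2003 is Carbon's usual plaintext listener, but your installation may differ.

```python
import socket
import time


def format_metric(path, value, timestamp=None):
    """Build one line of Carbon's plaintext protocol: 'path value timestamp\\n'."""
    if timestamp is None:
        timestamp = int(time.time())
    return f"{path} {value} {int(timestamp)}\n"


def send_metric(line, host="localhost", port=2003):
    """Send a formatted line to a Carbon listener (host and port are assumptions)."""
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(line.encode("ascii"))


# Hypothetical metric path, used only for illustration.
example_line = format_metric("servers.web01.cpu_load", 0.42, 1700000000)
```

Calling `send_metric(example_line)` would deliver the data point to a Carbon daemon if one is listening on the assumed address.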

How to visualize time series from SQL databases with Grafana

Relational databases like MySQL, PostgreSQL, Oracle, and others have a wealth of time series data locked inside of them. Often this data can be used to enhance observability dashboards, or keep track of important application factors, like how many users have signed up for a service. In this article, we’re going to show you how to visualize any time series from any SQL database in Grafana using the time series visualization.
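To make the idea concrete, here is a minimal sketch of the kind of query a time series panel needs: one time column and one numeric column, ordered by time. It uses Python's built-in SQLite module so it runs anywhere; the `user_signups` table, its `created_at` column, and the hourly bucket are all hypothetical, and the SQL for your own database and Grafana data source will likely look a little different.

```python
import sqlite3


def signups_per_hour(conn):
    """Return (hour_bucket, signup_count) rows ordered by time."""
    query = """
        SELECT strftime('%Y-%m-%d %H:00:00', created_at) AS bucket,
               COUNT(*) AS signups
        FROM user_signups
        GROUP BY bucket
        ORDER BY bucket
    """
    return conn.execute(query).fetchall()


def demo():
    """Build a small in-memory table and run the bucketed query against it."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE user_signups (id INTEGER PRIMARY KEY, created_at TEXT)"
    )
    conn.executemany(
        "INSERT INTO user_signups (created_at) VALUES (?)",
        [
            ("2024-01-01 09:05:00",),
            ("2024-01-01 09:40:00",),
            ("2024-01-01 10:15:00",),
        ],
    )
    rows = signups_per_hour(conn)
    conn.close()
    return rows
```

Each returned row pairs an hour bucket with the number of signups in that hour, which is the shape a time series visualization expects.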

Integrating BindPlane Into Your Splunk Environment

Splunk is a popular logging platform and, in the case of Splunk Cloud, a metrics platform as well. The BindPlane Agent can integrate with Splunk in both directions: for incoming telemetry to a Splunk Indexer and for outgoing telemetry from a Splunk Forwarder. Integrating this way lets telemetry that Splunk does not natively support be sent in and, going the other way, lets Splunk telemetry be sent to other platforms.

Top Container Monitoring Tools

Container monitoring is the process of monitoring and managing containers deployed on a containerization platform such as Docker or Kubernetes. As containerization has grown more popular in software development and deployment, monitoring and managing containerized environments has become correspondingly more important.

IT Operations Analytics: An Introduction

Information Technology Operations Analytics (ITOA) is an analytics technology that uses datasets generated by IT systems to improve their efficiency and effectiveness as part of the practice known as IT operations management (ITOM). The primary goal of ITOA is to make IT operations more effective, efficient, fast, and proactive through the use of an organization’s own machine data.

The Hidden Problem Draining Productivity

Discover the alarming truth: over 80% of users choose to suffer in silence, never reporting the tech issues they encounter. Meanwhile, many companies rely on Application Performance Monitoring (APM) tools, assuming they have a watchful eye on their systems. However, a critical blind spot persists, particularly when it comes to detecting and alerting you about problems lurking within Microsoft 365’s cloud-based applications.