Operations | Monitoring | ITSM | DevOps | Cloud

Do you Grok It?

Most people are probably familiar with the word “grok” from Robert A. Heinlein’s novel A Stranger in a Strange Land, in which it is used to describe a deep, almost mystical understanding of something. ‍ Grok is also the name of a plugin for LogStash that enables you to parse and analyze log data using a syntax similar to regular expressions, but specialized for various log formats and fields.

How Dropbox rebuilt its logging stack with Grafana Loki after a data center went dark

Two years ago, a power outage knocked a Dropbox data center offline. It wasn’t just any data center. It was the only one where Dropbox hosted Grafana Loki, meaning engineers couldn’t access their log data. “We had considered a data center outage when we were rolling out Loki, but it had just never risen up in priority enough to get put into multiple data centers,” said Chris Hodges, an infrastructure software engineer at the cloud storage company.

Drive Public Sector Efficiencies of Scale with Splunk and AWS

Today’s public sector organizations are tasked with delivering a staggering amount of technology capabilities to support a growing set of digital services, meet IT modernization goals, and continue to protect against a wide range of attack vectors. Cloud technology adoption has played a significant role in ensuring that ongoing IT modernization not only aligns with each organization’s mission-strategic capabilities but also enables efficiencies of scale.

Elastic Cloud Serverless now generally available on Microsoft Azure

Elastic Cloud Serverless provides the fastest way to start and scale security, observability, and search solutions — without managing infrastructure. Today, we are excited to announce the general availability of Elastic Cloud Serverless on Microsoft Azure — now available in the EastUS region. Elastic Cloud Serverless provides the fastest way to start and scale security, observability, and search solutions without managing infrastructure.

Elastic's journey to build Elastic Cloud Serverless

Stateless architecture that auto-scales no matter your data, usage, and performance needs How do you take a stateful, performance-critical system like Elasticsearch and make it serverless? At Elastic, we reimagined everything — from storage to orchestration — to build a truly serverless platform that customers can trust. Elastic Cloud Serverless is a fully managed, cloud-native platform designed to bring the power of Elastic Stack to developers without the operational burden.

Logz.io Adds PrivateLink Support, Introduces the Parsing Rules Hub, and Significantly Enhances Parsing Capabilities

Today, we’re excited to announce support for AWS PrivateLink, allowing Logz.io customers to securely send logs and metrics through private VPC connectivity, without any data ever hitting the public internet. If you’re running workloads inside a VPC on AWS, this upgrade drastically improves your security posture, simplifies your networking architecture, and – most notably – reduces your data transfer costs (a lot).

Hidden Value in Sumo Logic: What Customers Often Miss -- Customer Brown Bag -- June 26th, 2025

Join us as Andy Makings reveals 12 powerful tips and tricks that many users overlook in Sumo Logic. These practical insights can streamline your daily workflows and unlock deeper, more actionable intelligence from your data.

The Road to Loki 4.0 (Loki Community Call June 2025)

In this Loki Community Call, we welcome back Ed Welch, Principal Engineer on the Loki team. We will be discussing with Ed what is next for Loki as we push forward to Loki 4.0. If you are interested, learn more about potential architecture changes, storage formats, and an open discussion on where Ed and the Loki team would like to see the future of Loki, then make sure you join us live and have your questions answered!

Navigating Shopware logs and slow pages in a real world scenario

A Shopware store goes from smooth to sluggish—pages take 10 seconds to load, even longer in some cases. What happened? In this post, we tell the true story of how one overlooked plugin setting nearly collapsed a storefront, and how it was resolved using native tools. If you’re shipping code in Shopware without clear performance observability, this is your wake-up call. Everything was working, until it wasn’t.

What is log tagging and how to configure it in Site24x7

In this video, learn what is Site24x7's log tag and how to configure, categorize, filter, and monitor your logs more effectively—so you can create your custom log tag that gives you full visibility into your logs or categorize them even better. Here’s what you’ll learn: Whether you're an IT personnel, DevOps engineer, or security analyst, this video will help you make smarter tags for monitoring decisions.

11 Best Log Monitoring Tools for Developers in 2025

Your checkout API just started throwing 500s during peak traffic. You SSH into production, tail logs across six microservices, and realize the database timeout buried in service's logs is causing cascade failures. Two hours later, you've fixed it, but you're thinking: "There has to be a better way." There is. Log monitoring tools centralize logs from your entire stack, making debugging systematic instead of archaeological.

Coralogix adds OTel-based service dependency tracking for distributed systems

Coralogix has released its APM Dependencies feature. This feature automatically surfaces and maps the relationships within and between your software and external services. It allows fine grained tracking of which endpoints within your APIs, depend on other endpoints, or external services and database tables.

Zero-effort alert migration from Prometheus to Coralogix

Having spent two decades in technical leadership, I’ve seen first hand what separates great development teams from merely good ones. It’s not about the number of features shipped or the elegance of the codebase — it’s about the ability to consistently deliver value to the customer through really great user experience.

The Visibility vs Cost Trap: A Dangerous Tradeoff

“You can’t investigate what you don’t have”. Every analyst knows the pain of missing context. You’re in the middle of a high-stakes investigation, but the logs you need are gone, archived weeks ago due to retention limits. Or worse, they were never collected in the first place to keep costs under control. This is the Visibility vs. Cost trap, and it puts analysts at a disadvantage every day.

Structured Logging in NextJS with OpenTelemetry

Traces tell you what happened and when. Logs tell you why. When something breaks, logs are often your first clue—and if they’re correlated with traces, they can cut debugging time down from hours to minutes. In this section, we’ll wire up end-to-end structured logging across both server and browser environments in your Next.js app, complete with trace correlation and SigNoz integration.

Fluentd vs Fluent Bit: A Side-by-Side Comparison 2025

Fluentd and Fluent Bit are both open-source data collection and processing tools, but they serve different purposes. Fluentd offers a comprehensive, plugin-rich architecture ideal for centralized log aggregation. Fluent Bit is designed for performance and efficiency, making it a better fit for edge devices and environments with limited resources. This Fluentd vs Fluent Bit comparison outlines their key differences, helping you decide which fits your infrastructure best.

Prometheus Logging Explained for Developers

Running apps in production? You need visibility fast. Traditional logging gives you scattered events. Prometheus gives you structured, queryable data that scales. In this guide, we’ll break down how to use Prometheus for logging-style observability, where it fits in your stack, and how to plug it into tools like Grafana or your cloud-native setup.

Building and Using a Custom #OpenTelemetry #Collector with #Bindplane

Check out the full ‪‪@bindplane community call in June. We explore building custom OpenTelemetry collectors with the OpenTelemetry Distribution Builder and using Bindplane's new Bring Your Own Collector feature. We showcase source and destination compatibility within Bindplane and how BYOC does not let you misconfigure a custom built collector.

Bindplane Recommendation Engine: Automatically Improve Telemetry Parsing #opentelemetry #collector

Check out the full ‪‪@bindplane community call in June. See how Bindplane instantly suggests improvements using its recommendation engine. This video explores how to automatically parse severity with default values, enhancing data analysis efficiency. Learn how to quickly optimize your setup.

Blueprints Are Pre-Built Processor Bundles #opentelemetry #collector #observability

Check out the full ‪‪@bindplane community call in June. Here we explore automated JSON parsing using a JSON Parse processor bundle that was added with the Blueprint feature in Bindplane. Learn how to parse JSON strings, extract fields, and set accurate timestamps without having to add any custom configs. Bindplane handles all the heavy lifting automatically.

Regex Log Parsing Made Easy with AI/LLM Support #opentelemetry #collector #observability

Check out the full ‪‪@bindplane community call in June. We explore Apache HTTP source and the new AI regex log parsing capabilities. We leverage a Bindplane processor for complex pattern matching, enabling efficient data processing. This guide demonstrates how to easily generate and apply regex patterns with AI support.

VictoriaLogs Unleashed: Cluster Version Now Available for Exceptional, Linear Scaling

You asked, and we listened! We’re thrilled to announce the release of the VictoriaLogs Cluster version – one of the most requested and anticipated updates from our user community. This marks a significant leap forward for VictoriaLogs, empowering users to handle log volumes and ingestion rates far beyond the limits of a single node.

Mastering Global Telemetry: How Cribl Puts You in Control

Let’s face it: managing global data infrastructure isn’t just hard, it’s “I-just-deployed-the-wrong-config-to-prod-again” hard. If you’re a Cribl Admin or Operator working across clouds, continents, and compliance regimes, your to-do list probably reads like a series of increasingly desperate Post-it notes. Sources. Destinations. Pipelines. TLS settings. Proxies. Dev, staging, prod. Repeat. Forever. But what if we told you there’s a better way?

Get more out of Sumo Logic: five log search hacks you'll actually use

Think Sumo Logic is only for query language pros? Think again. Whether you’re deep into JSON logs or just trying to make sense of a Linux error message, these five time-saving hacks turn anyone into a log-searching ninja, no regex, no complexity, just clicks. From instantly parsing values to filtering down with a tap, these tips will help you troubleshoot faster, work smarter, and feel more confident in your observability game. You’ve got logs, now it’s time to put them to work.

The hype is over: Generative AI is driving the evolution of search within enterprises

Discover how Accenture and Elastic are helping businesses seize the opportunities offered by generative AI When it comes to generative AI, enterprises need to think big. Shaving a few seconds off the time needed to draft an email is helpful, but the journey to real value begins when you apply AI at the enterprise level. A new partnership between Accenture and Elastic combines technical expertise and strategic excellence, enabling businesses to build the data foundations for a successful AI future.

Guide for Catching Regressions with GitHub Actions and CI/CD Monitors

This guide aims to help your team shift testing left, simulate real user behavior, and catch critical issues early as part of CI/CD, prevent regressions from reaching production by automating tests as part of your CI/CD and aborting deployments that contain issues. Synthetic monitoring is a great way to check important flows in production and make sure everything is working the way it’s supposed to.

Why Modern Incident Response Strategies Need Network and Service Intelligence: Part 2

In Part 1, we explored how aligning network visibility with IT service context empowers faster, smarter incident response. But what does this actually look like? Here in Part 2, we’ll go deeper into the challenges of traditional monitoring approaches, and how teams should look to move from fragmented alerts to unified insights – because when ITOps and NetOps can both see the “what” & “why” of the problem, actions become instinct.

Top Log Management Tools 2025

In a perfect world, log anomalies would speak clearly and never at 2 a.m. But in reality, log data is massive, alerts can be cryptic, and critical issues often get buried in the noise. That’s why choosing the right log management tool is crucial, it’s the first line of defense against downtime, breaches, and costly oversights. This blog breaks down some of the top log management tools on the market, what they do well, where they stand out, and how they fit into your stack.

How to Set Up a Syslog Server: A Complete Step-By-Step Guide

Syslog servers are essential for centralized log management, helping network engineers monitor, troubleshoot, and secure network devices efficiently. This guide walks you through setting up a syslog server from scratch, focusing on practical steps using rsyslog on a Linux system—a common and robust choice for syslog collection. Windows does not have a native syslog server, so you need third-party software.

Fluent Bit Helm Chart: Simplify Log Collection in Kubernetes

Collecting logs in Kubernetes often starts as a simple goal, and quickly turns into a game of “where did that log line go?” Between sidecars, DaemonSets, and countless config options, it’s easy to get lost. Fluent Bit helps cut through the noise. It's fast, lightweight, and plays well with Kubernetes. And when you deploy it using Helm charts? The setup becomes way more manageable. This guide covers the how and the why, without overcomplicating the what.

Top Five Reasons Telemetry Pipelines Should Be on Every Engineer's Radar

You’ve probably felt the pain: data pouring in from every corner of your stack, tools choking on volume, dashboards lagging behind reality, alerts firing (or worse, not firing) without context. If that sounds familiar, it’s time to get serious about telemetry pipelines. Whether you're an SRE trying to stabilize a flapping service or a developer navigating multi-cloud chaos, a telemetry pipeline helps you take control of the data firehose.

Elastic - The Search AI Company

You may not know it, but you probably use Elastic every day. By combining the transformative power of AI with our deep expertise in search and vector databases, we are changing what's possible with search. Our Search AI Platform empowers organizations to have a conversation with all their data, build powerful GenAI applications, immediately diagnose root causes in observability, and hunt for threats at enterprise scale.

5 Ways to Optimize Your OpenSearch Cluster

OpenSearch is a powerful, scalable search and analytics engine that can do amazing things for logging, observability, and full-text search. But like any distributed system, it only performs well if you keep it properly tuned and healthy. Ignore it, and you risk slower queries, higher costs, and even data loss. Here are five practical tips to keep your OpenSearch cluster running smoothly and efficiently.

The best of both worlds with the Splunk Cloud Platform

This video describes how the value of migrating to the Splunk Cloud Platform provides a comprehensive environment that offers everything from efficiency and sustainability to agility and security plus and lower your costs. How can you be sure? With the Splunk Cloud Calculator we’ll show you the real dollar savings you could get from migrating to the Splunk Cloud Platform.

Why companies keep migrating to Coralogix

As businesses scale, so do their observability needs, but many find themselves stuck with costly, inflexible platforms that no longer serve them. Despite mounting frustrations, the complexity of migration keeps companies from making a change. The risk of losing critical data, disrupting workflows, or rebuilding everything from scratch often outweighs the benefits of switching. Most vendors offer little to no migration support, forcing teams to manually reconfigure dashboards, alerts, and integrations.

The truth you can't afford to miss: Listen as your logs spill the tea

When you hear “spill the tea,” you probably think of pop culture, not outages or anomalies. But the origin may surprise you: before it was slang for juicy gossip, ‘tea’ was actually ‘T,’ which represents truth. We know what you’re thinking: “Are you trying to say ‘spilling the tea’ is a good thing?” And yes, that’s exactly what we’re saying, especially when your logs are doing the talking.

Getting OpenTelemetry Data Into Graylog

OpenTelemetry is emerging as the common framework for collecting observability data, and for good reason. It’s vendor-neutral, open source, and designed to collect traces, metrics, and logs in a consistent way. But while most of the buzz is around tracing and metrics, let’s not forget: logs are still the backbone of investigation and response. That’s why Graylog now supports native collection of OpenTelemetry data over gRPC.

Cisco and Splunk Strengthen Enterprise Digital Resilience in the AI Era

In an era where hybrid environments and AI-driven innovations redefine enterprise operations, organizations face increasing complexity, disruption, and vulnerability in their systems. To overcome this growing challenge, Cisco and Splunk are working together to harness the power of AI to help customers ensure that digital resilience is an inherent part of their systems.

How to Collect .NET Application Logs with OpenTelemetry

Observability is essential for maintaining and scaling modern applications. With.NET 8, Microsoft has enhanced support for observability using OpenTelemetry. In this post, we explore how to monitor.NET 8 applications logs with SigNoz, an open-source observability platform, using the OpenTelemetry Protocol (OTLP) exporter.

Smarter Telemetry Pipelines: The Key to Cutting Datadog Costs and Observability Chaos

Log volume is exploding, costs are rising, and most teams are stuck duct-taping together short-term fixes. During our webinar, "Optimizing Log Management in Datadog: Cut Costs Without Losing Insights," we discuss how DevOps and engineering leaders are navigating the growing pains of observability, especially in environments where tools like Datadog are mission-critical but challenging to manage. Here’s a recap of the key takeaways.

Migrate historical logs from Splunk and Elasticsearch using Observability Pipelines

Migrating to a new logging platform can be a complex operation, especially when it involves both active and historical logs. Observability Pipelines offers dual-shipping capability, making it easy to route active logs to your new platform without disrupting your log management workflows. But migrating years worth of historical logs—which are critical for investigating security incidents and demonstrating compliance with applicable laws—requires a different approach.

A Complete Guide to Linux Log File Locations and Their Usage

Linux log files are text-based records that capture system events, application activities, and user actions. They're stored primarily in the /var/log directory and provide essential information for debugging issues, monitoring system health, and maintaining security. This guide covers the most important Linux log files and a few detailed techniques for reading and analyzing them.

Top 5 Open Source Log Management Tools (and How to Choose the Right One)

Managing logs at scale is no longer just about storing text—it’s about gaining insights fast, keeping systems healthy, and troubleshooting in real time. With cloud-native architectures becoming the norm, the pressure is on for modern teams to adopt log management tools that are fast, scalable, and easy to use. But with so many options, how do you choose the right one?

The One Where We Show You Copilot Editor

Copilot Editor is like an AI-powered Rosetta Stone for telemetry. It helps Cribl users take raw, messy telemetry data and turn it into standardized, analytics-ready formats. The most important piece? It puts YOU in control. Our human-in-the-loop design means that users have full control over and visibility into what’s happening with their critical data, preventing AI-induced mistakes. Watch this fun demo with the AI product team to show Copilot Editor's true value to the average Cribl user!

Fluentd vs Logstash: In-Depth Comparison of Two Popular Log Collectors 2025

In modern observability stacks, log collection is a critical component. Among the most widely adopted logs collector are Fluentd and Logstash. Both tools are designed to collect, process, and forward logs to various destinations like Elasticsearch, Kafka, and cloud services. However, the differences between FluentD and Logstash lie significantly in their design, performance, plugin ecosystems, and user experiences.

AI + Dark Mode: Introducing AI-Powered Insights and The Long Awaited Dark Mode

Join the live stream at 11 am ET, here. Launch Week’s Friday drop delivers two of the most-requested upgrades we’ve ever shipped: Together, they turn Bindplane into a cooler , and smarter , place to manage observability and SecOps telemetry. A full suite of extensive AI features will be rolling out over the coming weeks. This is just the beginning!

From Downtime to Uptime: Monitoring Tools and Techniques for Systems, Websites, APIs, and More

Recently, while visiting a friend in a local hospital, I found myself facing a frustrating distraction: trying to pay parking fees using USSD (a mobile text-based system for quick transactions). The service was either painfully slow or not working at all. I wasn’t alone. Other visitors were just as exasperated, and parking attendants stood idle, their handheld devices frozen in endless loading loops.

Top Features of Splunk Observability Cloud for Engineers

In this video we’ll walk you through a demonstration of Splunk Observability Cloud’s key capabilities. You’ll see how you can monitor Kubernetes cluster health in Infrastructure Monitoring, and alert on your services’ health using AutoDetect Detectors and Alerts. We’ll then take a look at traces and metrics in APM, and use Related Content to find correlated log entries of error traces. Then we’ll use AlwaysOn Profiling to troubleshoot long duration traces for our service.

How to Monitor Frontend Memory Usage

First of all, by frontend memory usage I mean the amount of memory that a user’s browser needs when using your website or webapp. Secondly, do you have any idea how much browser memory your website or webapp requires? Or do you know if or how much the memory footprint of your website/webapp has changed over the last few months? Or after the recent changes or releases you made? I’m guessing you don’t. Yet, this is important to monitor to avoid a bad user experience.

Solve your MTTR mysteries faster with Sumo Logic

Picture this: a crime scene where the evidence is scattered across five different rooms. There’s a footprint in one, a shattered window in another, a stray shoe on the stairs, and a witness across the street, who only saw part of what happened. Each clue matters in solving the case, but none of them tells the full story on their own.

Why Cribl Copilot Editor is Built for the Human, First and Foremost

I’m genuinely excited about what we're rolling out with Copilot Editor, an update to our AI that’s truly packed with new capabilities designed to help you automate pipeline development. You can read about these capabilities here. I wanted to take a moment to share our thinking on a core principle that guides how we build, especially regarding the impactful, and sometimes daunting, world of generative AI.

Blueprints: Ready-Made Processor Bundles For Your Telemetry Pipelines

We’ve noticed a lot of our customers spend countless hours building and configuring processors. Either parsing JSON, standardizing log formats, normalizing timestamps, masking PII, de-duplicating logs, the list never ends. Most work revolves around recreating the same processor bundles in multiple processor nodes. Bindplane’s new Blueprints solves that boring, repetitive work by providing pre-built processor bundles you can drop into any pipeline with a single click.

Shift-Left Monitoring for GitHub and Vercel Workflows

A recent LinkedIn poll by Peter Zaitsev asked: “What is the most common preventable cause of downtime in your environment?” Guess what most respondents said it was? Surprise, surprise – the top answer is Deploying Broken Code, with 57% of respondents selecting it. This reinforces how critical it is to catch issues before they hit production.

Scaling Observability: How We Designed Bindplane to Manage 1,000,000 OpenTelemetry Collectors

Join the live stream at 11 am ET, here. Platform teams tend to start with just one, or in some cases a handful of OpenTelemetry (OTel) Collectors usually running in gateway mode. They then embrace the benefit of a vendor-neutral, standardized, telemetry collector for unified logs, metrics, and traces.

Upgrade Readiness: Unlocking Success with the Splunk Health Assistant Add-On

Splunk recently announced exciting updates and significant modernizations for the upcoming releases of Splunk Enterprise and Splunk Cloud Platform. This blog is the first in a series to help prepare your organization for these changes by exploring upgrade readiness best practices. This first installment will highlight the Splunk Health Assistant Add-On, a vital tool that supplements the Splunk Enterprise Monitoring Console, designed to streamline your transition to the next version of Splunk Enterprise.

Elastic achieves AWS Education ISV Partner Competency, strengthening education solutions portfolio

Advancing digital transformation in education through Search AI and cloud innovation We’re thrilled to share that Elastic has achieved the AWS Education ISV Partner Competency. This prestigious designation recognizes Elastic as an Amazon Web Services (AWS) partner that has proven expertise in delivering high-quality solutions that help education institutions support successful student outcomes while protecting security and privacy.

Splunk on SGTech - Tech Transforms Life

With the explosion of data across endless environment, devices and applications, organisations and government agencies are faced with a pressing challenge of getting their data house in order to achieve efficiency, transparency, security and governance. Learn how Splunk helps businesses like Singapore Airlines, LG Electronics and DANA fintech group transform complex data into valuable business outcomes and strengthening digital resilience.

Why Datadog Falls Short for Log Management and What to Do Instead

Datadog may be the default choice for all-in-one observability, but its logging experience takes a back seat to the broader platform. Logs are primarily designed to feed into metrics and traces, which leads to tradeoffs such as slower search, complex workflows, and a UI that isn’t optimized for log investigations. As a result, Datadog doesn’t align with how developers actually troubleshoot.

Map, Transform, Filter: How Copilot Editor Helps Teams (and Their Pipelines) Have It All

Ever spent a week wrangling log pipelines just to get your SIEM to stop screaming about missing fields? Wasted way too much time stripping out noisy events and reformatting data for analytics? You’re not the only one. If you work in Security or ITOps, you know the pain: every new data source means another round of schema headaches, more manual mapping, endless field transformations, and a quick prayer that you didn’t break something critical (or let in a flood of junk events).

Top five metrics to monitor in IIS Logs

When managing and troubleshooting IIS (Internet Information Services) web server performance, logs are a critical resource. They capture detailed information about every request and response so your team can detect issues quickly. Let’s walk through the main IIS log formats, explore a sample log file, and break down five key types of IIS metrics you should monitor.

Graylog vs ELK: Which Log Management Solution Fits Your Stack?

Your app logs start simple—maybe a few print() or logging.info() calls. But in production, things get noisy. Thousands of log lines per minute, scattered across services, and it’s hard to know what matters. This is when tools like Graylog and the ELK stack help. They let you collect, search, and make sense of logs, but they do it in different ways. This guide breaks down how each one handles setup, scale, and day-to-day use.

Your Collector, Your Rules: Introducing BYOC and the OpenTelemetry Distribution Builder

Join the live stream at 11 am ET, here. OpenTelemetry’s super-power has always been: Choice. Yet, most observability vendors still insist you run their collector. Today we’re removing that last point of friction. With Bring Your Own Collector (BYOC), Bindplane now accepts any upstream-compatible build, recognizes exactly which receivers, processors, and exporters it contains, and adapts the UI and configuration workflow on the fly.

Bindplane Launch Week 1 [June 2-6] - Day 2 - Custom OTel Collectors

The point of OpenTelemetry has been to give you a choice. Yet, most observability vendors still insist you run their collector. We’re removing that last point of friction. With Bring Your Own Collector (BYOC), Bindplane now accepts any upstream-compatible build, recognizes exactly which receivers, processors, and exporters it contains, and adapts the UI and configuration workflow on the fly. No forks, no vendor stamp—just the collector you already trust, fully managed by Bindplane.

Unify telemetry, own your pipeline: New integrations for Windows, Network Telemetry, and Cloud Storage

Today, we're expanding on the integrations front, and launching new integrations for Windows events, network telemetry, and cloud storage. Here's a quick tour of what's new and why it matters.

Sigma Specification 2.0: What You Need to Know

Sigma rules have become the security team equivalent of LEGO bricks and systems. With LEGO, people can build whatever they can imagine by connecting different types of bricks. With Sigma Specification 2.0 rules, security teams can create vendor-agnostic detections without being limited by proprietary log formats. In response to the Sigma rules’ popularity, the team that built them updated them in August 2024, giving security teams new capabilities.

Service Level Objectives -- Customer Brown Bag -- May 29th, 2025

This technical session on Service Level Objectives (SLOs) will cover the fundamentals of SLOs, SLIs, and SLAs, along with how to define, monitor, and optimize them for system reliability. Through hands-on demonstrations, you'll learn to set up SLOs in Sumo Logic, track performance using logs, metrics, and tracing, and configure proactive alerts for incident response. By the end, you’ll have the skills to implement and manage SLOs effectively, ensuring your services meet reliability goals while balancing performance and cost.