Operations | Monitoring | ITSM | DevOps | Cloud

How to set up Grafana Mimir using Ansible

Gerard van Engelen is a seasoned DevOps engineer who ensures the quality of products by drawing parallels between complex issues and simpler, everyday scenarios. This approach helps in delivering value, ensuring that products are not only built correctly but also offer the right functionalities. Ansible is popular with system administrators and DevOps professionals who use it for automating IT tasks such as configuration management, application deployment, and orchestration.

Introducing Mobile Real User Monitoring (RUM)

Human attention spans are seemingly shorter than ever, and your mobile application users are, unfortunately, no exception. Over 70% of users abandon an app if it’s taking too long, with half of these users waiting no more than three seconds. Even minor delays or errors can lead to significant user drop-off, negatively impacting your app’s success and user satisfaction.

Predictive Analytics Pipelines: Real-World AI, Predictive Maintenance, and Time Series Data

There’s so much talk about AI these days that it seems we quickly forget that AI isn’t a single type of technology. It’s a category, almost an umbrella term for a wide range of different technologies, applications, and approaches. The terms “Generative AI” and “Machine Learning AI” (often referred to as “Real-World AI”) describe two different branches that fall under the broader AI heading.

Wireless Troubleshooting Made Easy - How Monitoring Wi-Fi Helps

There is no question that wireless networks are taking over. Offices may still have Ethernet cables to each cubicle, but they usually go unused. Wi-Fi is the new LAN. So many devices, tablets, smartphones and even some laptop-type devices, are now wireless only. Today, Wi-Fi is often the primary way end users connect. “While a wired Ethernet connection is generally faster and more reliable, it forces users to be tethered to their desks.

2.5X faster and 88% cheaper error resolution with GPT-4o mini and Raygun

In May, GPT-4o was released, refining the GPT-4 architecture with native multi-modal input support, faster speeds, and a cheaper price per token. This week, with the release of GPT-4o mini, it’s even more cost-effective and quicker. This model is considered better than GPT-3.5 Turbo, being faster and smarter—a win all around. Let’s put it to the test in a real-world application to see just how good it is for software developers.

Monitoring Third Party Vendors as an Ops Engineer/SRE

Why should you monitor your third-party Cloud and SaaS vendors if you are in SRE/Ops? As part of an SRE team, your primary responsibility is ensuring the reliability of your applications. What makes you responsible for monitoring services that you don't even manage? Third-party services are just like yours - with SLAs. And outages happen, affecting you as well as many others who depend on them.

The Microsoft-CrowdStrike Outage: An In-Depth Analysis

On July 19, 2024, a significant outage impacted globally, causing widespread disruptions across various industries. This outage was primarily linked to a faulty update from CrowdStrike’s Falcon Sensor, which led to severe issues on Windows systems. CrowdStrike is a leading cybersecurity company that specializes in protecting businesses from online threats.

Securing the Foundation of Cribl Copilot

Integrations are the bread and butter of building vendor-agnostic software here at Cribl. The more connections we provide, the more choice and control customers have over their unique data strategy. Securing these integrations has challenges, but a new class of integrations is creating new challenges and testing existing playbooks: large language models. In this blog, we are going to explore why these integrations matter, investigate an example integration, and build a strategy to secure it.

How to Build a Custom OpenTelemetry Collector

Telemetry data collection and analysis are important for businesses. We're diving right in to explain the ins and outs of the OpenTelemetry Collector, including its core components, distribution selection, and customization tips for optimal data collection and integration. Whether you're new to OpenTelemetry or expanding your capabilities, this will help you effectively use the OpenTelemetry Collector in your observability strategy.

Streamlining Debugging with Lightrun Snapshots: A Superior Alternative to Trace Logging

According to a recent study, failing tests alone cost the enterprise software market an astonishing $61 billion annually. This figure mirrors the vast number of resources devoted to rectifying software failures, translating into about 620 million developer hours lost each year. On average, engineers spend 13 hours to resolve a single software failure, a statistic that paints a stark picture of the current state of debugging efficiency.