Operations | Monitoring | ITSM | DevOps | Cloud

From "rebooting" to reliable and secure applications: Optimizing the customer experience

Not so long ago in my career, it was relatively acceptable for infrastructure or development teams to solve a problem by rebooting a server or just “turning things off and on again.” It didn’t matter what caused the problem or how long the fix would last, provided things worked for now. Security teams were always held to a different standard.

Mobile app observability with OpenTelemetry, Embrace, and Grafana Cloud

We are excited to announce an expansion of our partnership with Embrace to bring mobile observability to our users using open standards like OpenTelemetry. We first worked with Embrace last year when they created a plugin for Grafana that gives mobile teams an easy way to visualize and analyze real-time mobile metrics directly in a Grafana dashboard.

Your Guide to Observability Engineering in 2024

It may sound complicated and daunting, but so much of observability is about discovering the unknown unknowns in your critical systems. The capabilities of observability engineering can help you make those discoveries. Most organizations have some form of monitoring, alerting and troubleshooting, which can be adequate up to a point but falls short when trying to determine the root cause of unexpected outages.

The Importance of Observability for Healthcare Providers

The systems and data that healthcare providers use and process are fundamental to their successful operation. Therefore, these organizations must invest in appropriate and powerful observability solutions that enable them to effectively monitor their systems and valuable data. These tools and solutions allow healthcare providers to securely manage, deliver, and ensure uptime for their entire IT infrastructure.

Mastering Centralized Logging with OpenSearch

OpenSearch is well suited to centralized logging: it offers powerful querying and analysis capabilities, and it’s highly scalable and flexible. In this article, we will explain why you should use OpenSearch for centralized logging, then show how to configure centralized logging in OpenSearch.

Reducing MTTR and the Hidden Costs of Downtime Through AI & Automation

Of all the KPIs that gauge the health and operational fitness of an enterprise, Mean Time to Repair (MTTR) after an outage or downtime is one of the most crucial. Yet while MTTR is a universally recognized metric, many organizations still fail to consider the total cost of MTTR when deciding where and how to invest in their IT environments.

INTEGRATE 2024 Day 2 Highlights

Dan Toomey, Senior Integration Architect at Deloitte Australia, kicked off the session by highlighting the essential role of business rules in software development. He emphasized the significance of managing evolving and complex business rules, advocating for the use of effective tools like Business Rules Management Systems (BRMS) to safeguard code and services.

Adding config to AWS ECS tasks

When deploying Docker containers to AWS ECS, you can encounter a situation where you want to run an image that requires some configuration. For example, let's say you wanted to run Vector as a sidecar to your main application so you can ship your application's metrics to a service like Honeybadger Insights. To run Vector, you only need to provide one configuration file (/etc/vector/vector.yaml) to the image available on Docker Hub.
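
One way to get that single file into the container, sketched here under stated assumptions and not necessarily the approach the article goes on to describe, is to pass the config as a base64-encoded environment variable and have the sidecar write it to /etc/vector/vector.yaml before starting Vector. The image tags, the `VECTOR_CONFIG_B64` variable name, the `build_task_definition` helper, and the sample Vector config are all hypothetical. The sketch also assumes the Vector image ships a shell with `base64` (for example an Alpine-based variant), and it produces only a fragment of a task definition (no CPU, memory, or networking settings).

```python
import base64
import json

# Hypothetical Vector config: the source and sink below are placeholders only.
VECTOR_CONFIG = """\
sources:
  app_metrics:
    type: statsd
    address: 0.0.0.0:8125
    mode: udp
sinks:
  debug_out:
    type: console
    inputs: [app_metrics]
    encoding:
      codec: json
"""


def build_task_definition(app_image: str, vector_image: str, vector_config: str) -> dict:
    """Build a partial ECS task definition with a Vector sidecar.

    The config travels as a base64 string in an environment variable
    (VECTOR_CONFIG_B64 is an assumed name) and is decoded at startup.
    """
    encoded = base64.b64encode(vector_config.encode("utf-8")).decode("ascii")

    # Shell one-liner run by the sidecar: write the config, then start Vector.
    startup = (
        "mkdir -p /etc/vector && "
        'echo "$VECTOR_CONFIG_B64" | base64 -d > /etc/vector/vector.yaml && '
        "exec vector --config /etc/vector/vector.yaml"
    )

    app = {"name": "app", "image": app_image, "essential": True}
    sidecar = {
        "name": "vector",
        "image": vector_image,
        "essential": False,  # the task keeps running if the sidecar stops
        "environment": [{"name": "VECTOR_CONFIG_B64", "value": encoded}],
        "entryPoint": ["sh", "-c"],  # assumes the image includes a shell
        "command": [startup],
    }
    return {"family": "app-with-vector", "containerDefinitions": [app, sidecar]}


if __name__ == "__main__":
    task_def = build_task_definition(
        "example/app:latest",             # hypothetical application image
        "timberio/vector:latest-alpine",  # assumed Alpine-based tag
        VECTOR_CONFIG,
    )
    print(json.dumps(task_def, indent=2))
```

The printed JSON could be merged into a full task definition before registering it. Keeping the config in an environment variable avoids building a custom image, at the cost of exposing the config to anyone who can read the task definition, so this pattern is a poor fit for configs that embed secrets.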

Top challenges of digitization and how network traffic analysis can help

As organizations scale, technology adoption also increases across industries to meet major performance and security requirements. This raises the need to support different networks and growing volumes of traffic and manage the bandwidth so that every application is accessible around the clock. Enterprises also need to ensure they leave no room for attacks or downtime. But is digitization that easy?

API update: Sessions, Pages, and Customers

Today, we’re excited to roll out 6 new endpoints for the Raygun API, making it simpler than ever to query Sessions, Pages, and Customers. Raygun’s Real User Monitoring helps you track and enhance your front-end and mobile page speed performance. It analyzes user sessions and page views to calculate your overall page speed. Previously, this required someone to log into Raygun, find the right application, and manually inspect sessions and page views in Real User Monitoring.