Operations | Monitoring | ITSM | DevOps | Cloud

Business Entities

We’re thrilled to share Version 4.4 release with new feature updates: Business Entities for tracking customers, partners, and departments, improved control for transfer switch power devices, streamlined bulk actions for Business Entity association with assets, enhanced import capabilities for connections and circuits, and advanced search function to filter by Business Entity for easier, more efficient work.

Site Reliability Engineer (SRE) Interview Questions

In this article we will cover the top 25 SRE interview questions to help you prepare for you next SRE interview. As customer demand for reliable and high-performing services continues to grow, the role of Site Reliability Engineers (SRE’s) continues to grow in importance. Whether you are a seasoned SRE or a recent graduate preparing for an SRE interview, these questions will be invaluable for determining your level of expertise and understanding where you need to grow.

Serverless observability: How to monitor Google Cloud Run with OpenTelemetry and Grafana Cloud

OpenTelemetry has emerged as the go-to open source solution for collecting telemetry data, including traces, metrics, and logs. What’s especially unique about the project is its focus on breaking free from the reliance on proprietary code to offer users greater control and flexibility. As a senior solutions engineer here at Grafana Labs, I’ve spent a lot of time exploring OpenTelemetry, including in my spare time.

A look at Azure monitoring and troubleshooting

Even now, plenty of businesses are still making the shift to the cloud. Chief decision-makers are plagued by fears about availability, potential downtime and security. Organizations adopting Microsoft Azure need to be able to confidently make the transition without interruptions, which requires building out a strategy for monitoring your Azure environment.

Engineering Onboarding: The Key to DevEx Success

Engineering onboarding comprises more than just allocation of credentials and orientation to top tools. Depending on staffing, capacity for resource and permission allocation, and maturity of self-serve tooling, it can take weeks or even months for engineers to contribute their first meaningful PR—a common measure of "onboarding completeness." So how should engineering leaders think about optimizing their processes to improve developer effectiveness, velocity, and confidence?

Internet Stack Map: A gamechanger for Internet Performance Monitoring

In this blog, we are going to focus on Internet Stack Map, a milestone development for Internet Performance Monitoring. Our CEO, Mehdi Daoudi, sees this as Catchpoint’s iPhone moment. Why? 15+ years of innovation laser focused on Internet Performance Monitoring have been distilled into an ingeniously simple AI-powered dependency map of everything that impacts an application, customer, or user.

And the Killer App for Observability is...Integrations

Editor’s Note: This is the third installment of a series of blog posts previewing our State of Observability 2024 survey report. So far in this blog series, we’ve looked at where enterprises and MSPs are in their observability journeys and the benefits and challenges of their observability deployments. This week, we look at whether the observability story so far is more about replacing or enhancing existing IT management tools.