Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Website Downtime: Cost, Impact, and Best Solutions

Given the advanced digital age we are in now, a website's uptime and availability determine the success of businesses of all shapes and sizes. There are numerous challenges that each organization must face and overcome to ensure business continuity. One of the top in this list of challenges is website downtime. Your website must always be up so visitors can access it anytime and anywhere. However, if your website is frequently down, it will be tagged as unreliable, which reflects poorly on you.

How OpenTelemetry Powers Observability @ Canva

Canva is an online design platform with a mission to empower everyone in the world to design anything and publish anywhere. To guarantee our customers have the best experience using our products, Canva engineers rely on the tools and products provided by the Observability team to measure and quantify critical application health and performance metrics. Canva’s Observability team uses OpenTelemetry components to collect, transform and export standardised telemetry data from our applications and platforms. Canva has been an early adopter of OTel using OTel SDK for tracing and the collector gateway to process and export telemetry to various tools.

Putting Customers First and Amplifying Our Core Values

Cribl places high importance on its core values of Customer First, Always; Together; Curious; Irreverent but Serious, and Transparent. We strive to embody these values every day, and a particular customer issue recently enabled us to exemplify them to that customer. Recently, the Cribl Support, Software Engineering, and Product Management teams worked together with our largest Cribl Cloud customer to resolve throughput issues that arose when integrating Cribl.Cloud with Azure Event Hubs (EH).

Our Journey Into Cutting Kubernetes Costs by 40%

As companies start their Kubernetes and cloud-native journey, cloud infrastructures and services grow at a rapid pace. This happens all too often as organizations shift left without thorough controls, which can lead to overallocating and overspending on their Kubernetes environments. Organizations running workloads in the cloud can put budgets at risk when they lack information about key facts.

Update: Expanding our new API functionality

Today we continue on our journey towards being API-first with two new updates – non-expiring tokens and regenerating API keys. As you may have seen, late last month we made the exciting announcement about the launch of our Public API. Delivering a world-class API is a core focus here at Raygun. We’re on a mission to give you greater control over how you can extract, manipulate, and visualize the powerful insights surfaced in Raygun, so that you can use them in exciting new ways.

Monitoring your router with MetricFire

Having a healthy network is essential for any online business. Downtime can be costly; if you're not monitoring your devices, you could have more downtime than you realize. Routers, in particular, are essential for keeping a network running. Routers are critical equipment for any business that relies on the Internet to communicate with customers or clients. A slow or faulty router can grind business to a halt, which is why keeping an eye on your router's performance is essential.

5 key factors to consider before choosing network mapping software

With networks now more distributed than ever, network maps have become key components to enabling comprehensive and effective network monitoring and management. Helping IT admins visualize their complex IT infrastructures and draw actionable insights from the end-to-end mapping of network nodes, network maps offer many advantages. IT admins rely on these maps for drilling down to the cause of network issues, troubleshooting more quickly, and enhancing resource management.

Cloud & observability: hot topics from AWS re:Invent

A couple of weeks ago, I had the opportunity to attend AWS re:invent, one of the biggest cloud industry events of the year. An event so massive and big that only AWS can pull it off – 50,000 people marching across half a dozen of the finest hotels on the Las Vegas strip. The expo hall alone would have taken more than a couple of days to cover all the vendor booths spread across the expansive Venetian convention center.

LM Envision Application Topology: A New Way To Visualize Application Connections

Finding service relationships and diagnosing bottlenecks within an application can be incredibly difficult to accomplish, especially if your applications are spread across multiple services, with both internal and external service calls. Although users could get granular visibility into individual traces using our Distributed Tracing features, they couldn’t see how their services were connected across different traces.