Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Sponsored Post

Uptime monitoring: How to track your network availability, 24/7

When it come to measuring an organization's ability to support end users and provide services, network uptime can be a great yardstick. An inability to ensure optimum uptime can negatively impact your business delivery, resulting in financial and reputational losses. If you're doing it manually, ensuring 24/7 network uptime is a challenging exercise requiring considerable resources. It is way more convenient to have a monitoring mechanism in place that can monitor network uptime and notify the network admin proactively about any bottlenecks that might lead to network downtime.

How a corrupted file took down 12,000 flights across the US: Real-world consequences of minor IT negligence

The airport is shutdown in the midst of a busy time, masses of people are stranded, pilots wait in the cockpit awaiting ground information, there’s confusion and panic among the crew. This could easily be a scene from Die Hard 2 where the villains take over an airport and seize control of all electrical equipment. But, hate to break it to you, this actually happened. Is it possible for one person to disrupt the entire nation’s aviation system? Apparently, yes.

Common Errors in Next.js and How to Resolve Them

Bugs are one of the most troubling aspects of software development; they appear out of nowhere and cause everything to stop working. Most of the time, they can be resolved quickly; however, others can be gruesome and take hours/days to fix. Next.js is one of the most popular web development frameworks in the current world, and as a programming tool, it didn’t escape the bug dilemma either.

Hosted StatsD vs. StatsD

When you are designing and building applications, you should consider how to monitor them once they become live. You do not want to be blindsided by errors and degrading performances as you operate them. When your applications fail to provide optimal performance, it can broadly impact your business. Engineers will often be distracted to investigate and fix the issues. Customers will complain. It can eventually hit your bottom line.

Thousands of Insights at a Glance With Coralogix Alert Map

An effective alerting strategy is the difference between reacting to an outage and stopping it before it starts. That’s why at Coralogix, we’re constantly releasing new features that redefine how alerts are consumed, to enable teams to push their ambitions even further, release with confidence, and tackle issues proactively. Alerts Map is now an indispensable tool for that mission.

Getting Started with Cribl Stream: Your First Hundred Days

Congratulations, you’ve worked hard to get Cribl Stream into your technology stack. Buying a new tool is a non-trivial task, so be sure to pat yourself on the back. Now the work starts: You have to deploy Stream and get full value to justify the cost. It’s critical to get started with the right plan to accelerate delivery and maximize the value of Stream. I’m going to start by sharing some ideas about how to get started with Cribl Stream in your first hundred days.

Applying Lessons Learned from Baking Pizza to Kubernetes Observability

Baking a delicious pizza in a wood-fired oven requires a combination of skill, experience and the right tools. The same is true for achieving optimal observability in a Kubernetes environment. In this post, we'll explore some of the lessons learned from baking pizza in a wood-fired oven and apply them to the world of Kubernetes observability.

Jack Henry Incorporates BubbleUp and Honeycomb's New Service Map to Quickly Debug Issues and Get Ahead of Customer Latency

Not long ago, we announced the launch of Honeycomb’s Service Map, a new feature that gives users the ability to get an overall, filterable view of their system and how everything is connected, along with some exciting new enhancements to BubbleUp. What’s the story behind these changes? They make it even easier for developers to zero-in on issues, even when they are hidden in billions of lines of code.