Operations | Monitoring | ITSM | DevOps | Cloud

OnCallogy Sessions

Being on call is challenging. It’s signing up to be operating complex services in a totally interruptible manner, at all hours of the day or night, with limited context. It’s therefore critical to have proper on-call on-boarding procedures, offer continuous training sessions, and continuously improve documentation. We also need to make sure people feel safe by providing ways to reduce their stress, and make room for questions to surface all sorts of uncertainties around our operations.

Observability for K-12 and higher education: Top 4 challenges and how monitoring can help

K-12 and higher education institutions experienced massive changes in 2020 with the shift to online learning. New challenges arose, such as an increase in cybersecurity threats, students and staff requiring 24/7 access to their computers, and the need to update and improve infrastructure and applications IT infrastructure monitoring allows K-12 and higher education institutions to face common technology challenges both reactively and proactively.

Where are Monitoring Tools Headed? Help from Innovation Insight for Observability by Gartner

Enterprises are getting fed-up with their existing system monitoring tools. Despite decades of investments in monitoring tools, many businesses fail to notice a problem in their digital services until a customer calls to complain about it. So, it’s no surprise that businesses are looking for better solutions, and this has sparked an increasing interest in observability, according to Gartner in its updated report, “Innovation Insight for Observability.”

The Bird is the Word: Getting Up and Running Fast on Humio, by Crowdstrike

I’ve been in the log data analytics space for years, and I have loved seeing the technology and methodologies change and evolve. One of my favorite changes has been the emergence of index-less solutions, and Humio has a great solution here. If you haven’t heard of Humio, you should check out their index-less log management solution for yourself (free up to 16 GB/day too).

Kubernetes 1.24 - What's new?

Kubernetes 1.24 is about to be released, and it comes packed with novelties! Where do we begin? Update: Kubernetes 1.24 release date has been moved to May 3rd(from April 19th). This release brings 46 enhancements, on par with the 45 in Kubernetes 1.23, and the 56 in Kubernetes 1.22. Of those 46 enhancements, 13 are graduating to Stable, 14 are existing features that keep improving, 13 are completely new, and 6 are deprecated features.

Five Key Monitoring Capabilities for Top Payment Gateway Performance in E-commerce Applications

Payment gateway outages and performance issues have a disruptive effect on your business. When customers cannot complete a transaction, it leaves them frustrated and anxious. Even if it is not an outright outage, customers are wary of a flaky payment experience. They are often reluctant to retry the transaction for fear of being charged twice. This results in abandoned purchases and lost revenue.

How to Time Your Data Collection with Telegraf Agent Settings

Many Telegraf and InfluxDB users often spend a lot of time finding that perfect balance of getting the data they want in while not writing in too much data that they have to deal with unnecessary data in their database. This blog post will give you a better understanding of Telegraf’s data collection settings and help you fine-tune your configuration.

Network Nickels and Dimes: Government IT on a Budget

State and local governments have always had to get a little creative when it comes to efficiently managing the government IT budget—they have to ensure they can stretch those dollars as far as they’ll go. However, inefficient network management practices are only adding to the challenges of local government IT modernization, and sucking the life out of their information technology budget.

What is remote network monitoring?

Remote network monitoring is a technical specialty that was born almost at the same time as networks themselves. Since then, many strategies have emerged when it comes to monitoring network elements. In this article we will talk about the current techniques based on SNMP polling and network statistic collection through Netflow, and we will also mention outdated systems such as RMON. Most techniques are purpose-oriented, so they are especially useful.

Elasticsearch Release: Roundup of Change in Version 8.1.0

Elastic released a major version of its platform on February 10, 2022. Version 8.0.0 is the latest major version. There has already been a new minor release to version 8.1.0, and there are anticipated minor and patch releases coming as Elastic rolls out new features and fixes. The latest release is the first significant revision since April 2019, when version 7.0.0 was generally available. Users can find a complete list of release notes on the Elastic website.