Operations | Monitoring | ITSM | DevOps | Cloud

Zenduty

Master Kubernetes Monitoring with these Must-Track Metrics

Managing a Kubernetes cluster requires a keen eye for detail and a deep understanding of its complex structure. To ensure smooth operation of your applications and optimal performance, it is vital to monitor a wide range of metrics across the different components of your cluster. In this article, we will discuss key metrics that can be used to monitor both self-managed and cloud-managed Kubernetes environments, helping you to keep your cluster running at its best.

Zenduty - Tutorial 15 - Zenduty API and Postman Collections

Zenduty is a revolutionary incident management platform that gives you greater control and automation over the incident management lifecycle. With the Zenduty API, you can supplement and deploy Zenduty in sync with other tools and services, allowing you to create and update incidents, users, teams, services, integrations, schedules etc. and automate your workflows using simple scripts.

Scaling Your Web Application: A Guide to Scaling for High Performance

If you’re familiar with the frustration of dealing with a poorly constructed web application or the challenges of providing tech support, you understand the importance of building a high-performing and scalable web application. However, with the numerous considerations involved, it can be overwhelming to determine the starting point. This article aims to provide guidance on how to avoid common pitfalls that negatively impact user experience and waste resources.

A Complete Guide to PagerDuty Alternatives

Exploring Options for Incident Management: A Comparison of PagerDuty and Other Tools Effective incident response is crucial for managing operational issues and resolving them in a complex technology environment. With the increasing complexity of systems built from numerous services, it is important for companies to have a way to keep these systems running smoothly.

The Inevitable - Failures in Distributed Systems

Experiencing failure at scale is as the popular Marvel character Thanos would say “Inevitable”. Memory leaks, software or hardware or network I/O failures are just a few. It’s a problem of simple mathematics, the probability of failing rises as the total number of operations performed increases. With each component used to scale the application, the failure quotient increases. So how do you tackle this so-called “Inevitable” problem that comes with scaling?

"Just get on with it!" - The Horrors of Task Prioritization

Learn how to prioritize tasks, get stuff moving by performing non-blocker tasks first, effectively create postmortems, perform RCAs faster and not have an overburdened high priority(P0) dashboard. The below article should help you plan your product/feature launch faster without having to compromise on the reliability of the existing services.

What should you choose? Docker Swarm vs Kubernetes

Since the introduction of containerisation by Linux many years ago, maturity has shifted from the traditional virtual machine to these containers. These tools have made application development much easier than the initial process. Docker Swarm and Kubernetes came into action when the number of containers increased within a system, they helped orchestrate these containers. A question that arises is, which one is the better option?

Engineering Manager from a non-STEM background?

There is a long list of requirements a hiring manager looks at before hiring an Engineering Manager, there needs to be a balance between technical and leadership skills to perform well in the position. Engineering Manager roles differ from company to company. It is hard to list what a day in an engineering manager’s life looks like.