Monitor kube-state-metrics v2.0 with Datadog

To manage complex containerized applications, modern DevOps teams need deep visibility into the status of their Kubernetes resources. By listening directly to the Kubernetes API, the open source kube-state-metrics service generates key metrics about your Kubernetes objects, including pods, nodes, and deployments. These metrics are essential for understanding the status and performance of your clusters.

Top SRE Toolchain Used By Site Reliability Engineers

We have compiled a list of the most popular and sought-after tools (some of which you may have heard of) that SREs need in their toolkit. Site reliability engineering (SRE) practices help organizations keep their deliverables running smoothly, reliably, and resiliently. This is achieved with a set of well-defined tools deployed at every phase of the production system to keep up with SRE best practices.

SRE fundamentals 2021: SLIs vs. SLAs vs. SLOs

A big part of ensuring the availability of your applications is establishing and monitoring service-level metrics—something that our Site Reliability Engineering (SRE) team does every day here at Google Cloud. The end goal of our SRE principles is to improve services and in turn the user experience. The concept of SRE starts with the idea that metrics should be closely tied to business objectives. In addition to business-level SLAs, we also use SLOs and SLIs in SRE planning and practice.
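As a rough illustration of how an SLI relates to an SLO, the sketch below computes an availability SLI as the share of successful requests and compares it with a target. The function names, the request counts, and the 99.5% target are assumptions made for this example; they do not come from the article.

```python
def availability_sli(good_requests: int, total_requests: int) -> float:
    """Return the fraction of requests that succeeded (the SLI)."""
    if total_requests < 0 or good_requests < 0 or good_requests > total_requests:
        raise ValueError("request counts are inconsistent")
    if total_requests == 0:
        # Assumption: with no traffic, treat the service as fully available.
        return 1.0
    return good_requests / total_requests


def meets_slo(sli: float, slo_target: float) -> bool:
    """Return True when the measured SLI is at or above the SLO target."""
    return sli >= slo_target


# Hypothetical numbers: 9,990 of 10,000 requests succeeded, target is 99.5%.
sli = availability_sli(9_990, 10_000)
print(f"SLI = {sli:.4f}, meets 99.5% SLO: {meets_slo(sli, 0.995)}")
```

An SLA would typically sit on top of this as a contractual commitment, usually set looser than the internal SLO.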

Accelerating Code Quality with DORA Metrics

What do Google’s DevOps Research and Assessment (DORA) and Rollbar have to do with each other? DORA identified four key metrics to measure DevOps performance and identified four levels of DevOps performance from Low to Elite. One way for a team to become an Elite DevOps performer is by focusing on Continuous Code Improvement.

Improve Your Customer Operations To Increase Your NPS

If you want to improve your customer operations, turn your attention to your Net Promoter Score (NPS). Your organization's NPS should never be overlooked, because it can provide vital insights into how customers perceive your services. Those actively attempting to improve their NPS should focus on improving customer operations. Those running customer-centric businesses recognize the undeniable importance of operating around their customers' needs.
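For readers unfamiliar with how the score is derived, here is a minimal sketch of the standard NPS calculation: the percentage of promoters (ratings of 9 or 10) minus the percentage of detractors (ratings of 0 through 6) on a 0 to 10 survey scale. The function name and the sample ratings are hypothetical and not taken from the article.

```python
from typing import Iterable


def net_promoter_score(ratings: Iterable[int]) -> float:
    """Compute NPS from 0-10 survey ratings; the result ranges from -100 to 100."""
    ratings = list(ratings)
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r < 0 or r > 10 for r in ratings):
        raise ValueError("ratings must be between 0 and 10")
    promoters = sum(1 for r in ratings if r >= 9)   # ratings of 9 or 10
    detractors = sum(1 for r in ratings if r <= 6)  # ratings of 0 through 6
    # Passives (7 or 8) count toward the total but toward neither group.
    return 100.0 * (promoters - detractors) / len(ratings)


# Hypothetical survey responses, for illustration only.
print(net_promoter_score([10, 10, 9, 8, 5]))
```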

Diagnosing Database Performance Problems When You Aren't a Database Administrator

Deep specialization of IT administrators is a luxury only the largest organizations can typically afford. Smaller organizations rely on IT administrators with a more generalist skill set because they are, by necessity, responsible for a wide array of different technologies, and there simply isn’t time to specialize in the intricacies of any one of them. Yet modern IT is intricate.

Ivanti Neurons for Spend Intelligence: SaaS Management

You’re probably familiar with the many benefits of licensing software using a SaaS (Software-as-a-Service) model. The most frequently touted benefit is that you pay only for what you use when you use it. Additionally, you don’t have to worry about upgrading, or server maintenance, or security. That’s all taken care of. However, even using a SaaS model, you still need to monitor and manage costs, which can be a challenge if you are working with several suppliers.