Operations | Monitoring | ITSM | DevOps | Cloud

Get started and keep using AWS for free

Getting started with AWS and adding your credit card to your own account feels scary, but there are ways to get free credits so you can sleep better in the beginning. In this article, we’ll cover some tricks and tips to get started and keep using AWS for free. Stepping into some new terrain is hard. This is already true if it’s only about learning something new.

Facebook Outage Underscores Need for Real-Time Monitoring

On October 4, Facebook and its family of apps, including Instagram and WhatsApp, suffered a global outage of its services that lasted approximately six hours. The massive outage has been blamed on configuration errors in backbone routers that are used to connect network traffic to the company’s data center. Facebook apologized to its 3.5 billion users who were unable to access any of the company’s services during the downtime.

Dashboard Fridays

We are excited to announce a new community initiative – Dashboard Fridays. Dashboard Fridays is a bite-sized video series where we share and discuss a range of different dashboards created for the community, by the community. Each video is no longer than 20min, so grab a coffee and let’s talk dashboards! Each episode, we will zoom in on one stellar dashboard put together by a member of the community.

Logs for Ops

The evolution of machine data and logging in general has shifted multiple times over the last couple of decades. The log began with Unix and was rooted in command line actions like tail or grep. It evolved from system-based logs to application-based logs and eventually became more UI-friendly and readable. Not only has the log itself evolved, but the purpose of the log and audience for the log has morphed over time as well.

Automate, Group, and Get Alerted: A Best Practices Guide to Monitoring your Code - Part 2

Missed part one? Check out the full guide here. As companies grow, so do their products, teams, and the number of external tools. For engineers, that can mean code sprawl, data silos, notification fatigue, and some “what the…?” moments along the way as they try to make sense of it all.

Grafana 8.2 released: Dynamic plugin catalog, new fine-grained access control permissions, and more

Grafana 8.2 is here! This release marks the start of our work focused on measurable improvements to Grafana’s accessibility — part of our continuing mission to democratize metrics for everyone. The initial changes to Grafana in 8.2 are focused on navigation, with more to come. We’ll be sharing more about our accessibility roadmap in an upcoming blog post.

Trigger a Kubernetes HPA with Prometheus metrics

In this article, you’ll learn how to configure Keda to deploy a Kubernetes HPA that uses Prometheus metrics. The Kubernetes Horizontal Pod Autoscaler can scale pods based on the usage of resources, such as CPU and memory. This is useful in many scenarios, but there are other use cases where more advanced metrics are needed – like the waiting connections in a web server or the latency in an API.

Trigger a Kubernetes HPA with Sysdig metrics

In this article, you’ll learn, through an example, how to configure Keda to deploy a Kubernetes Horizontal Pod Autoscaler (HPA) that uses Sysdig Monitor metrics. Keda is an open source project that allows using Prometheus queries to scale Kubernetes pods. In Trigger a Kubernetes HPA with Prometheus metrics, you learned how to install and configure Keda to create a Kubernetes HPA triggered by a standard Prometheus query.