
How to ensure your AWS workloads are resilient

Part of the Gremlin Office Hours series: A monthly deep dive with Gremlin experts. Cloud providers like AWS give you plenty of tools to make your workloads more resilient, but it’s up to you to apply them. However, considering how complex some of these tools are, where do you start? And how can you be sure your systems are more reliable as a result?

Let Git Find the Bug for You (No Guessing)

Somewhere in your commit history, a bug snuck in. You could scroll. Panic. Guess. Or you could let Git find the exact commit that broke your code. In this episode of Wait… Git Can Do That?, we show you how git bisect binary-searches your history to isolate the problem: fast, clean, and testable. Use git bisect start, good, and bad; test each step to narrow it down; or automate it with git bisect run.
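The binary search behind git bisect can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions, not Git's actual implementation: history is modeled as a straight list ordered oldest to newest, the first commit is known good, the last is known bad, and every commit after the culprit is also bad. The names first_bad_commit, is_bad, and the commit labels c1 to c8 are hypothetical.

```python
from typing import Callable, Sequence


def first_bad_commit(commits: Sequence[str], is_bad: Callable[[str], bool]) -> str:
    """Return the earliest commit for which is_bad() is true.

    Assumes at least two commits, ordered oldest to newest, where
    commits[0] is known good and commits[-1] is known bad, and that
    once a commit is bad every later commit is bad too.
    """
    good, bad = 0, len(commits) - 1  # indices of the known-good and known-bad ends
    while bad - good > 1:
        mid = (good + bad) // 2
        if is_bad(commits[mid]):
            bad = mid   # bug already present here: the culprit is this commit or earlier
        else:
            good = mid  # bug not present yet: the culprit is later
    return commits[bad]


# Hypothetical history in which the bug was introduced in "c5".
history = [f"c{i}" for i in range(1, 9)]  # c1 (oldest) .. c8 (newest)
tested = []  # records which commits the search actually had to test


def is_bad(commit: str) -> bool:
    tested.append(commit)
    return int(commit[1:]) >= 5


culprit = first_bad_commit(history, is_bad)
print(culprit, tested)  # c5 ['c4', 'c6', 'c5']
```

With eight commits the search needs three tests instead of checking every commit in turn. In the real workflow, marking commits with git bisect good and git bisect bad plays the role of is_bad, and git bisect run lets a script give that answer automatically.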

Overview of Alerts, Real-Time Analysis, & Traceroute

Learn how Uptime.com alerts you the moment a check goes Up or Down, complete with technical details and root cause analysis for API and Transaction checks. Dive into Real-Time Analysis to track outage timelines and get detailed insight into every alert. Plus, see how Traceroute from global or private probe servers helps identify connection issues quickly and accurately. Stay informed. Respond faster. Resolve smarter.

Release v2.6: MCP Server, AI Insights Enhancement, Okta SCIM Integration, SNMP Monitoring and more.

Netdata 2.6.0 is here and it’s our most intelligent release yet! This version brings AI-powered monitoring, easier network visibility, and smoother enterprise integrations, all designed to help you troubleshoot faster and scale smarter. What's New: Netdata Referral Program. Every referred user gets a 10% discount when they subscribe to Netdata Business or Homelab, and you receive 10% of their subscription value (up to a maximum of $1,000 per space). You can refer an unlimited number of users, so there's no real limit to how much you can earn with the referral program.

How to monitor your Laravel app for critical vulnerabilities using Oh Dear

A critical security vulnerability was recently discovered in Livewire v3 that allows remote code execution, as Stephen Rees-Carter reported on Securing Laravel. While patches are released quickly, many applications remain vulnerable because developers simply don't know about the issue yet. Oh Dear's Application Health monitoring solves this by continuously checking your production environment for security vulnerabilities and immediately notifying you when issues are detected.

Being on-call at incident.io

At incident.io, we are building a product that our users rely on 24/7, all year round. This means it is crucial that it is always working, and that is where our on-call rotation comes in. We believe that everyone should be on-call because it tightens the feedback loop between shipping new features and maintaining what we have, leading to more pragmatic engineering decisions.

The Dashboard That Lets You Track the ISS in Real Time | Golden Grot Awards | Grafana Everywhere

Ruben Fernandez turned his love for space into a stunning ISS dashboard that won the Golden Grot—twice. Watch how he brings data and dreams together. Congratulations to Ruben Fernandez, our 2025 Golden Grot Award winner, recognized for this unique use case and dashboard! Grafana Cloud is the easiest way to get started with Grafana dashboards, metrics, logs, and traces. Our forever-free tier includes access to 10k metrics, 50GB logs, 50GB traces and more.