Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Observabilty for complex systems and related technologies.

The Confident Commit ep. 10 | Observability improving speed and reliability with Ben Sigelman

Rob sits down with Lightstep CEO, Ben Sigelman to discuss observability and how it connects with delivering change with confidence. Get answers to questions like: Watch, learn, and leave us a comment with your thoughts, questions, or ideas for future podcast episodes. And don't forget to Like and Subscribe to The Confident Commit Podcast playlist for alerts to new episodes published biweekly. The Confident Commit: A podcast for developers, engineering managers, and business leaders alike to join in the conversation on how to deliver software better and faster.

An Introduction to Distributed Tracing

There’s no strict definition of a distributed system. But generally speaking, if you have reached a point where you’re running more than five interdependent services at once, that means you’re running a distributed system. It also means you are more than likely experiencing difficulties when troubleshooting using traditional debugging tools. Unfortunately, pulling up multiple tools, each built for a monolithic world, doesn’t help pinpoint the problem.

Serverless observability and real-time debugging with Dashbird

Systems run into problems all the time. To keep things running smoothly, we need to have an error monitoring and logging system to help us discover and resolve whatever issue that may arise as soon as possible. The bigger the system the more challenging it becomes to monitor it and pinpoint the issue. And with serverless systems with 100s of services running concurrently, monitoring and troubleshooting are even more challenging tasks.

The Fast & The Foolproof: Automation & Observability For DevOps

When software teams are charged with delivering higher quality software, faster - how do you effectively enable collaboration and observability while eliminating risk and manual processes? In this webinar, Ali Sardar from JFrog and Rob Jahn from Dynatrace will address how to overcome these challenges and unlock speed, observability, and automation across your DevOps lifecycle. In addition to best practices shared by our speakers, you will also see both products in action - meeting the critical needs of development and operations teams.

Introducing the Honeycomb plugin for Grafana

Over the years, we’ve heard many versions of the same familiar story: large businesses struggling with observability data living in several different systems. At Grafana Labs, our “big tent” philosophy is based on the belief that our users should determine their own observability strategy and choose their own tools. Grafana allows them to bring together and understand all their data, no matter where it lives.

How Developers Can Benefit from Observability | IAmDevloper and Splunk's Mark Woods

DevOps teams have felt pressure from all sides to innovate faster and keep services reliable. The growing complexity of applications and cloud infrastructure create more challenges for everyone, but the tools that developers and SRE teams require have been disconnected - keeping everyone from working as an efficient team. IAmDevloper and Splunk’s Chief Technical Advisor EMEA, Mark Woods discuss how observability can help break down silos and promote agility.

Model-driven observability: Taming alert storms

In the first post of this series, we covered the general idea and benefits of model-driven observability with Juju. In the second post, we dived into the Juju topology and its benefits with respect to entity stability and metrics continuity. In this post, we discuss how the Juju topology enables grouping and management of alerts, helps prevent alert storms, and how that relates with SRE practices.