Honeycomb + Squadcast Integration: Routing Incident Alerts Made Easy

Honeycomb is an application monitoring tool that helps DevOps and SRE teams operate more efficiently by offering rich observability solutions and intuitive team collaboration. It helps you understand complex relationships within your distributed systems and troubleshoot issues accordingly. Squadcast is an end-to-end incident response tool. Built with an SRE mindset, it streamlines all incident response activities.

AWS Budgets Vs. AWS Cost Explorer: The Ultimate Comparison Guide

AWS currently offers over 200 services. Some of those make up the AWS Cost Management suite. This group comprises AWS Cost and Usage Report (CUR), AWS Budgets, AWS Cost Explorer, AWS Cost Categories, and AWS Cost Anomaly Detection. Budgets and Cost Explorer are an excellent pair of complementary tools in this group. They have similarities that cause users to wonder if they need both and, if so, why. Here is a brief overview of the differences between AWS Budgets and Cost Explorer.

How to Get Started with Heroku Logging

Heroku is a platform for deploying, running, and managing applications written in a variety of programming languages, including Python, Java, C#, JavaScript, PHP, and others. Heroku's goal is to free you up to focus on your applications rather than infrastructure management. Logging is usually included in infrastructure management. Heroku provides a high-level log maintenance tool. In this Heroku logging article, we'll learn how to get the most out of Heroku logs.

Logic App Best practices, Tips and Tricks: #5 Delete comments

Are you surprised? Are you wondering where the first four tips are? I started this series of blog posts on my blog, and you can see and read the previous Best practices, Tips, and Tricks here: And I will be sharing some of them here and others on my blog. So stay tuned for both blogs. Of course, the most recurring task is adding comments to our triggers and actions, but it is always good to know how to delete them. Some of you may be thinking that is a trivial task, as simple as adding a comment.

SRE Metrics: Four Golden Signals of Monitoring

SRE (site reliability engineering) is a discipline used by software engineering and IT teams to proactively build and maintain more reliable services. SRE is a functional way to apply software development solutions to IT operations problems. From IT monitoring to software delivery to incident response – site reliability engineers are focused on building and monitoring anything in production that improves service resiliency without harming development speed.

DevOps vs SRE - Reducing Technical Debt and Increasing Efficiency and Resiliency

This is one more blog topic stemming from the weekly office hours that we hold with the field team here at Shipa. In our last office hours, I was asked, “What is the difference between DevOps Engineers and SREs?” Both professions are emerging disciplines and cultures that continue to evolve and play an important role in technology organizations. I’ve been fortunate to have written and spoken about this before, though this post takes a fresh look at what the two domains try to accomplish.

Observability versus monitoring in software development

To supervise the behavior of distributed applications and track the origin of service failures and downtime, developers often use traditional monitoring technologies and tools. However, this approach can fall short in its ability to measure the overall health of modern cloud-native architectures, which can span multiple hosting environments and encompass hundreds of microservices.

What Is Automated Discovery and Dependency Mapping (DDM) and Why Do You Need It?

In a perfect world, your Configuration Management Database (CMDB) acts as the single source of truth for all your IT device inventory and the relationships between those devices. However, maintaining accuracy is easier said than done. That’s because the traditional method for provisioning and maintaining a CMDB is complex, unwieldy, and outdated the second it's updated. To keep up with the needs of a modern CMDB, an automated discovery and dependency mapping (DDM) solution is a must.