Latest Posts

How we slashed detection and resolution time in half (Salt Security)

Jul 24, 2023 By Helios In Helios

Salt Security had deployed OpenTelemetry but found it insufficient. So the company's engineers evaluated Helios, which visualizes distributed traces for fast troubleshooting. My role as the Director of Platform Engineering at Salt Security lets me pursue my passion for cloud-native tech and for solving difficult system-design challenges. One of the recent challenges we solved had to do with visibility into our services. Or lack thereof.

Debugging and troubleshooting microservices in production - All you need to know

Jul 23, 2023 By Helios In Helios

What do you do when things break in production? Debugging microservices isn’t a walk in the park. Microservices are designed to be loosely coupled, which makes them more scalable and resilient, but also harder to debug. When a problem occurs in a microservices app, tracking down its source can be difficult. When the problem is in production, the clock is ticking and you have to resolve it fast.

Lambda monitoring: Combining the three pillars of observability to reduce MTTR

Jul 13, 2023 By Yaron Dinur In Helios

Observability & monitoring can be challenging when it comes to distributed applications, serverless architectures being a typical example of that. As with any other service that we run, we need to understand how our Lambda functions are executed, how to identify issues, and how to optimize performance.

API latency in microservices - Trace based troubleshooting

Jul 9, 2023 By Helios In Helios

In microservices architectures, apps are broken down into small, independent services that communicate with each other using APIs in a synchronous or asynchronous way.

How we combined OpenTelemetry traces with Prometheus metrics to build a powerful alerting mechanism

Jun 28, 2023 By Ran Nozik In Helios

One of the qualities of engineering team excellence is thinking outside the box to find creative solutions to hard problems. It’s our responsibility, as dev leaders, to pass on to the next generations of developers tips and tricks to help them look beyond the surface to solve complex business problems and leverage the power of the open source community, when possible.

OpenTelemetry .NET Distributed Tracing - A Developer's Guide

Jun 26, 2023 By Helios In Helios

Modern applications are becoming increasingly distributed due to a wide range of benefits, including enhanced scalability, high availability, fault tolerance, and better geographical distribution. But distribution also makes the overall system more complex, which makes it challenging to understand how its components function internally. Distributed tracing helps address this by tracking how requests flow through the various system components and providing detailed insights.

Serverless observability, monitoring, and debugging - Overview and best practices

Jun 11, 2023 By Helios In Helios

Serverless, as you may already know, is a cloud computing model where the cloud provider dynamically manages and allocates resources to execute code, without the need for server provisioning or infrastructure management by the developer. This article gives an overview of serverless observability, monitoring, and debugging, based on distributed tracing and OpenTelemetry (OTel).

API monitoring vs. observability in microservices - Troubleshooting guide

Jun 4, 2023 By Helios In Helios

Monitoring APIs through enhanced observability has gained traction with the popularity of microservices. Since microservice applications are built as independent and scalable modules, the number of microservices can grow dramatically as the application grows, increasing the complexity drastically. Since APIs work as the connective tissue between microservices, the number of APIs also grows in parallel.

Kafka monitoring: Message brokers observability and troubleshooting

May 29, 2023 By Aviv Kerbel In Helios

Message brokers like Kafka enable microservices to scale. But this same quality makes them hard to troubleshoot. How can developers avoid messages and errors getting stuck in oblivion? In this post we look at a few solutions: Kafka Owl, Redpanda, and Helios.

Distributed tracing Node.js - OpenTelemetry-based monitoring

May 24, 2023 By Helios In Helios

As the trend toward microservices-based architectures continues to gain momentum, it’s becoming increasingly clear that distributed tracing will be a crucial tool for monitoring and debugging these complex systems in the future. When designing a microservices-based architecture, breaking extensive services into smaller, more manageable components is standard practice. Communication between these components becomes crucial, but finding the root cause can be challenging when issues arise.
