Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Test Observability with Sumo Logic

The software industry has seen many evolutions. There is a new disruption in the market every five years or so. Software testing cannot remain isolated from all the latest trends and technologies. Testing strategies need to keep up with agile development, faster deployments and increasing customer demand for reliability and user-friendly interfacing. They need to be able to grow just as quickly and just as reliably as the business logic.

Datadog's commitment to OpenTelemetry and the open source community

The OpenTelemetry (OTel) project is an open source initiative with the goal of providing vendor-neutral standards and tools that enable users to collect telemetry from any source in their environment and send it to any backend. A core tenet of Datadog is to provide a single, unified platform for customers to easily collect and monitor all of their observability data, regardless of where it comes from.

Sumo Logic platform video

Sumo Logic SaaS analytics platform makes the world's applications reliable and secure 24x7x365. Learn how Sumo Logic ingests data at scale, helps find and troubleshoot issues fast, and secures user experiences. We integrate with hundreds of out-of-the-box apps, making it easy and seamless to get more from your data quickly. Whether your data resides in multiple clouds or on-premises, now you can monitor, troubleshoot and secure your apps from ONE platform powered by logs.

Webinar Recap: How Observability Impacts SRE, Development, and Security Teams

In today’s fast paced and constantly evolving digital landscape, observability has become a critical component of effective software development. Companies are relying more on and using machine and telemetry data to fix customer problems, refine software and applications, and enhance security. However, while more data has empowered teams with more insights, the value derived from that data isn’t keeping pace with this growth. So how can these teams derive more value from telemetry data?

Outages Happen. Now What?

Network outages happen more often than you think. We may not experience them directly or even know they're occurring at all. When outages affect household names like Facebook, Amazon, Microsoft, and others, however, we're sure to find out after the fact that there was an issue. Depending on the user's activities and the duration of the issue, stress and frustration levels can vary. When a marketer can’t get that ground-breaking advertisement up on Facebook, they can get antsy.

A beginner's guide to Kubernetes application monitoring

Application performance monitoring (APM) involves a mix of tools and practices to track specific performance metrics. Engineers use APM to monitor and maintain the health of their applications and ensure a better user experience. This is crucial to high quality architecture, development, and operations, but it can be difficult to achieve in Kubernetes since the container orchestration system doesn’t provide an easy way to monitor application data like it does for other cluster components.

The Great Debate of 2023: Single Vendor vs Best of Breed Solutions

The debate between single vendor solutions and best of breed approaches has been ongoing for decades in the technology industry. Engineers have always sought out options and choice, and this has led to a shift in the dominance of large vendors in each stage of technological development. As soon as IBM sold enterprises the mainframe solution, engineers started to look for other options.

Using AIOps for automation and efficiency in observability and IT operations

Artificial intelligence for IT Operations (or AIOps) has been playing an expanding role in helping SREs, DevOps, and developers effectively navigate the challenges around application and infrastructure complexity, pace of change, and data volume that characterize the operations landscape.

Complete Guide to Distributed Tracing with OpenTelemetry - Part II

In the previous article, we learned what distributed tracing is, why it is necessary, how to do tracing, encountered challenges with existing tracing tools, and finally discovered that there is a more mature option available for the industry to adopt in terms of telemetry and observability. In this article, we will be trying to understand OpenTelemetry in more depth. To begin, we will examine how OpenTelemetry addresses some of the issues confronting the observability ecosystem.