
The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

MQ application compatibility across a quarter century

I was working on something recently where I had to upgrade various components in the tooling. And I was getting more and more annoyed that the upgrades broke my existing programs and scripts. None of that was MQ’s fault and I’ll write more about the project once it’s available alongside the newly-announced MQ 9.3. But it got me thinking about the efforts we’ve made to keep MQ application compatibility across its lifetime.

SCOMathon 2022 | Recap

SCOMathon 2022, hosted by SquaredUp and Cookdown on May 10, packed a large amount of SCOM content into 21 sessions, 3 keynotes, and 3 panel discussions. The Microsoft SCOM team, MVPs, experts, and customers discussed all things SCOM, including the latest updates, best practice tips, specific use case scenarios, and lessons learned during their SCOM migrations. 245 attendees joined the sessions to learn and chat with their peers. As in previous years, a donation is made based on the number of participants per session.

Visualize relationships between your Kubernetes resources with Datadog Live Containers

A Kubernetes environment includes a wide range of resources—such as clusters, nodes, and pods—that work together to run dynamic applications at scale. In order to monitor a Kubernetes application effectively, you need a multi-dimensional view into your clusters’ health that encompasses the complex dependency relationships among these resources.

Ingest OpenTelemetry traces and metrics with the Datadog Agent

OpenTelemetry is a Cloud Native Computing Foundation (CNCF) initiative that provides open, vendor-neutral standards and tools for instrumenting services and applications. Many organizations use OpenTelemetry’s collection of APIs, SDKs, and tools to collect and export observability data from their environment to their preferred backend. As part of our ongoing commitment to OpenTelemetry, we are proud to have contributed our distributed tracing libraries to the CNCF community.

Cloud SQL: Concepts of Networking

Cloud SQL provides a managed service for MySQL, PostgreSQL, and SQL Server databases as well as backups, high availability, maintenance, and so much more! In this episode of Networking End to End, Lorin Price discusses networking concepts from implementation and security to connectivity on Cloud SQL. Watch along to learn about the options for deploying Cloud SQL and tips on how to determine who and what can access your Cloud SQL instance.

Event Reduction in Four Easy Ways with Cribl Stream

One of Cribl Stream’s selling points is the reduction of ingested log volume, which helps our customers control costs and improve system performance. This can be accomplished in two ways – either by eliminating duplicate or unnecessary fields and null values within the events, or controlling the number of specific events that actually get sent to the destinations through strategic filtering.
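The two approaches described above can be sketched in plain Python. This is only an illustration of the general idea and does not use Cribl Stream's own functions or configuration; the field names, the set of values treated as empty, and the debug-level filter rule are all assumptions made for the example.

```python
# Hypothetical field names and rules, chosen only for illustration.
UNNEEDED_FIELDS = {"debug_payload", "raw_copy"}
EMPTY_VALUES = (None, "", "-", "null")


def trim_event(event):
    """Approach 1: return a copy of the event without unneeded fields or empty values."""
    return {
        key: value
        for key, value in event.items()
        if key not in UNNEEDED_FIELDS and value not in EMPTY_VALUES
    }


def keep_event(event):
    """Approach 2: decide whether an event is sent on at all (assumed rule: drop debug events)."""
    return event.get("level") != "debug"


def reduce_events(events):
    """Filter out unwanted events first, then trim the fields of the ones that remain."""
    return [trim_event(event) for event in events if keep_event(event)]


events = [
    {"level": "info", "msg": "login ok", "user": None, "raw_copy": "login ok"},
    {"level": "debug", "msg": "cache probe"},
]
reduced = reduce_events(events)
```

Filtering before trimming avoids spending work on events that will be discarded anyway.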

All things logs: best practices for logging and Grafana Loki

What’s the saddest log line in the multiverse? A log line without context. That’s according to Grafana Labs software engineer and Grafana Loki tech lead Ed Welch, who joined Grafana Labs VP of Culture Matt Toback and Engineering Director Mat Ryer for the latest episode of “Grafana’s Big Tent,” our new podcast about people, community, tech, and tools around observability.

Introducing a high-usage tier for Managed Service for Prometheus

Prometheus is considered the de facto standard for Kubernetes application metrics, but running it yourself can strain engineering time and infrastructure resources when your usage grows. In March, we announced the general availability of Google Cloud Managed Service for Prometheus to help you offload that burden, and today, we’re excited to announce a new low-cost, high-usage pricing tier designed for customers who are moving large volumes of Kubernetes metrics over to the service.

Alerting on error log messages in Cloud SQL for SQL Server

With Cloud SQL for SQL Server, you can bring your existing SQL Server on-premises workloads to Google Cloud. Cloud SQL takes care of infrastructure, maintenance, and patching so you can focus on your application and users. A great way to take better care of your application is by monitoring the SQL Server error log for issues that may be affecting your users such as deadlocks, job failures, and changes in database health.

Tracing a Ruby application with OpenTelemetry for performance monitoring

Ruby on Rails is a popular MVC framework for creating web applications. Monitoring your Ruby applications for performance issues is essential, but in today’s cloud-native, microservices-based architectures, engineering teams find those issues difficult to troubleshoot. Tracing your application provides the much-needed context to track them down.