Operations | Monitoring | ITSM | DevOps | Cloud

Visualizing Logs Alongside Metrics: A Practical Use Case

Security threats aren’t always loud and don’t always crash systems or trigger alarms. Sometimes they creep in quietly as a steady stream of unauthorized login attempts, slow brute-force probes, or unknown IPs scanning your server for vulnerabilities. These behaviors often show up in logs before they surface in metrics but if you're only watching logs or only tracking metrics, you're missing part of the story.

Keep an eye on remote access to your Kubernetes infrastructure with Datadog Workload Protection

To improve efficiency and reduce cloud spending, teams frequently schedule pods on Kubernetes nodes dynamically, based on available resources. However, this practice has also introduced a new security challenge: The workloads maintained by a development team are now spread between Kubernetes nodes, exposing more hosts and increasing the blast radius when user credentials are compromised.

Tracing asynchronous systems in your event-driven architecture: When to use parent-child vs. span links

Asynchronous communication patterns are commonly used in distributed systems, especially in those that rely on events or messages to coordinate activity. Rather than responding to direct API calls like in a traditional request-response architecture, services in an asynchronous system produce, route, or consume events and messages independently.

How to build reliable and accurate synthetic tests for your mobile apps

Mobile applications offer increased flexibility to both users and developers. Users can access content on a wide range of devices, operating systems, and network types, while developers can leverage touch screens and orientation-based layouts to create more responsive features. However, all of these factors create new testing challenges. To ensure a good user experience (UX), developers have to test their apps across many device models and platforms, which can become costly and time-consuming.

Deletion protection in Grafana Cloud: a simple way to safeguard your observability stack

We’ve all had that “uh-oh” moment. You press Enter and your blood runs cold, as you realize you just deleted something critical. For engineering teams, this type of disaster takes many forms. For example, maybe you used a DELETE statement without a WHERE clause to delete a row in a database, and accidentally deleted all of them instead. To protect you from the accidental deletion of critical resources in Grafana Cloud, we’re introducing a feature called deletion protection.

Powering What's Next: ScienceLogic's Vision for Intelligent, Outcome-Driven IT

The observability market is changing rapidly. The days of simply collecting logs, metrics, and traces are giving way to something bigger: delivering actionable intelligence that actually connects IT operations with business goals. Organizations don’t just want to know what’s happening anymore; they need to understand why it’s happening, what actions to take, and whether their systems can respond independently.

Common Unity errors and how to fix them

Unity has a reputation for handing out surprises: the play-mode freeze just after a hot-reload, the sudden sea of pink materials, or the stack trace that politely reminds you your transform was null all along. Rather than letting those moments derail the rest of your sprint, this post rounds up four of the most common runtime offenders, and shows you exactly how to trigger, spot, and fix each one.

Migrate in any Season: Seamless Apache Kafka Migration to Aiven with the Migration Accelerator

Finding yourself migrating and weighing the options when it comes to moving your entire Apache Kafka setup to a new cloud? This blog will point you to a solution to efficiently migrate your workloads to an Aiven for Apache Kafka cluster circumventing the headache that usually comes with cloud migration.

A local fix just spreads the problem

“You fixed a bug in QA — great! But did that fix go into version control and get tested and deployed everywhere? If not, you just created drift, and more problems down the line.” Peter Kruis, Microsoft SQL Engineer at Monin Fixing a bug in the environment where it appears feels like progress, but without a proper process, it creates fragility everywhere else.

Break it early to ship it safely

“We want developers to break things – just not for the customers. If all our tests are green, I get nervous that we’re not testing deep enough.” Naga Santhosh Reddy Vootukuri, Principal Software Eng. Manager, Microsoft Azure SQL Naga Santhosh, Sunny to most, leads a team that ships changes to Azure SQL databases worldwide. Those deployments must be fast, frequent, and invisible to customers. That kind of reliability doesn’t come from playing it safe during development.