Operations | Monitoring | ITSM | DevOps | Cloud

Autoscaling Made Easy with Rancher Cluster API

Kubernetes has revolutionized application deployment and management. However, manually adjusting cluster sizes to meet fluctuating workloads, without constantly under- or over-provisioning resources, quickly drains platform teams’ time and energy. While traditional cloud provider autoscaling tools are functional, they often fall short when it comes to truly dynamic, Kubernetes-aware scaling, especially in a world with diverse infrastructure.

Getting started with the relaxAI API: Sovereign, cost-effective AI

Earlier this year, we launched relaxAI, an AI assistant designed with one paramount focus: your privacy. We’re now excited to announce the relaxAI API is in General Availability (GA) offering an OpenAI interface. This gives UK organizations up to 90% cost savings versus leading providers while ensuring data never leaves UK jurisdiction.

Stop guessing! Speedscale's Notebook finds anything in your traffic.

Debugging complex microservices just got an upgrade. This video demonstrates Speedscale's innovative Notebook capability, allowing you to perform advanced substring searches and filter production traffic based on deeply nested JSON fields within request and response bodies. Unlike traditional observability tools that only record telemetry, Speedscale's always-on recorder captures full traffic payloads, empowering you to precisely pinpoint issues, identify specific user calls, or validate API versions. Streamline your troubleshooting, enhance your testing, and gain unprecedented visibility into your production environment.

Top 10 Status Page Examples: What We Like and What's Missing

A great status page does more than show uptime—it builds trust, communicates clearly during incidents, and empowers users to stay informed. Here are 10 standout examples of public status pages, with a quick breakdown of what they do well—and where there’s room for improvement.

Speedscale: Avoid Regulatory Icebergs with Traffic Replay, and Save Money

It has never been more critical to establish a solid foundation for regulatory compliance. Regulations govern a wide range of functions. Some of them are obvious, such as health and human services, patient data, medical devices, and credit payments. Some of them are less obvious, especially with the ever-changing definition of what constitutes private and identifiable data. This article provides an overview of regulatory compliance challenges and the hidden risks organizations face beneath the surface.

The Right Way to Automate SMS in Your Ops Stack

SMS messaging continues to be one of the most effective channels for time-sensitive communication. In operations environments where every second counts, it offers a direct line to the right person, fast. But too often, SMS is bolted onto workflows as an afterthought, leading to unreliable delivery, alert fatigue, and unnecessary manual overhead. Automation solves most of these problems-but only when done correctly.

StatusHub Q2 2025: API v3 Preview, Analytics Visibility & Subscriber Quota Alerts

In Q1, we introduced API v3 as an upcoming release. In Q2, while the full release is still in progress, parts of the new StatusHub API V3 are available and ready for use. We are also highlighting a small but useful improvement: embedded analytics dashboards in the control panel can now be restricted to owners only.

API Security: Validating Auth and Access with Traffic Simulation Starts with Behavior

Security breaches rarely begin with a hidden zero-day exploit or a complex web of escalated hacks. They often start in very simple ways – an internal team member is breached, a permission is misconfigured, an overly permissive API endpoint is overlooked, or a JWT simply doesn’t expire. An API, or application programming interface, is a set of protocols and tools that enable different software systems to communicate and exchange data, making them essential in modern software development.

Checkly Is Now Available in the AWS Marketplace

If your team runs on AWS, getting new tools into your workflow isn’t just about functionality. It’s about how quickly you can procure, integrate, and see value. With Checkly now available on AWS Marketplace, monitoring doesn’t have to be an exception. This launch means Checkly fits into your procurement flow the same way it fits into your CI/CD: seamlessly. No vendor approval bottlenecks, no procurement delays, just faster access to the tools your developers already want to use.

The Silent API Killer: Data Coupling in Your Tests

In API testing, speed, accuracy, and confidence in test results are everything. Regardless of whether you’re validating functionality, testing performance under load, or ensuring compliance with your security posture and standards, the ultimate goal is the same: catching problems before they reach production. But what if your tests are lying to you? Lurking beneath even the most sophisticated test suites is a subtle, pervasive threat: data coupling.

Enable Kong Gateway Tracing in 5 Minutes

Kong Gateway is a popular API gateway that sits at the edge of your infrastructure, routing and shaping traffic across microservices. It’s fast, pluggable, and battle-tested, but for many teams, it remains a black box. You might have OpenTelemetry set up across your application stack. Traces flow from your app servers, databases, and third-party APIs. But the moment a request enters through Kong, observability drops off.

Build Log Automation with Last9's Query API

Manual log investigation is one of those engineering tasks that quietly drains hours without offering much real value. You're debugging an incident. Monitoring shows elevated error rates. Now begins the familiar drill: It’s a tedious cycle, and it doesn’t scale. The whole process breaks down when you’re trying to automate incident response, run continuous security monitoring, or generate compliance reports.

API Staging Is Not Production - But Speedscale Makes It Close

Staging environments are often looked at as the testing ground ahead of the “real” production environment. The idea is simple – build a duplicate of your production environment, run your tests, and ship with confidence. But the reality of using staging in the real world as part of a holistic API testing strategy is rarely that clean. No matter how meticulously you mirror production services, staging always falls a little short.

Generating end-to-end tests with AI and Playwright MCP

When I started using Playwright, there was a single command that blew me away. I immediately became (and still am) a huge Playwright Codegen fanboy. Playwright's codegen command opens up a browser window, and whatever you do in this window will be recorded. Navigating URLs, clicking links, and filling out form elements—the Playwright inspector records all your actions and generates a Playwright test for you. Magic!
Sponsored Post

Boba Paradox

It's 2PM on a Thursday. Your engineering team is knee-deep in bugs from a recent release. But what's the Slack channel buzzing about? Not flaky tests. Not integration coverage. Not mocking services. It's whether to order brown sugar boba or taro with oat milk. Let's be honest: for many companies, it's easier to justify $8 on boba than $800 on testing tools. And we're not here to judge-we're here to understand why.

From Guesswork to Guarantees: How Traffic Replay Improves Release Confidence

In modern software development, the pressure to move fast is matched only by the need to get it right. Teams working within the software development lifecycle (SDLC) must constantly balance velocity and quality, ensuring releases are stable, secure, and performant. Traditional software development models often relied on manual verification and human intuition to validate releases; however, as systems have grown in complexity, guesswork is no longer sufficient to meet these rising needs.

Streamline API testing with Proxy Mock! Capture, mock, and replay API calls locally

Alan Mon introduces Proxy Mock, a powerful tool for capturing and replaying API calls. Learn how to effortlessly record inbound and outbound API requests and responses. The demonstration highlights how Proxy Mock operates entirely on your local machine, eliminating the need for cloud services or internet connectivity for testing. See how to set up Proxy Mock, inspect captured API calls (including request/response headers, body, and unique signatures), and leverage it to mock API responses for seamless local testing, ultimately boosting productivity and reducing the need for costly non-production environments.

The Defense-in-Depth Approach To Application Monitoring

In cybersecurity, defense-in-depth is a fundamental principle – you never rely on a single security measure to protect your systems. The same philosophy applies to application monitoring. No single monitoring approach, no matter how sophisticated, can capture every possible failure mode of your application. This is why layered monitoring isn't just a best practice – it's essential risk mitigation.

Announcing Checkly Uptime Monitors: Simple, Scalable, and Built for Developers

When Checkly launched, it was the first of its kind, enabling developers to monitor complex workflows easier than ever using the automation tooling (Playwright, Terraform, etc) they already knew and loved. We’ve helped detect and resolve issues for 1000s of companies—ranging from monitoring crucial log-ins, to purchasing products, to setting up client instances for millions of monthly users But what about the simpler stuff?

Running #playwright Tests in Multiple Environments with Checkly. #sdet #devops

Learn how to efficiently run Playwright tests across different environments without rewriting them. This tutorial covers managing environment variables in Checkly for API and browser checks, handling global and group-specific settings, and integrating with CI/CD processes. Discover the best practices for setting up environment variables, duplicating test groups, and customizing alerts to ensure your checks are environment-specific.

AI-Powered Monitoring with Checkly

Most monitoring tools weren't built for the AI-first world. By nature, traditional monitoring platforms force you out of your natural coding environment and trap you in clunky web interfaces, brittle configuration panels, and rigid APIs. And sadly, when monitoring providers do offer "AI features," it's usually a chatbot bolted onto their existing UI, being nothing more than a pale imitation of the AI tools you’re reading about every day on Hacker News. All this creates friction.