Today we're introducing bulk editing for monitors, making it easier than ever to update multiple monitors simultaneously. This powerful new feature helps you efficiently manage your monitoring setup at scale.
In today's always-on world, your website or application is the lifeblood of your business. Downtime isn't just an inconvenience; it's a threat to your reputation, customer loyalty, and bottom line. As we highlighted in our recent article on MTTR, quickly resolving incidents is crucial. But equally important is how you communicate those incidents to your users. That's where status page templates come in.
When it comes to handling routine IT tasks, runbook automation has long played a central role. Traditionally designed to schedule and execute jobs across systems like ERP and CRM platforms, these tools were essential in an era when batch processing and time-based triggers ruled the day. But the world has changed. Modern IT environments demand real-time responsiveness, intelligent automation, and event-driven execution.
Bricking your entire fleet with one bad update? That’s every engineer’s worst OTA fear. François Baldassari shares the two must-dos for shipping updates safely: staged rollouts and real-time observability. Catch issues early, before your customers ever notice.
"CFEngine: The agent is in" is our monthly webinar series, where we show new features, teach best practices, and keep the community informed about everything CFEngine.
If you haven’t heard already (which would be shocking considering the numerous posts I’ve seen on Reddit) Opsgenie’s end of life is right around the corner. This means there is no better time for Opsgenie users to explore alerting and on-call management tools outside of the limited alternatives provided by Atlassian. So, I felt now is a better time than any to address the needs of those affected by the dissolution of Opsgenie and reveal why OnPage should be your new platform of choice.
Picture a busy Monday morning. You are working on leftover projects from the previous week, and assuming everything is fine with your applications as you had not received support tickets during the weekend. All of a sudden, during the middle of the day, you get a flood of reports from users who complain about slow response in your application and error pages piling up. You and your team are scrambling hard to figure out the issue.
As modern IT systems grow in complexity, IT operations teams have to work harder to ensure reliability. "What gets measured gets managed" is a management mantra that emphasizes the role of metrics in management. To ensure everything works well, operations teams need service-level objectives (SLOs). This industry term measures how an application meets the agreed-upon quality and reliability standards, serving as a bellwether of good software.
Starlink moves beyond being strictly a direct-to-consumer service provider with the recent activations of its Community Gateways. In recent months, Starlink has become a transit provider to a small but growing number of service providers in remote parts of the world as its unique and groundbreaking service continues to evolve.
Decode website loading sequences with Todd Gardner's essential guide to waterfall charts in this Concepts of Web Performance tutorial. Perfect for entry-level web developers struggling with slow websites, this video demystifies those intimidating colored bars you've seen in Chrome DevTools, WebPageTest, and monitoring tools like Request Metrics. Learn to interpret the crucial elements of waterfall charts—from request queuing and waiting times to content downloading phases—all visualized on a timeline measured in milliseconds. Discover how to identify two major performance bottlenecks.