Operations | Monitoring | ITSM | DevOps | Cloud

Microsoft Entra ID Outage: How Vantage DX Detected the Issue Before Microsoft Acknowledges the Issue

On February 25, 2025, at 11:32 AM EST, Martello’s Vantage DX monitoring began alerting on an issue affecting Microsoft Entra ID (Azure AD SSO). While Microsoft had not yet acknowledged the incident, online reddit forums had noted the issue and our Vantage DX proactive monitoring detected disruptions impacting authentication across multiple workloads. See here the critical warning for Exchange in Vantage DX Monitoring. Here is the critical warning for OneDrive and SharePoint in Vantage DX.

Optimizing AWS NAT Gateway Usage

AWS NAT Gateways are essential for private subnet access but can quickly become a costly burden, even when idle. With Kentik, cloud and network engineers gain deep visibility into NAT Gateway traffic, allowing them to identify underutilized gateways, analyze high-cost usage, and explore cost-saving alternatives like VPC Endpoints, Internet Gateways, or direct peering.

Data sources, visualizations, and apps: A guide to extending and customizing Grafana

Grafana’s extensibility has always been one of the keys to its success. It comes with a wide range of data sources that allow you to query your data no matter where it lives, visualizations to help you quickly make sense of that data, and apps that can provide complete observability solutions, all in a single package.

What does reinventing Data Center Infrastructure Management (DCIM) software mean? #dcim #datacenter

"Growth, growth, growth." At Hyperview, we’ve reinvented data center infrastructure management (DCIM) by embracing a product-led growth (PLG) strategy. This means we work closely with users to create features that actually matter to them. With updates rolling out every five weeks, we’re always finding ways to make the platform even better. Plus, we’ve teamed up with data center leaders like Panduit and nVent to bring even more value to our shared customers. Finally, we’re expanding our reach to serve customers better, adding APAC and UAE regions to host their Hyperview instance and data.

Boosting IT Efficiency: How to Do More With Less

IT teams are constantly asked to do more with limited resources and budgets. Is your IT team’s monitoring strategy keeping up? Thankfully, these challenges aren’t impossible to overcome. Check out this exclusive webinar where Greg Collins, Product Marketing Manager at Progress, and Jason Alberino, Principal Product Manager at Progress, will share tips on accomplishing your IT goals with less.

Last Mile Automation: Going from Alerts to Action

In today’s digital-first world, IT teams rely on a vast array of tools to monitor, manage, and optimize infrastructure. Network monitoring tools, security platforms, IT service management (ITSM) solutions, and observability stacks provide real-time insights into digital environments’ health and performance. But there’s a catch—most of these tools stop at alerting.

Retail's GenAI Edge: Profitable Use Cases Beyond Chat Bots

Who doesn’t love a virtual try-on when shopping online or a quick scan in the physical store that tells exactly when their favorite item will be available in the store? These everyday conveniences, powered by AI, once seemed like science fiction. Traditional AI has already revolutionized retail - from computer vision managing inventory to machine learning predicting demand.

Automating Government Compliance Requirements

Government compliance regulations are becoming more complex every year. For businesses, staying compliant means balancing a growing list of laws and policies while facing tighter budgets, limited resources, and increasing scrutiny. Failing to comply isn’t just risky—it can result in hefty fines, reputational damage, and operational inefficiencies. This is where automation can be a game-changer.

PagerDuty Operations Cloud Spring 25 Release: Reimagining Operations in the Age of AI and Automation

Operational excellence isn’t just a goal—it’s critical for survival for all companies. And, when powered by AI and automation, it’s a strategic competitive differentiator. With over a decade of AI and ML experience in our platform, PagerDuty pioneered the Incident Response space. And now, PagerDuty is redefining what modern operations can look like in the era of AI and automation.