Operations | Monitoring | ITSM | DevOps | Cloud

%term

Microsoft Outage MO842351: Understanding Impact & Scope Saves You From Raising Unnecessary Alarm Bells

Just ten days after the last major Microsoft 365 outage, Microsoft reported another incident at 8:48 am on July 30, 2024. The message on X was vague, offering limited details about the scope and impact of the problem. This left many IT teams preparing for what they anticipated would be another rocky day.

Internal Developer Portal Integration: A Comparison of Direct and Agent-Based Models

You're here because you’re considering how an Internal Developer Portal (IDP) can help your organization build or extend an engineering system of record, set and maintain software standards, and achieve self-service with pre-approved templates, enabling developers to build the right things in the right way, faster than ever. Autonomously. You might be surprised (maybe not) to learn that the IDP you choose directly impacts the success of your implementation.

Understand your Kubernetes cost drivers and the best ways to rein in spending

In the previous blog post in this two-part series, we discussed the critical signals you need to monitor in your Kubernetes environment to ensure optimal resource provisioning. These signals include high CPU and memory utilization, frequent pod evictions, slow application performance, and other indicators that your resources are over- or under-provisioned. Monitoring these signals is essential for maintaining an efficient, cost-effective, and environmentally sustainable Kubernetes environment.

Drive Operational Excellence featuring PagerDuty Advance

Build operational excellence with PagerDuty. Watch this demo to see how the latest innovations for the PagerDuty Operations Cloud come together to help a team tackle a major incident related to a database upgrade. You’ll see how PagerDuty Advance capabilities work in concert with new functionality built for modernizing operations centers, standardizing automation at scale, and transforming incident management. The result? Improved innovation velocity, reduced operating costs, and better customer experiences.

PagerDuty Advance | Generative AI for PagerDuty Operations Cloud

Introducing PagerDuty Advance: GenAI for critical operations work. For every step of the incident lifecycle. For scaling your teams. For sustaining customer experiences. For moving business forward – faster. Work more efficiently. Protect more revenue. Build greater operational resilience. PagerDuty Advance helps operations teams manage business-impacting issues in seconds, not hours. From event to resolution, PagerDuty Copilot’s automations help you resolve issues faster, reduce risk, and control costs.

Achieving Autonomic IT: Your Journey to Highly Efficient Operations and Elevated Business Performance

In today’s fast-paced digital business landscape, IT service management teams face immense pressure to swiftly adapt to new technologies and meet stringent SLAs. To ensure optimal customer experiences and drive business growth, organizations need an approach that goes beyond current AIOps and semi-autonomous market offerings – they need Autonomic IT. Imagine a self-managing IT environment that monitors and optimizes technology investments as it runs.

Staying on Top: Nexthink's Continuous Pursuit of Excellence

"It's tough to get out of bed to do roadwork at 5 am when you've been sleeping in silk pajamas." This quote from boxing champion Marvin Hagler, I feel, perfectly encapsulates the relentless drive needed to sustain excellence in any endeavor. It speaks to Hagler’s vigilance against complacency, an ethos that resonates deeply with us at Nexthink, especially as we celebrate our 20th anniversary and our ongoing status as a Leader in the Forrester Wave.

The MING Stack: What It Is and How It Works

The Internet of Things (IoT) is rapidly reshaping the world. From smart devices in our homes to connected sensors in industrial settings, the amount of data generated is rapidly increasing. But what use is this data if we can’t collect and analyze it in real-time to gain key insights? This is where the MING stack (which includes Mosquitto/MQTT, InfluxDB, Node-RED, and Grafana) comes in. This powerful combination of open-source tools is intended to simplify IoT data management.