Operations | Monitoring | ITSM | DevOps | Cloud

Automate server restarts in SCOM with the Opslogix Autonomous Maintenance Mode Management Pack

Automate server restarts in SCOM with the Opslogix Autonomous Maintenance Mode Management Pack Server restarts are routine, but in SCOM they often result in unwanted alerts if not handled properly. The Opslogix Autonomous Maintenance Mode Management Pack addresses this by automatically managing maintenance mode during restarts, minimizing false alerts and improving operational efficiency.

Status Page Aggregator: Best Practices and Use Cases

A status page aggregator is a powerful tool that brings together the status updates of multiple cloud services, SaaS providers, and third-party services into a single, unified view. Whether you’re tracking the health of critical dependencies like AWS, Cloudflare, or niche SaaS tools your teams rely on, a status page aggregator simplifies monitoring and helps you stay ahead of outages.

Making AI scalable with database change management and Redgate Flyway

With the rise of AI and machine learning comes data. Lots of it. For organizations today, AI is radically changing the way data is accessed, maintained and operationalized. For heads of architecture and development teams, it offers opportunity and responsibility.

When Will We See the First $1 Billion Company Run by a Single Individual?

It’s only a matter of time. OpenAI CEO Sam Altman said in 2024 that he thought this could be achieved by the end of 2026. Personally, I feel this is a little optimistic; however, based on the evidence I’ve seen, it won’t be long after that. Consider Telegram: a global messaging giant with just 30 employees, already achieving a remarkable $1 billion in revenue. Or Midjourney, revolutionizing creative industries with only 40 employees and generating an impressive $500 million.

Can Claude Code Observe Its Own Code?

One of the great things about OpenTelemetry is that it’s a standard, and standards tend to proliferate. I was excited to see Claude Code add OpenTelemetry metric and log support in a recent release. What was really interesting—beyond the ability to capture usage data from Claude Code—is that you can also get pretty detailed logs about what you’re doing with Claude Code.

Departed M365 Users

When someone leaves your organization, the first step IT usually takes is to disable their Microsoft 365 account. But have you ever stopped to ask: The answer might surprise you. If you’re not actively managing this, Microsoft will automatically delete that data — often in as little as 30 days. This post explains exactly what gets deleted (and when), why this is a problem, and what you can do to protect that data — without paying for unnecessary licenses.

The Dos and Don'ts of Successful Software Rollouts

Launching new enterprise software is one of the most strategic—but risk-laden—internal initiatives any organization can undertake. Done right, it accelerates transformation, streamlines operations, and boosts employee productivity. Done wrong, it can paralyze teams, spike IT tickets, and erode employee trust in the tools they’re given and the teams that support them.

Elephant Flows: The Hidden Heavyweights of AI Data Center Networks

Elephant flows are no longer rare. They’re foundational to AI workloads. In today’s GPU-heavy data centers, long-lived, high-volume flows can distort ECMP, overflow buffers, and rack up unexpected cloud bills. Kentik helps you see and tame these elephants with real-time flow analytics, automated alerting, and predictive capacity planning.