Confessions of a CTO: How we Tamed our Cloud Costs

If you’ve ever found yourself staring at a cloud bill that could buy a small island, or at least a very nice car, you're not alone. Believe me, at Cribl, we've had our share of those "Molotov cocktail" bills that make our CFO, Zach, look like he's about to spontaneously combust. And yeah, a few F-bombs might have dropped from various senior leaders (myself included, I won't lie).

Vector Databases Explained: What they are & Why they Matter [Quick Question Ep. 2]

Ever wondered what a vector database is and why it’s becoming so important in AI search? In this quick video, I’ll break down what a vector database is, how it works, and what you should consider when choosing one. About Elastic: Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale. Elastic’s solutions for search, observability, and security are built on the Elastic Search AI Platform, the development platform used by thousands of companies, including more than 50% of the Fortune 500.

Google Workspace outage: July 18, 2025

Google Workspace went down again in July 2025—but if you had asked AI tools like Google’s own AI Overviews, ChatGPT, or Claude, you would have been told everything was fine. Every one of these tools incorrectly claimed that services were up and running while users across the globe were unable to connect, send messages, or even log in.

SentinelOne outage: July 10 incident went unacknowledged

On July 10, 2025, SentinelOne, a leading cybersecurity platform, experienced a widespread outage that disrupted access to its admin consoles across multiple regions. The incident impacted users in Europe, North America, and beyond, preventing security teams from accessing critical management features. Despite the scale of the disruption, no official public acknowledgment or status update was issued by SentinelOne.

Kentik Cause Analysis in 60 Seconds

In a world where network traffic can suddenly spike, manually sifting through flow data is often a daunting task. Kentik AI's new Cause Analysis simplifies troubleshooting by quickly identifying changes in traffic by application, IP, ASN, or service. With just a few clicks, Cause Analysis helps you compare time periods, understand traffic shifts, and detect changes in your network. Kentik: Take the hard work out of running your network.

Beyond AI hype: put reliability at the forefront

Reliability is a constant for every technology, whether it’s cloud, microservices, or AI. Full transcript: Just a few years ago everybody was screaming about microservices, "That's the wave of the future," and now everybody's looking at AI. No matter what the change in technology hot topic is, your reliability should still be at the forefront of everything that you're doing.