Operations | Monitoring | ITSM | DevOps | Cloud

What is Java Performance Monitoring? [A Guide to DevOps Engineers]

You rolled out a Java application that worked fine in development. Fast, clean, no errors. However, once it went into production, things began to change. Suddenly, the app feels slow. CPU usage climbs without warning. Some users start getting timeouts. You check the dashboards, but nothing jumps out. You look through the logs, but it's mostly noise. And then the questions start coming in - "Is the JVM the problem?" If you've been in that situation, you're not alone.

What to expect in a Gremlin workshop

Gremlin workshops give your team hands-on training with Gremlin so they can get real results and dramatically improve your reliability. Full transcript:  The goal of our workshops is really to accelerate you and the team in your reliability journey. Whether you're starting out for the first time, or you're a more advanced user, this workshop is really designed for you to take you to the next level.

Why don't Kafka and Iceberg get along?

Kafka and Iceberg is a costly marriage of inconvenience. If you write code for a living you’ve probably heard of Apache Iceberg - but you might not realise the detour your Kafka events must take to get there. Typically a Kafka message written to an Iceberg table must take a journey via a connector, rack up transfer fees, and idle in a sidecar before it appears as an Iceberg table—hardly the friction‑free flow open table formats promise.

Advanced Proactive SSL Certificate Monitoring

eG Enterprise version 7.5 introduces advanced capabilities for detailed SSL Certificate Monitoring including monitoring for web servers and apps using SSL. Monitoring SSL certificates is essential to ensure secure connections, prevent service outages, and maintain user trust. Here are a few things you need to monitor and questions you should ask to keep your services and apps running reliably and securely.

Securely query data sources on your Tailscale network using Private Data Source Connect in Grafana Cloud

Balancing security with your observability needs can be a difficult task. We know our users want to leverage platforms like Grafana Cloud to visualize and gain valuable insights into their data, while also keeping their data sources private and secure.

SMS alerts enabled for Early Warning Signals

When service disruptions happen, every second counts. That’s why we’re excited to announce a major update to StatusGator: Early Warning Signals are now available via SMS. Early Warning Signals have already been helping teams stay ahead of outages via email and Slack alerts — and now, with SMS support, you can get real-time notifications directly on your phone, even before incidents are publicly acknowledged.

Automating Linux Disk Expansion with Resolve: Add & Extend VM Disks in Minutes!

Running into disk space issues on your Linux servers or virtual machines? In this step-by-step demo, we show how Resolve’s powerful automation platform can help you automatically add and expand disk space on Linux systems, eliminating manual processes, reducing human error, and improving operational efficiency. In this video, you’ll learn how to: Technologies Featured: Whether you're a system admin, IT operations engineer, or automation specialist, this demo highlights how to streamline critical disk management tasks that normally require elevated access and technical knowledge.

Automate Disk Space Management on Windows with Resolve

Struggling with managing disk space issues on your servers or virtual machines? See how you can use Resolve to automate disk space addition and expansion on Windows systems, saving time, reducing manual errors, and eliminating the need for high-level administrative access. In this video, you'll learn how Resolve automates the process of: Whether you’re a system admin, IT operations engineer, or automation enthusiast, this demo highlights how you can streamline infrastructure tasks using intelligent automation.

Zoom Video Communications Uses PagerDuty to Keep Video Conferencing Frictionless for Every Customer

Zoom Video Communications is a video conferencing company on a mission to make video communications frictionless for all. Eric Yuan, CEO and founder of Zoom, and Alex Guerrero, Senior Manager of SaaS Operations, dive into why their teams have adopted PagerDuty as their end-to-end incident management platform. Companies trust Zoom for their video conferencing services and, according to Yuan, “Our business counts on PagerDuty.”

WebMMU: Multimodal and Multilingual Evaluation of Agent Reasoning on Web

Welcome to the AI research bites. This series of short and informative talks showcases cutting-edge research work from ServiceNow AI Research team. The AI Research Bites are open to all, especially those interested in keeping up with the fast-paced AI research community. Modern web agents can read, but few can see holistically. Despite rapid progress in multimodal LLMs, today's models falter when asked to visually ground UI elements, reason over DOM structures, or edit complex layouts across diverse languages and domains. WebMMU is our attempt to course-correct.