Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Release Roundup March 2024: More ways to discover and test your services

2024 is off to a fast start here at Gremlin. Since our last release roundup, we’ve released new experiment types, new features to improve integration with cloud platforms, and improvements to our auto-detection processes. Now you can push processes to their limits, find dependencies even easier, limit when tests can be run, and much more. We also introduced a slew of platform improvements to improve efficiency, performance, and user experience in the Gremlin web application.

SOC 2 Compliance Requirements: Examples, Use Cases + More

SOC 2 compliance requirements (Service Organization Controls Type 2) ensure that customer data stays private and secure — essential for any business that stores or processes sensitive data. In this blog, we’ll explore the specifics of SOC 2 compliance, and provide a solution to help you automate and enforce SOC 2 compliance going forward.

Easy Guide to Monitor Jenkins Jobs Using Telegraf and MetricFire

Monitoring Jenkins jobs and nodes is foundational to maintaining a robust, efficient, and secure CI/CD pipeline. It enables DevOps teams to stay proactive about system health, optimize performance, manage resources effectively, and adhere to security and compliance standards. In this article, we'll detail how to use the Telegraf agent to collect performance metrics from your Jenkins environment, and forward them to a datasource.

Introducing Process Exhaustion: How to scale your services without overwhelming your systems

We rarely think about how many processes are running on our systems. Modern CPUs are powerful enough to run thousands of processes concurrently, but at what point do our systems become oversaturated? When you’re running large-scale distributed applications, you might reach this limit sooner than you'd expect. How can you determine what that limit is, and how does that affect the number and complexity of the workloads you deploy?

Application Troubleshooting with Automated Root Cause Analysis

In the complex and fast-paced world of application deployment, getting a handle on the tangle of services and resources can sometimes feel like trying to find your way through a maze without a map. And if something goes wrong, trying to find out what's happening where is even more difficult. With alert emails flooding in and questions flying left and right, identifying the glitch that's causing issues can seem like a Herculean feat.

Centralized Multi-Cluster Management and Operations

Join us for our webinar on centralized multi-cluster management and operations, during which we’ll explore how to manage multiple clusters across various regions. This will teach you how to unify access control and authorization layers, ensuring secure, streamlined operations. Our panel will then dive into the challenges of maintaining consistent application deployment across all regions through a single console.

The Future Of Cloud Cost Management: AI And Machine Learning On AWS

As organizations increasingly migrate to the cloud, managing expenses efficiently becomes crucial. Traditional cost management methodsoften fall short in this environment, where resource allocation and usage can fluctuate dramatically. Enter Artificial Intelligence (AI) and Machine Learning (ML). These cutting-edge technologies are revolutionizing the way businesses approach cloud cost management.

Qovery is Now Available on the AWS Marketplace

I'm thrilled to announce the availability of Qovery on the AWS Marketplace. You can now buy and benefit from Qovery right from the AWS Marketplace. Before delving into the specific advantages of purchasing Qovery through the AWS Marketplace, let's first understand what the AWS Marketplace is and why this is something you should consider when purchasing Qovery.