Operations | Monitoring | ITSM | DevOps | Cloud

%term

The evolving role of SREs: Balancing reliability, cost, and innovation

A look at the expanding roles of SREs and the new skills needed: cost management and AI Imagine the CTO walks into your team meeting and drops a bombshell: "We need to cut our cloud costs by 30% this quarter." As the lead SRE, this might cause a strong reaction — isn’t your job about ensuring reliability? When did you become responsible for the company's cloud bill? If you've had a similar experience, you're not alone. The role of site reliability engineers (SREs) is evolving fast.

Critical Context: Adding Trace Quickview to Logz.io's Explore

Complexity rules the day within the world of data systems and pipelines. A goal for any observability practice is to help reduce complexity and give users and administrators a clear view of what’s happening in any system. This is the path to unified observability, a mature system where monitoring and troubleshooting are streamlined. This has been difficult to achieve for many organizations.

Distributed WordPress on Cycle and GCP

Recently I've had the great privilege of working on creating a distributed WordPress deployment that leverages GCP compute and services alongside containers running on the Cycle platform. This blog dives into a bit of the history of why WordPress is difficult to deploy in a distributed way, how we approached it, some really interesting things we found, and finally, the solution we put in place.

AWS EKS Auto Mode with Qovery - Valuable Or Not?

At Qovery, we are closely following the development of EKS Auto Mode, a new feature from AWS designed to simplify Kubernetes management by automating various foundational components. While we recognize the effort AWS has put into this, our initial evaluation shows that EKS Auto Mode is still in its early stages and does not yet offer sufficient value to be a strong consideration for our users.

Guide to Cloud Migration: From PaaS to IaaS

For scaling businesses, transitioning from PaaS (Platform as a Service) to IaaS (Infrastructure as a Service) is less about a choice and more about necessity. Staying on PaaS too long can result in skyrocketing costs, limited flexibility, and performance bottlenecks — challenges that only grow as your workloads and team scale.

Simple Talks Podcast - Episode 13 - Ending the Season Together

Today, all the hosts (Louis Davidson, Ryan Booz, Kellyn Gorman, Steve Jones, and Grant Fritchey) get together for a festive adjacent discussion about AI, content, books, community, vector, PIVOTing, observability, and we end up with a brief discussion on gluten-free gingerbread men (because we did!) All this, though absolutely none of us broke into song like a Hope/Crosby musical!

How Autonomic IT Helps Enterprises Meet the Demands of a Digital and Dynamic Business Landscape

Autonomic IT is the pinnacle of IT evolution. Inspired by the human autonomic nervous system, it refers to self-managing IT systems that autonomously monitor, optimize, and resolve issues. By integrating data, advanced AI and machine learning (ML), and automation, Autonomic IT enterprises can predict, prevent, and resolve IT issues more proactively, enhancing efficiency and reliability. However, Autonomic IT is more than just a framework for machines to fix themselves.

Beyond cost optimization: Key FinOps trends heading into 2025

Picture this: A chic Barcelona seaside hotel, gusty autumn winds, endless tapas, and the Rioja flowing. Could there be a better setting for a FinOps conversation? That was the setting for FinOpsX Europe! It brought together 500 practitioners from EMEA, APAC, and even LATAM. Everyone was in the mood to learn and interact: exchange ideas, share challenges, and find solutions. Here’s a peek at the hottest discussion topics.

Best practices for monitoring event-driven architectures

Microservices architectures empower individual teams to choose their own programming language, tools, and technologies, resulting in more independence and the ability to develop and release features faster. While there are various types of integration patterns that can facilitate microservice communication, many organizations choose to adopt event-driven architectures (EDAs) because of their scalability, agility, and resilience.