AI in SRE: Where and how Google is deploying agentic AI to improve operations May 29, 2026 By Stevan Malesevic In Google Operations With SRE AI, Google plans to fully adopt AI and agentic technologies, leveraging AI as a force multiplier while also maintaining control. Read Post Google Operations AI Blog Cloud Logging Monitoring SRE Read more about AI in SRE: Where and how Google is deploying agentic AI to improve operations
Gemini Cloud Assist: Proactive cloud operations that work for you, even before you ask Apr 22, 2026 By Michael Bachman In Google Operations The redesigned Gemini Cloud Assist proactively executes tasks such as designing applications and optimizing costs that used to need human oversight. Read Post Google Operations AI Blog Cloud Logging Monitoring Read more about Gemini Cloud Assist: Proactive cloud operations that work for you, even before you ask
Is your DR plan just wishful thinking? Prove your resilience with chaos engineering Dec 9, 2025 By Deepanshu Kalra In Google Operations Controlled chaos engineering experiments that simulate real-world disasters quantitatively measure the impact of failures on system performance. Read Post Google Operations Blog Chaos Engineering Cloud Monitoring Read more about Is your DR plan just wishful thinking? Prove your resilience with chaos engineering
Application monitoring in Google Cloud: Bridging manual and AI-assisted troubleshooting Jul 19, 2025 By Dave Raffensperger In Google Operations Cloud Observability’s curated Application Monitoring dashboards improve troubleshooting with best practices from Google SREs. Read Post Google Operations AI Blog Cloud Logging Monitoring Read more about Application monitoring in Google Cloud: Bridging manual and AI-assisted troubleshooting
Introducing the new Google Cloud Trace Explorer Feb 25, 2025 By Sujay Solomon In Google Operations New UI features in Cloud Trace, part of Google Cloud Observability, make it easier to troubleshoot latency and errors in your applications. Read Post Google Operations Blog Cloud Logging Monitoring Observability Read more about Introducing the new Google Cloud Trace Explorer
An SRE's guide to optimizing ML systems with MLOps pipelines Feb 21, 2025 By Max Saltonstall In Google Operations As AI and ML become more prevalent, administrators can use Site Reliability Engineering (SRE) techniques to manage the ML infrastructure and software. Read Post Google Operations Blog Cloud Logging Machine Learning Monitoring SRE Read more about An SRE's guide to optimizing ML systems with MLOps pipelines
Is your platform ready for 2025? New research on platform engineering reveals the secret to success Jan 24, 2025 By Ning Ge In Google Operations Google Cloud partnered with Enterprise Strategy Group (ESG) on a research study to uncover the secrets of successful platform engineering teams. Read Post Google Operations Blog Cloud DevOps Logging Monitoring Read more about Is your platform ready for 2025? New research on platform engineering reveals the secret to success
Write better log queries, faster: Introducing pipe syntax in BigQuery and Cloud Logging Oct 9, 2024 By Sandeep Karmarkar In Google Operations BigQuery’s pipe syntax introduces an intuitive, top-down syntax for understanding data transformations, and is used in Cloud Logging Log Analytics. Read Post Google Operations Blog Cloud Logging Monitoring Read more about Write better log queries, faster: Introducing pipe syntax in BigQuery and Cloud Logging
Three steps in mapping out your modern platform strategy Oct 5, 2024 By Richard Seroter In Google Operations Are your developers using the latest AI-ready platforms to power ahead with innovation? If not, then it’s time to re-evaluate your platform strategy. Read Post Google Operations Blog Cloud DevOps Logging Monitoring Read more about Three steps in mapping out your modern platform strategy
Project management à la SRE: How to juggle the needs of your project and production Sep 28, 2024 By Karan Anand In Google Operations Most IT project management frameworks are directed at single-focus teams like software development, not multi-focus teams like SRE. Read Post Google Operations Blog Cloud DevOps Project Management SRE Read more about Project management à la SRE: How to juggle the needs of your project and production