Gemini Cloud Assist: Proactive cloud operations that work for you, even before you ask Apr 22, 2026 By Michael Bachman In Google Operations The redesigned Gemini Cloud Assist proactively executes tasks such as designing applications and optimizing costs that used to need human oversight. Read Post Google Operations AI Blog Cloud Logging Monitoring Read more about Gemini Cloud Assist: Proactive cloud operations that work for you, even before you ask
Is your DR plan just wishful thinking? Prove your resilience with chaos engineering Dec 9, 2025 By Deepanshu Kalra In Google Operations Controlled chaos engineering experiments that simulate real-world disasters quantitatively measure the impact of failures on system performance. Read Post Google Operations Blog Chaos Engineering Cloud Monitoring Read more about Is your DR plan just wishful thinking? Prove your resilience with chaos engineering
Application monitoring in Google Cloud: Bridging manual and AI-assisted troubleshooting Jul 19, 2025 By Dave Raffensperger In Google Operations Cloud Observability’s curated Application Monitoring dashboards improve troubleshooting with best practices from Google SREs. Read Post Google Operations AI Blog Cloud Logging Monitoring Read more about Application monitoring in Google Cloud: Bridging manual and AI-assisted troubleshooting
Introducing the new Google Cloud Trace Explorer Feb 25, 2025 By Sujay Solomon In Google Operations New UI features in Cloud Trace, part of Google Cloud Observability, make it easier to troubleshoot latency and errors in your applications. Read Post Google Operations Blog Cloud Logging Monitoring Observability Read more about Introducing the new Google Cloud Trace Explorer
An SRE's guide to optimizing ML systems with MLOps pipelines Feb 21, 2025 By Max Saltonstall In Google Operations As AI and ML become more prevalent, administrators can use Site Reliability Engineering (SRE) techniques to manage the ML infrastructure and software. Read Post Google Operations Blog Cloud Logging Machine Learning Monitoring SRE Read more about An SRE's guide to optimizing ML systems with MLOps pipelines
Is your platform ready for 2025? New research on platform engineering reveals the secret to success Jan 24, 2025 By Ning Ge In Google Operations Google Cloud partnered with Enterprise Strategy Group (ESG) on a research study to uncover the secrets of successful platform engineering teams. Read Post Google Operations Blog Cloud DevOps Logging Monitoring Read more about Is your platform ready for 2025? New research on platform engineering reveals the secret to success
Write better log queries, faster: Introducing pipe syntax in BigQuery and Cloud Logging Oct 9, 2024 By Sandeep Karmarkar In Google Operations BigQuery’s pipe syntax introduces an intuitive, top-down syntax for understanding data transformations, and is used in Cloud Logging Log Analytics. Read Post Google Operations Blog Cloud Logging Monitoring Read more about Write better log queries, faster: Introducing pipe syntax in BigQuery and Cloud Logging
Three steps in mapping out your modern platform strategy Oct 5, 2024 By Richard Seroter In Google Operations Are your developers using the latest AI-ready platforms to power ahead with innovation? If not, then it’s time to re-evaluate your platform strategy. Read Post Google Operations Blog Cloud DevOps Logging Monitoring Read more about Three steps in mapping out your modern platform strategy
Project management à la SRE: How to juggle the needs of your project and production Sep 28, 2024 By Karan Anand In Google Operations Most IT project management frameworks are directed at single-focus teams like software development, not multi-focus teams like SRE. Read Post Google Operations Blog Cloud DevOps Project Management SRE Read more about Project management à la SRE: How to juggle the needs of your project and production
GenOps: learning from the world of microservices and traditional DevOps Aug 31, 2024 By Sam Weeks In Google Operations GenOps is a new operational platform for Generative AI. Learn the difference between AI agents and microservices and how to implement GenOps. Read Post Google Operations AI Blog Cloud Logging Monitoring Read more about GenOps: learning from the world of microservices and traditional DevOps