An SRE's guide to optimizing ML systems with MLOps pipelines Feb 21, 2025 By Max Saltonstall In Google Operations As AI and ML become more prevalent, administrators can use Site Reliability Engineering (SRE) techniques to manage the ML infrastructure and software. Read Post Google Operations Blog Cloud Logging Machine Learning Monitoring SRE Read more about An SRE's guide to optimizing ML systems with MLOps pipelines
Is your platform ready for 2025? New research on platform engineering reveals the secret to success Jan 24, 2025 By Ning Ge In Google Operations Google Cloud partnered with Enterprise Strategy Group (ESG) on a research study to uncover the secrets of successful platform engineering teams. Read Post Google Operations Blog Cloud DevOps Logging Monitoring Read more about Is your platform ready for 2025? New research on platform engineering reveals the secret to success
Write better log queries, faster: Introducing pipe syntax in BigQuery and Cloud Logging Oct 9, 2024 By Sandeep Karmarkar In Google Operations BigQuery’s pipe syntax introduces an intuitive, top-down syntax for understanding data transformations, and is used in Cloud Logging Log Analytics. Read Post Google Operations Blog Cloud Logging Monitoring Read more about Write better log queries, faster: Introducing pipe syntax in BigQuery and Cloud Logging
Three steps in mapping out your modern platform strategy Oct 5, 2024 By Richard Seroter In Google Operations Are your developers using the latest AI-ready platforms to power ahead with innovation? If not, then it’s time to re-evaluate your platform strategy. Read Post Google Operations Blog Cloud DevOps Logging Monitoring Read more about Three steps in mapping out your modern platform strategy
Project management à la SRE: How to juggle the needs of your project and production Sep 28, 2024 By Karan Anand In Google Operations Most IT project management frameworks are directed at single-focus teams like software development, not multi-focus teams like SRE. Read Post Google Operations Blog Cloud DevOps Project Management SRE Read more about Project management à la SRE: How to juggle the needs of your project and production
GenOps: learning from the world of microservices and traditional DevOps Aug 31, 2024 By Sam Weeks In Google Operations GenOps is a new operational platform for Generative AI. Learn the difference between AI agents and microservices and how to implement GenOps. Read Post Google Operations AI Blog Cloud Logging Monitoring Read more about GenOps: learning from the world of microservices and traditional DevOps
Best practices for streamlining log centralization with Cloud Logging Jul 31, 2024 By Keith Chen In Google Operations Follow these best practices when using Cloud Logging to centralize and manage logs from diverse sources. Read Post Google Operations Blog Cloud Logging Monitoring Read more about Best practices for streamlining log centralization with Cloud Logging
Free to be SRE - how to use generative AI to code, test and troubleshoot your systems Jun 26, 2024 By Luis Urena In Google Operations Resources to learn generative AI concepts and how to leverage it to enhance your operational efficiency as an SRE. Read Post Google Operations AI Blog Cloud Monitoring SRE Read more about Free to be SRE - how to use generative AI to code, test and troubleshoot your systems
Free to be SRE, with this systems engineering syllabus Jun 14, 2024 By Max Saltonstall In Google Operations Learn more about systems engineering and how to get started with these key resources curated by Google’s Site Reliability Engineering (SRE) team. Read Post Google Operations Blog Cloud Logging Monitoring SRE Read more about Free to be SRE, with this systems engineering syllabus
5 more myths about platform engineering: how it's built, what it does, and what it doesn't Jun 7, 2024 By Darren Evans In Google Operations Part two of a series on platform engineering myths, covering how it’s built, what it does, and what it doesn’t do. Read Post Google Operations Blog Cloud Logging Monitoring Read more about 5 more myths about platform engineering: how it's built, what it does, and what it doesn't