Google Operations

  |  By Luis Urena
Resources to learn generative AI concepts and how to apply them to enhance your operational efficiency as an SRE.
  |  By Max Saltonstall
Learn more about systems engineering and how to get started with these key resources curated by Google’s Site Reliability Engineering (SRE) team.
  |  By Darren Evans
Part two of a series on platform engineering myths, covering how it’s built, what it does, and what it doesn’t do.
  |  By Poonam Lamba
The new GKE Compliance makes maintaining compliance for your Kubernetes clusters easier than ever before.
  |  By Darren Evans
We present five common myths about platform engineering — what it is and what it isn’t — that we've heard when folks aren't considering the whole picture.
  |  By Leonid Yankulin
Learn how to explicitly define services for use in Cloud Monitoring’s Services Overview dashboard.
  |  By Paul Nuyujukian
How Stanford researchers use Google Cloud data storage, computing and analytics to manage scientific data following DevOps principles.
  |  By Feng Li
Create microservices with gRPC with Spring, and leverage Managed Service for Prometheus and Grafana for monitoring and observability.
  |  By Lee Yanco
PromQL-based alerting policies and our command-line tool for importing dashboards from Grafana are now available in Cloud Monitoring.
  |  By David Rush
Platform engineers can influence API development by following best practices and implementing DevOps design patterns.
  |  By Google Operations
Cross-cloud networking is a common topic among Google Cloud customers as they deploy and manage workloads across multiple clouds. Watch along as Sri Nannapaneni, Customer Engineer at Google Cloud, discusses Google Cloud connectivity services and a VPC design pattern for cross-cloud and on-premises communication.
  |  By Google Operations
A robust DNS design that supports seamless name resolution across distributed workloads is important. Watch along as Sri Nannapaneni, Customer Engineer at Google Cloud, discusses Cloud DNS concepts and reviews a design pattern that customers can leverage as part of their hybrid deployment.
  |  By Google Operations
Cloud Logging’s Log Router is a powerful tool that gives you the flexibility to choose which logs are stored in Cloud Logging, sent to other Google Cloud products like Cloud Storage, or even sent to your favorite third-party product. In this video, we'll explore log sinks, aggregated sinks for centralized management, and the intercepting option to prevent duplicate log storage, equipping you with the knowledge to streamline your log management workflow in Google Cloud.
  |  By Google Operations
Cloud NGFW Enterprise empowers organizations to safeguard their cloud environments with advanced security features. In this demo, learn how Google Cloud NGFW's advanced threat protection capabilities can bolster your cloud security posture.
  |  By Google Operations
How does Cloud SQL achieve near-zero downtime? Join Debi Cabrera as she interviews Product Manager, Rahul Deshmukh. Rahul discusses the various capabilities of Cloud SQL and the best practices to maximize business continuity for applications. Watch along and hear firsthand from the session speaker about configuring and monitoring Cloud SQL for maximum availability.
  |  By Google Operations
Google Security Command Center (SCC) Enterprise is the industry’s first cloud risk management solution that fuses cloud security and enterprise security operations - supercharged by Mandiant expertise and AI at Google scale. Watch and learn how to detect threats to your cloud resources and automate attack response.
  |  By Google Operations
Join this session to discover how Duet AI in Google Cloud, an AI-powered collaborator, can boost your team’s productivity and expertise in the cloud domain. We’ll explore the powerful features of Duet AI, such as AI-driven code assistance directly in integrated development environments to increase developer productivity, AI-backed operations to help operators better manage cloud infrastructure and applications, AI-powered data exploration, and Duet AI in AppSheet, which empowers business users to build apps in the cloud. We’ll also share our vision for Duet AI, explore the future roadmap, and demo its key features.
  |  By Google Operations
How to configure alerts in Google Cloud Deploy.
  |  By Google Operations
Are you wondering how you can route your Google Cloud logs to your desired destination? Then check out this video, where we introduce log sinks, which route logs to various supported destinations, and walk you through how they work and the list of supported destinations. It covers the different use cases and scenarios where log sinks can be very useful. We’ll also demonstrate how to create and configure an aggregated log sink that sends all VPC flow logs to BigQuery.
  |  By Google Operations
Learn how companies have been innovating for business value while simultaneously tightening costs by embracing cloud FinOps at scale.

Monitoring and management for services, containers, applications, and infrastructure.

Operations aggregates metrics, logs, and events from infrastructure, giving developers and operators a rich set of observable signals that speed root-cause analysis and reduce mean time to resolution (MTTR). Operations doesn’t require extensive integration or multiple “panes of glass,” and it won’t lock developers into using a particular cloud provider.

Operations is built from the ground up for cloud-powered applications. Whether you’re running on Google Cloud Platform, Amazon Web Services, on-premises infrastructure, or with hybrid clouds, Operations combines metrics, logs, and metadata from all of your cloud accounts and projects into a single comprehensive view of your environment, so you can quickly understand service behavior and take action.