Google Operations

Jun 24, 2020   |  By Charles Baer
Logging is a critical component of your cloud infrastructure and provides valuable insight into the performance of your systems and applications. On Google Cloud, Cloud Logging is a service that allows you to store, search, monitor, and alert on log data and events from your Google Cloud Platform (GCP) infrastructure services and your applications. You can view and analyze log data in real time via Logs Viewer, command line or Cloud SDK.
Jun 22, 2020   |  By Titouan Rigoudy
Around the world, operations teams are working to automate their monitoring and alerting workflows, looking to reduce the time they spend on rote operational work (what we call “toil”), so they can spend more time on valuable work. For instance, Google’s Site Reliability Engineering organization aims to keep toil below 50% of an SRE’s time, freeing them up to work on more impactful engineering projects.
Jun 12, 2020   |  By Joy Wang
Earlier this year, we added a Dashboard API to Cloud Monitoring, allowing you to manage custom dashboards and charts programmatically, in addition to managing them with the Google Cloud Console. Since then, you’ve asked us to provide more sample dashboard templates that target specific Google Cloud services. Many of you have also asked us to provide a Terraform module to help you set up an automated deployment process.
Jun 8, 2020   |  By Chris Law
As a cloud project owner, you want your environment to run smoothly and efficiently. At Google Cloud, one of the ways we help you do that is through a family of tools we call Recommenders, which leverage analytics and machine learning to automatically detect issues and present you with optimizations that you can act on.
Jun 3, 2020   |  By Rami Shalom
Logs are an important part of troubleshooting and it’s critical to have them when you need them. When it comes to logging, Google Kubernetes Engine (GKE) is integrated with Google Cloud’s Logging service. But perhaps you’ve never investigated your GKE logs, or Cloud Logging? Here’s an overview of how logging works in GKE, and how to configure, find, and interact effectively with the GKE logs stored in Cloud Logging.
Jun 27, 2020   |  By Google Operations
Are you operating on a hybrid-cloud or multi-cloud architecture and want to standardize SLOs, observability, and alerting across your platforms? In this video, Yuri Grinshteyn shows you common architecture patterns for a hybrid observability approach. Watch to learn how you can standardize observability across multiple cloud providers!
May 30, 2020   |  By Google Operations
Tracing is a great way to monitor your services, but how does one go about fixing latency issues in a specific service? In this episode of Stack Doctor, Yuri Grinshteyn shows you how to connect traces with logs via OpenTelemetry and Cloud Trace and Logging, enabling you to pinpoint and debug service latency issues in a snap!
May 16, 2020   |  By Google Operations
In the last episode, we showed you how to use OpenTelemetry for tracing to gauge how requests traverse your service. In this episode of Stack Doctor, we show you how to use OpenTelemetry’s metric function, allowing you to define the metrics you want to capture and improve the observability of your Node.js application.
May 2, 2020   |  By Google Operations
Want to measure the latency of user requests and know how long each microservice takes to return a response? In this episode of Stack Doctor, we’ll walk you through how to use OpenTelemetry for tracing, and show how it reveals the path your requests take through your service and how much each service contributes to overall latency.
Apr 18, 2020   |  By Google Operations
Did you know you can debug your code while it’s still in production? In this video, Yuri Grinshteyn talks about the Stackdriver Debugger and how you can use it with Node.js. More importantly, he covers the two ways this tool can debug: by creating snapshots or by logging in real time.