Latest Posts

Manage logs from multiple clouds and on-premises workloads together

May 8, 2020 By Craig Lee In Google Operations

We’ve heard from our customers that you need visibility into metrics and logs from Google Cloud, other clouds, and on-prem in one place. Google Cloud has partnered with Blue Medora to bring you a single solution that saves time and money by managing all of your logs together. Google Cloud’s operations management suite gives you the same scalable core platform that powers all internal and Google Cloud observability.

Find and fix issues faster with our new Logs Viewer

Apr 13, 2020 By Rami Shalom In Google Operations

Monitoring your cloud infrastructure is an essential part of making sure your operations are running smoothly. Since announcing the new Cloud Logging interface in February, we’ve heard from users that the new interface is making it faster and easier to meet logging needs, including troubleshooting issues, verifying deployments, and ensuring compliance. One of those users, Arne Claus, is a site reliability engineer at trivago, and has taken advantage of the new interface already.

Use SRE principles to monitor pipelines with Cloud Monitoring dashboards

Mar 11, 2020 By Charles Baer In Google Operations

Data pipelines provide the ability to operate on streams of real-time data and process large data volumes. Monitoring data pipelines can present a challenge because many of the important metrics are unique. For example, with data pipelines, you need to understand the throughput of the pipeline, how long it takes data to flow through it, and whether your data pipeline is resource-constrained.

Use the Dashboard API to build your own monitoring dashboard

Mar 6, 2020 By Charles Baer In Google Operations

Using dashboards in Cloud Monitoring makes it easy for you to track important system metrics. Creating dashboards by hand in the Monitoring UI can be a time-consuming process, especially if you want to use them in multiple Monitoring Workspaces. With the recent GA announcement for the Cloud Monitoring dashboards API, you now have a way to programmatically create dashboards.
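
As a rough illustration of what defining a dashboard in code can look like, the sketch below builds a dashboard definition as a plain Python dictionary and serializes it to JSON, so the same definition can be reused across Workspaces. The helper name `build_dashboard` is hypothetical, and the field names (`displayName`, `gridLayout`, `xyChart`, `timeSeriesFilter`) reflect our understanding of the Dashboard resource; check them against the API reference before relying on them. Sending the JSON to the API's dashboard create method is not shown.

```python
import json


def build_dashboard(display_name: str, metric_type: str) -> dict:
    """Return a dashboard definition with one line chart for a single metric.

    Hypothetical helper: the structure is an assumption based on the
    Dashboard resource and should be verified against the API reference.
    """
    widget = {
        "title": f"{metric_type} over time",
        "xyChart": {
            "dataSets": [
                {
                    "timeSeriesQuery": {
                        "timeSeriesFilter": {
                            # Monitoring filter selecting the metric to chart.
                            "filter": f'metric.type="{metric_type}"',
                            "aggregation": {"perSeriesAligner": "ALIGN_MEAN"},
                        }
                    }
                }
            ]
        },
    }
    return {
        "displayName": display_name,
        "gridLayout": {"widgets": [widget]},
    }


if __name__ == "__main__":
    dashboard = build_dashboard(
        "CPU overview",
        "compute.googleapis.com/instance/cpu/utilization",
    )
    # The JSON text is what would be submitted for each Workspace.
    print(json.dumps(dashboard, indent=2))
```

Keeping the definition in a function means one reviewed template can be stamped out per Workspace instead of being rebuilt by hand in the UI.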

Stackdriver Push to Splunk

Feb 25, 2020 By Rafael Alvarez In Google Operations

During my career in technology, I have dealt with many clients for whom security was one of the main areas of concern. There’s always room for improvement, but without a shadow of a doubt, communications direction and stateful firewalls are some of the very first elements to consider. When it comes to logging and audit information, as a rule of thumb, it’s good to have a log aggregator stored outside of the scope of a cloud provider. A great log correlation tool out there is Splunk.

All together now: our operations products in one place

Feb 24, 2020 By Raghu Nandan In Google Operations

Our suite of operations products has come a long way since the acquisition of Stackdriver back in 2014. The suite has constantly evolved with significant new capabilities since then, and today we reach another important milestone with complete integration into the Google Cloud Console. We’re now saying goodbye to the Stackdriver brand, and announcing an operations suite of products, which includes Cloud Logging, Cloud Monitoring, Cloud Trace, Cloud Debugger, and Cloud Profiler.

Integrating Tracing and Logging with OpenTelemetry and Stackdriver

Feb 18, 2020 By Yuri Grinshteyn In Google Operations

One of the main benefits of using an all-in-one observability suite like Stackdriver is that it provides all of the capabilities you may need. Specifically, your metrics, traces, and logs are all in one place, and with the GA release of Monitoring in the Cloud Console, that’s more true than ever before. However, each of these data elements is still mostly independent, and I wanted to try to unify two of them: traces and logs.

Introducing the Stackdriver Cloud Monitoring dashboards API

Feb 18, 2020 By Brian Corwin In Google Operations

Using dashboards in Stackdriver Cloud Monitoring makes it easy to track critical metrics across time. Dashboards can, for example, provide visualizations to help debug high latency in your application or track key metrics for your applications. Creating dashboards by hand in the Monitoring UI can be a time-consuming process, which may require many iterations. Once dashboards are created, you can save time by using them in multiple Workspaces within your organization.

Logging + Trace: love at first insight

Feb 14, 2020 By Mary Koes In Google Operations

Meet Stackdriver Logging, a gregarious individual who loves large-scale data and is openly friendly to structured and unstructured data alike. Although they grew up at Google, Stackdriver Logging welcomes data from any cloud or even on-prem. Logging has many close friends, including Monitoring, BigQuery, Pub/Sub, Cloud Storage, and all the other Google Cloud services that integrate with them. Recently, however, they have been looking for a deeper relationship to find insight.

SLOs with Stackdriver Service Monitoring

Jan 22, 2020 By Yuri Grinshteyn In Google Operations

Service Level Objectives, or SLOs, are one of the fundamental principles of site reliability engineering. We use them to precisely quantify the reliability target we want to achieve in our service. We also use their inverse, error budgets, to make informed decisions about how much risk we can take on at any given time. This lets us determine, for example, whether we can go ahead with a push to production or infrastructure upgrade.
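
To make the relationship between an SLO and its error budget concrete, here is a minimal sketch of the arithmetic, assuming a request-based SLO in which every request counts as either good or bad. The function names and the example numbers are hypothetical and are not taken from Service Monitoring itself.

```python
def error_budget(slo_target: float, total_events: int) -> float:
    """Number of events that may fail while still meeting the SLO target."""
    if not 0.0 < slo_target <= 1.0:
        raise ValueError("slo_target must be in the range (0, 1]")
    # The error budget is the inverse of the SLO: the allowed failure share.
    return (1.0 - slo_target) * total_events


def budget_remaining(slo_target: float, total_events: int, bad_events: int) -> float:
    """Error budget left after subtracting the failures seen so far."""
    return error_budget(slo_target, total_events) - bad_events


def can_take_risk(slo_target: float, total_events: int, bad_events: int) -> bool:
    """True while some error budget is left, e.g. before a production push."""
    return budget_remaining(slo_target, total_events, bad_events) > 0


if __name__ == "__main__":
    # Illustrative numbers: a 99.9% target over one million requests
    # allows roughly 1,000 failures in the period.
    target, total, failed = 0.999, 1_000_000, 400
    print("budget:", error_budget(target, total))
    print("remaining:", budget_remaining(target, total, failed))
    print("ok to push:", can_take_risk(target, total, failed))
```

With 400 failures against a budget of about 1,000, there is still room for a risky change; once failures exceed the budget, the same check argues for holding the push.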
