Google Operations

Feb 2, 2023   |  By Kyle Benson
Google Cloud Ops Agent now supports monitoring GCE instances with Prometheus.
Jan 19, 2023   |  By Afrina M
Cloud Logging’s Log Analytics, with advanced search, as well as aggregation and transformation of all log data types, is now generally available.
Jan 18, 2023   |  By Dave Stanke
Learn more about the connection between SRE, DevOps and reliability.
Jan 14, 2023   |  By Daniella Villalba
Learn more about how culture is the true driver of DevOps success.
Nov 16, 2022   |  By Cat Chu
Understand how to calculate the composite reliability of your cloud infrastructure to help design Cloud architectures with an optimal SLA.
Nov 15, 2022   |  By Joy Wang
In-context dashboards for Google cloud storage, customizable, create alert, view logs for better storage system insights at project level and bucket level.
Nov 8, 2022   |  By Varun Krovvidi
Maintain high uptime and performance for your APIs without any overheads using Google Cloud’s API monitoring tools.
Oct 31, 2022   |  By Charles Baer
Cloud Logging launched Log Analytics powered by BigQuery. The top 10 reasons to get started with Log Analytics for no additional cost
Oct 19, 2022   |  By Afrina M
Flexera’s State of the Cloud Report 2022 pointed out that significant cloud spending is wasted, a major issue that is getting more critical as cloud costs continue to rise. In the current macroeconomic conditions, companies focus on identifying ways to reduce spending. To effectively do that, we need to understand the pricing model. We can then work towards the challenges of cost monitoring, optimization, and forecasting.
Oct 11, 2022   |  By Michael McGrath
Organizations and their software delivery pipelines are continually exposed to growing cyberattack vectors. Coupled with the massive adoption of open source software, which now helps power nearly all of our public infrastructure and is highly prevalent in most proprietary software, businesses around the world are more vulnerable than ever. Today’s organizations need to be more vigilant in protecting their software development infrastructure and processes.
Jan 6, 2023   |  By Google Operations
Are you looking to learn how to send alerts from Cloud Monitoring to your custom notification service? In this video, we share the different ways of processing notifications from Cloud Monitoring. Watch this video to learn the steps involved in sending the notifications from Cloud Monitoring using Cloud Run to your custom notification service, including a description of the sample notification service and of the Cloud Run code.
Dec 15, 2022   |  By Google Operations
Terraform state is critical for monitoring and keeping track of configuration changes. Check out this short to learn more about Terraform state and its importance.
Dec 13, 2022   |  By Google Operations
Cloud Armor allows you to easily monitor your data and have peace of mind that your policies are running correctly. In this episode of Go Deep with Google Cloud Armor, we cover preconfigured and custom dashboards, Security Command Center, and using Looker for more powerful dashboarding to get even better insights from your Cloud Armor data. Watch to learn how you can use Google Cloud Armor for all your monitoring needs!
Dec 8, 2022   |  By Google Operations
Would you like to know how to manage your workloads efficiently? Are you interested in learning how to share the slots across different workloads/projects?
Dec 6, 2022   |  By Google Operations
Have you ever received an unexpected Cloud Alerting incident? Would you like to learn how to prevent unexpected alerts? In this video, we cover some key concepts related to Alert Policy configuring in Google Cloud Monitoring. We’ll show you how to troubleshoot two unexpected incidents one on Metric and one on Log based metric alerting policies and explore configuration improvements to prevent future false alerts.
Oct 21, 2022   |  By Google Operations
Welcome back to GKE Essentials! In this episode, Kaslin Fields explores a key element of your GKE observability: Google Cloud Managed Service for Prometheus. Watch to see how Google Cloud's fully managed multi-cloud solution for Prometheus lets you globally monitor and alert on your workloads without having to manually manage and operate Prometheus at scale.
Oct 12, 2022   |  By Google Operations
Google Cloud CEO Thomas Kurian shares insights on how businesses are using cloud technology to build for the future and adapt to complexities, challenges, and opportunities.
Sep 30, 2022   |  By Google Operations
Are you interested to know about alerts in Cloud Monitoring? Would you like to know how to create metric based alerts for Google cloud products through cloud monitoring? In this video we introduce you to Alerts in Cloud Monitoring, how it works, the different types of alerting policies. Watch this video to learn how to create metric based alerts for Google cloud products.
Sep 12, 2022   |  By Google Operations
As an SAP system administrator, you've probably asked yourself: why did my Compute Instance restart? Why did Pacemaker restart my instance? Why did/didn’t my SAP system failover? By streaming Pacemaker logs into Cloud Logging, you can now find the answers to these questions by using a Cloud Logging query template to filter out the noise generated by Pacemaker logs.

Monitoring and management for services, containers, applications, and infrastructure.

Operations aggregates metrics, logs, and events from infrastructure, giving developers and operators a rich set of observable signals that speed root-cause analysis and reduce mean time to resolution (MTTR). Operations doesn’t require extensive integration or multiple “panes of glass,” and it won’t lock developers into using a particular cloud provider.

Operations is built from the ground up for cloud-powered applications. Whether you’re running on Google Cloud Platform, Amazon Web Services, on-premises infrastructure, or with hybrid clouds, Operations combines metrics, logs, and metadata from all of your cloud accounts and projects into a single comprehensive view of your environment, so you can quickly understand service behavior and take action.