Boston, MA, USA
Feb 25, 2020 | By Rafael Alvarez
During my career in technology, I have worked with many clients for whom security was one of the main areas of concern. There’s always room for improvement, but without a shadow of a doubt, the direction of communications and stateful firewalls are some of the very first elements to consider. When it comes to logging and audit information, a good rule of thumb is to keep a log aggregator outside the scope of the cloud provider. One great log correlation tool is Splunk.
Feb 18, 2020 | By Yuri Grinshteyn
One of the main benefits of using an all-in-one observability suite like Stackdriver is that it provides all of the capabilities you may need. Specifically, your metrics, traces, and logs are all in one place, and with the GA release of Monitoring in the Cloud Console, that’s more true than ever before. However, each of these data elements is still mostly independent, and I wanted to try to unify two of them: traces and logs.
Jan 22, 2020 | By Yuri Grinshteyn
Service Level Objectives or SLOs are one of the fundamental principles of site reliability engineering. We use them to precisely quantify the reliability target we want to achieve in our service. We also use their inverse, error budgets, to make informed decisions about how much risk we can take on at any given time. This lets us determine, for example, whether we can go ahead with a push to production or infrastructure upgrade.
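To make the relationship between an SLO and its error budget concrete, here is a minimal sketch of the arithmetic. It assumes a simple request-based SLO (good requests divided by total requests). The function names and the 25% release threshold are hypothetical, chosen for illustration only; they do not come from the post or from any Stackdriver API.

```python
def error_budget(slo_target: float, total_requests: int) -> float:
    """Requests that may fail in the window while still meeting the SLO.

    slo_target is a fraction strictly between 0 and 1, e.g. 0.999.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    # The error budget is the inverse of the SLO: the allowed failure ratio.
    return (1.0 - slo_target) * total_requests


def budget_remaining_fraction(
    slo_target: float, total_requests: int, failed_requests: int
) -> float:
    """Fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget(slo_target, total_requests)
    return 1.0 - failed_requests / budget


def can_release(
    slo_target: float,
    total_requests: int,
    failed_requests: int,
    min_remaining: float = 0.25,  # hypothetical policy threshold
) -> bool:
    """Allow a risky change only if enough error budget is left."""
    remaining = budget_remaining_fraction(
        slo_target, total_requests, failed_requests
    )
    return remaining >= min_remaining
```

With a 99.9% target over one million requests, the budget is roughly 1,000 failed requests. A team that has already seen 250 failures has about three quarters of its budget left, so under this hypothetical policy a push to production would go ahead; at 900 failures it would not.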
Apr 22, 2019 | By Charles Baer
Our Stackdriver Monitoring tool works on Google Cloud Platform (GCP), Amazon Web Services (AWS) and even on-prem apps and services with partner tools like Blue Medora’s BindPlane. Monitoring keeps metrics for six weeks, because the operational value in monitoring metrics is often most important within a recent time window. For example, knowing the 99th percentile latency for your app may be useful for your DevOps team in the short term as they monitor applications on a day-to-day basis.
Mar 27, 2019 | By Ariel Peretz
The Kubernetes ecosystem contains a number of logging and monitoring solutions. These tools address monitoring and logging at different layers in the Kubernetes Engine stack. This document describes some of these tools, which layer of the stack each one addresses, and best practices for implementation, including an example from the field, a quick start, and a demo project.
Mar 28, 2020 | By Google Stackdriver
Looking to implement SRE principles or to better understand how to optimize the reliability of your services? Yuri Grinshteyn covers the basics of Service Level Indicators and Service Level Objectives and how to use Service Monitoring to implement SLO Monitoring and alerting.
Feb 15, 2020 | By Google Stackdriver
In this episode of Stack Doctor, Yuri Grinshteyn talks about audit logs. Specifically, he goes over what is logged by default and which audit logs you have control over. He also covers the three kinds of data access audit logs and their respective functions.
Feb 1, 2020 | By Google Stackdriver
Want to get started with Stackdriver without impacting your production monitoring setup? Join Yuri Grinshteyn to learn how to use the Stackdriver Sandbox to start learning about and using Stackdriver. The Sandbox includes a GKE cluster with a sample microservices application deployed and fully instrumented for monitoring, logging, and tracing.