Operations | Monitoring | ITSM | DevOps | Cloud

Latest posts

Extracting Insights from Metrics with AIOps for Better Observability

In this second installment of this blog series, we’ll discuss the importance of analyzing metrics, and how AIOps helps you with this fundamental pillar of observability. Without proper metrics analysis, you’re left blind to potential outages, or possibly worse — inundated with false positive anomalies, leading to alert fatigue and ultimately business impacts. Automated discovery and analysis can’t be achieved with legacy tools nor will it scale with humans.

PowerShell and 'Fileless Attacks'

PowerShell had its beginnings as a way to enable administrators to perform their tasks both locally and remotely with unprecedented access to underlying Windows components, such as COM objects and WMI. Since being included in every major Windows Operating System since Windows 7, PowerShell based tooling is well proliferated for both legitimate and malicious use and includes common tooling such as SharpSploit, PowerSploit, PowerShell Empire, Nishang and Invoke-Obfuscation.

Debugging in production with Stackdriver Debugger - Stack Doctor

Did you know you can debug your code while it’s still in production? In this video, Yuri Grinshteyn speaks about the Stackdriver Debugger, and how you can use it with Node.js. More importantly, he talks about the two ways in which this tool can debug by creating snapshots, or logging in real-time. Product: Google Cloud Operation Suite; fullname: Yuri Grinshteyn;

The Uptime.com Report for 2019

Unplanned downtime can drive significant losses in the form of unrealized revenue. Teams may be caught off guard, or may face an outage outside their control, extending downtime hours unnecessarily. Without automated monitoring and alerting, teams face undetected outages that silently threaten SLA fulfillment. The recommendations in this report are best used as a guide on what trends may drive Site Reliability Engineering in the near term.

Splunk: A New Approach for Observability in Kubernetes Environments

While Kubernetes abstracts away many infrastructure complexities enabling DevOps teams to move faster and scale efficiently, it also introduces new operational and monitoring challenges. DevOps and SRE teams grapple with challenges in monitoring dynamic and ephemeral containerized environments. According to the latest CNCF survey, monitoring and complexity are the top inhibitors in Kubernetes adoption. Applying traditional approaches to monitoring in cloud-native environments doesn't work.

CloudZero: Online Workshop: Practical Steps To Reduce Your AWS Bill

AWS gives you access to the compute and tools you need at the press of a button-which helps you get products out the door fast. However, it can be challenging to control your costs and identify waste, especially if you're not sure where to look. In this webinar, Matt Manger, Founder of CloudZero, will discuss some practical steps you can take to assess and reduce your AWS spend.

CloudZero: How to Cut Your Technology Budget During a Downturn

You just received an order from finance telling you it's time to cut your budget, but where do you start? You know you need to make your team work leaner and smarter, but you can't let productivity suffer. In this live online panel, three industry veterans will discuss how to assess your technology investments during an uncertain climate and reduce costs without hurting your company's engineering power.

Five worthy reads: Implementing a successful remote work environment

Five worthy reads is a regular column on five noteworthy items we’ve discovered while researching trending and timeless topics. This week, we delve into how organizations are increasingly adopting a remote work model, and how they can equip themselves to build a synchronized virtual workspace. In the wake of COVID-19 and the subsequent mandates to stay at home, many organizations have implemented a remote work environment in order to maintain business operations.