Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Cloud monitoring, security and related technologies.

Why and How to Monitor Amazon OpenSearch Service

Some time ago, AWS forked ElasticSearch, the most popular search engine on the planet. They had some struggles with the maintainer of ElasticSearch and decided it was time to part ways. So, with OpenSearch, there is now a new kid in town. Well, not new, but at least some kind of alternative.

PagerDuty Apps for AWS + Automated Diagnostics Demo Highlights (3 min.)

"Reduce downtime and customer impact with service ownership while enabling teams to drive continuous improvement and innovation Learn about how you can modernize and optimize your operations with our enterprise-grade set of AWS integrations. Automate incident response with PagerDuty’s Runbook Automation and learn about our new set of AWS plugins and prebuilt jobs that make it easier to get up and running with auto-diagnostics."

Defining and measuring your SLIs and SLOs

Customers expect that online services are available all the time. The truth is that outages happen to almost everyone because providing 100% service availability is challenging and costly. Creating reliable and profitable service is, amongst other things, finding the balance between application availability, costs and time to market. Faster feature delivery means less availability as constant changes to production may cause issues and introduce bugs.

How I monitor cloud application costs in one simple but powerful dashboard

Although there are many great tools out there to get on top of application monitoring, there’s one vital metric that’s often overlooked by us technical folks – cost. In the days of running apps on servers in private datacenters, the kit was a one-time purchase that the systems team had to deal with. But running apps in public clouds is a different story. Whether you’re running on VMs, containers in Kubernetes, or entirely serverless, execution of your code adds to the bill.

Sponsored Post

What Are Runbooks and How Does It Apply to Network Operation Centers (NOCs)?

Much like in other production environments, the production of cloud services is based on and orchestrated by a plethora of tools-making part of cloud services' overall cloud infrastructure. Given how cloud services are as complex as they are intricate, a vast range of detailed steps need to be performed in a certain order for the production environment to run smoothly, whether it's carrying out maintenance procedures, updates and upgrades, or resolving issues to prevent downtime.

Introducing Cloud Logging - Log Analytics, powered by BigQuery

Logging is a critical part of the software development lifecycle allowing developers to debug their apps, DevOps/SRE teams to troubleshoot issues, and security admins to analyze access. Cloud Logging provides a powerful pipeline to reliably ingest logs at scale and quickly find your logs. Today, we’re pleased to announce Log Analytics, a new set of features in Cloud Logging available in Preview, powered by BigQuery that allows you to gain even more insights and value from your logs.