The latest news and information on cloud monitoring, security, and related technologies.

Platform as a Service (PaaS): A guide

Cloud computing has conquered our lives, taking us from massive on-premises systems and storage hubs to fully virtualized storage platforms. Today, organizations are rapidly reengineering their strategies to be cloud-friendly, which has driven rapid growth in the cloud migration rate. Studies say worldwide investment in cloud infrastructure services rose to $41.8 billion in the first quarter of 2021, because cloud transformation always brings a twofold benefit.

How CloudZero Manages Cloud Costs During Our Product Discovery Process

Like any SaaS organization, we at CloudZero care about our COGS, margin, and unit cost. As you can imagine, we have a fair amount of insight into how those numbers are changing every day thanks to being “customer zero” on our cloud cost intelligence platform. Talking with other companies, I often find that the best organizations understand that measuring the cost implications of new products and features only after you’ve deployed them can be too late.

Verify GKE Service Availability with new dedicated uptime checks

Keeping the experience of your end user in mind is important when developing applications. Observability tools help your team measure the performance indicators that matter to your users, like uptime. It’s generally good practice to measure your service internally via metrics and logs, which can give you indications of uptime, but an external signal is very useful as well, wherever feasible.

Orchestration in Telcos: the multi-vendor and multi-cloud environments...

As NFV migration becomes commonplace, it is apparent that there is a need for a higher degree of software management, smoother upgrades, and smoother deployment processes. The complexity of the migration has deterred telcos from adoption. There should be a solution that helps businesses manage and deploy network automation, orchestration, and managed services. In general, a telco network is complex and needs to be managed from multiple perspectives.

Monitor and troubleshoot your VMs in context for faster resolution

Troubleshooting production issues with virtual machines (VMs) can be complex and often requires correlating multiple data points and signals across infrastructure and application metrics, as well as raw logs. When your end users are experiencing latency, downtime, or errors, switching between different tools and UIs to perform a root cause analysis can slow your developers down.

Distributed tracing with OpenTelemetry and Cloud Trace

As more services are involved in serving user traffic and completing transactions, how does each service contribute to overall latency? In this episode of Engineering for Reliability, we’ll show how to use distributed tracing to capture the latency of user requests and how long it takes each service in the path to return a response. Watch to learn how to capture latency in distributed applications using OpenTelemetry and analyze it using Cloud Trace.
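
To make the idea of per-service latency concrete, here is a minimal sketch in Python using only the standard library. It is not the OpenTelemetry API or Cloud Trace: `ToyTracer`, `handle_request`, and the service names `frontend`, `auth`, and `database` are hypothetical, and the sleeps stand in for real work. It only illustrates how nested spans record how long each step of a request takes.

```python
import time
from contextlib import contextmanager


class ToyTracer:
    """Records nested spans for one request (illustration only, not OpenTelemetry)."""

    def __init__(self):
        self.spans = []    # finished spans, in completion order
        self._stack = []   # names of spans that are currently open

    @contextmanager
    def span(self, name):
        # The innermost open span, if any, is the parent of the new one.
        parent = self._stack[-1] if self._stack else None
        self._stack.append(name)
        start = time.perf_counter()
        try:
            yield
        finally:
            duration = time.perf_counter() - start
            self._stack.pop()
            self.spans.append({"name": name, "parent": parent, "seconds": duration})


def handle_request(tracer):
    # Hypothetical services; the sleeps stand in for real work.
    with tracer.span("frontend"):
        with tracer.span("auth"):
            time.sleep(0.01)
        with tracer.span("database"):
            time.sleep(0.02)


tracer = ToyTracer()
handle_request(tracer)

# Print each span with its parent and its latency in milliseconds.
for s in tracer.spans:
    print(f'{s["name"]:<10} parent={s["parent"]} {s["seconds"] * 1000:.1f} ms')
```

Child spans finish before their parent, so the `frontend` span is recorded last and its duration covers both `auth` and `database`. A real tracing library additionally propagates trace context across process boundaries and exports spans to a backend for analysis.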