Today's users of web and mobile applications and services expect fast, outstanding experiences. Delivering successful web services and applications means meeting these baseline expectations. In this blog post, we dive into why these three goals are vital to modern web applications and services. Then, we look at how building global, distributed architectures achieves these goals.
In serverless functions, every millisecond costs you. The visible cost is what the cloud provider charges. The hidden cost is customers who are unhappy with the performance of your application or website. While performance monitoring is a must for modern applications, implementing correct and useful monitoring can take a lot of effort. Monitoring tools are a lifesaver here. We will use Thundra’s APM to measure the latency of the two serverless databases in this article.
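To make "measuring latency" concrete, here is a minimal sketch of timing two operations by hand using only the Python standard library. It is not Thundra's API; an APM tool collects these figures for you. The functions `query_database_a` and `query_database_b` are hypothetical stand-ins for the two database calls, simulated here with short sleeps.

```python
import statistics
import time
from typing import Callable, Dict, List


def measure_latency_ms(operation: Callable[[], object], runs: int = 20) -> List[float]:
    """Call `operation` `runs` times and return each call's latency in milliseconds."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        operation()
        samples.append((time.perf_counter() - start) * 1000.0)
    return samples


def summarize(samples: List[float]) -> Dict[str, float]:
    """Reduce raw samples to the mean, the median, and the worst case."""
    return {
        "mean_ms": statistics.mean(samples),
        "median_ms": statistics.median(samples),
        "max_ms": max(samples),
    }


# Hypothetical stand-ins for the two database calls being compared.
def query_database_a() -> None:
    time.sleep(0.002)


def query_database_b() -> None:
    time.sleep(0.004)


if __name__ == "__main__":
    for name, op in [("database_a", query_database_a), ("database_b", query_database_b)]:
        print(name, summarize(measure_latency_ms(op, runs=10)))
```

Hand-rolled timing like this only covers the code path you wrap. It misses cold starts, network hops, and downstream calls, which is where a dedicated monitoring tool earns its keep.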
OpenTelemetry represents an effort to combine distributed tracing, metrics, and logging into a single set of system components and language-specific libraries. Recently, OpenTelemetry became a CNCF incubating project, but it already enjoys significant community and vendor support. OpenTelemetry defines itself as “an observability framework for cloud-native software”, although it should be able to cover more than what we know as “cloud-native software”.
Picture this: you run a development organization for a small startup. Or a large corporation. You’ve had to make some decisions that, while expedient, aren’t necessarily the most elegant. Time is money and money is time. You need to ship product fast. You have solid architectural standards but the tyranny of the urgent has upended some of your good intentions. This is our story.
In Part II (Part I is here) of our “Hitchhiker’s Guide to Prometheus,” we are going to continue with the overview of this powerful monitoring solution for cloud-native applications. In particular, we’ll walk you through configuring Prometheus to scrape exporter metrics and custom application metrics, show how to use the Prometheus remote write API, and discuss some best practices for operating Prometheus in production. Let’s get started!
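For a sense of what a custom application metric looks like to Prometheus, the sketch below renders a counter in the plain-text exposition format that Prometheus scrapes over HTTP. It is illustrative only: the metric name `app_requests_total` and the `method` label are hypothetical, and a real application would normally use an official client library rather than format the text itself.

```python
from typing import Dict


def render_counter(name: str, help_text: str, samples: Dict[str, float], label: str) -> str:
    """Render one counter, with a single label, as Prometheus exposition text."""
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} counter"]
    for label_value, value in sorted(samples.items()):
        # One sample line per label value, e.g. app_requests_total{method="GET"} 3.0
        lines.append(f'{name}{{{label}="{label_value}"}} {value}')
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical request counts, keyed by HTTP method.
    print(render_counter(
        "app_requests_total",
        "Total requests handled.",
        {"GET": 3.0, "POST": 1.0},
        label="method",
    ))
```

Serving this text from an HTTP endpoint (conventionally `/metrics`) is what lets a Prometheus scrape job collect the values.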
Systems run into problems all the time. To keep things running smoothly, we need an error monitoring and logging system that helps us discover and resolve whatever issues arise as soon as possible. The bigger the system, the more challenging it becomes to monitor it and pinpoint the issue. And in serverless systems with hundreds of services running concurrently, monitoring and troubleshooting are even more challenging.
I am extremely excited to announce that Epsagon has entered into a definitive agreement to join Cisco to advance its global Full-Stack Observability strategy, which we are thrilled to be a significant part of.