Last9

Sunnyvale, CA, USA
2020
  |  By Aditya Godbole
Dissecting the RCA of Alerting - Reliability, Correlations, Actionability.
  |  By Aditya Godbole
Modern monitoring systems depend heavily on ‘Alerting’ to reduce the Mean Time to Detect (MTTD) faulty systems. But, alerting hasn’t evolved to meet the demands of modern architectures. We’re changing that with Alert Studio.
  |  By Aniket Rao
Chasing shiny new toys, as always ;)
  |  By Piyush Verma
A short history of software monitoring, from the 00s. What has changed? Why are things so arcane?
  |  By Prathamesh Sonpatki
A detailed checklist of points you should consider before choosing a monitoring system.
  |  By Aniket Rao
Setting up OpenCost with Levitate to monitor the cost of Kubernetes clusters.
  |  By Tripad Mishra
We discuss the nuances of Federation in Prometheus, address Prometheus Scaling Challenges along with alternatives to Prometheus federation.
  |  By Aniket Rao
If you want to bring down your monitoring costs, you need to shake up a decision paralysis in engineering.
  |  By Prathamesh Sonpatki
Fulfill all your food delivery orders this December 31st by taming High Cardinality data 😉
  |  By Tripad Mishra
A deep dive on different metric types in Prometheus and best practices.
  |  By Last9
Are you using Prodvana.io for deployments? Send a change event to Levitate for every deployment from Prodvana.
  |  By Last9
You have probably heard of OpenTelemetry in the context of traces. But did you know OpenTelemetry also supports metrics with a comprehensive, forward-looking data model and SDKs? When it comes to metrics, one thinks of Prometheus, but Otel metrics provide exciting ideas such as cumulative deltas, exponential histograms, and more! This talk will demystify everything about Otel Metrics, from the data model to APIs to how to get started. We will cover the differences between Otel Metrics and Prometheus and explain the reasons why people get excited about using Otel Metrics.
  |  By Last9
The Indian Premier League is a unique sporting event for a dozen reasons. But for engineers in India, it’s one of a kind. Very few companies can boast of managing 30+ million concurrent users. Every year, this number grows. Last year, we witnessed ~60 million concurrent users. And things get bigger and larger every year.
  |  By Last9
Predicting the future is hard, especially with metrics-based monitoring systems, because metrics cardinality can snowball. This is important because it affects query performance adversely. Having visibility into what’s happening now and workflows to manage cardinality is crucial. Because the answers depend on the quality of questions, a system allows you to ask. The questions one may have is —
  |  By Last9
We have Carson Anderson, Sr. DevOps Engineer at Weave HQ, talking about how they implemented SLOs using Prometheus, what went wrong, and how they fixed it. This talk was given at "Last9 of Reliability" Discord community on 13th December. Talk Description: First thing's first: Yes, it really did take us 5 tries to implement our SLOs with Prometheus. While that may seem embarrassing, we are very happy to be able to share our SLO journey so that we can hopefully help you avoid the same mistakes.
  |  By Last9
Aniket and Prathamesh team up to discuss how high cardinality is solved today, and Aniket shows the Streaming Aggregation pipeline of Levitate to manage High Cardinality.
  |  By Last9
Most of your outages are probably caused by a change, and having observability around that will make a lot of difference. Dive into this walkthrough, where we showcase tracking Canary deployments in Argo CD, correlating events and metrics seamlessly with Levitate. For Site Reliability Engineers, DevOps engineers, Software Engineers, and Product Managers seeking to elevate their observability and ensure smooth deployments every time.
  |  By Last9
The Reliability podcast aims to speak with engineers who have worked on large, complex systems and glean through their learnings. What best practices should one imbibe? What are non-negotiable learnings to become better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these and more tracing personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.
  |  By Last9
The Reliability podcast aims to speak with engineers who have worked on large, complex systems and glean through their learnings. What best practices should one imbibe? What are non-negotiable learnings to become better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these and more tracing personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.
  |  By Last9
The Reliability podcast aims to speak with engineers who have worked on large, complex systems and glean through their learnings. What best practices should one imbibe? What are non-negotiable learnings to become better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these and more tracing personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.

Last9 provides tools to improve Reliability in large-scale cloud-native environments.

Our open-standards-based tools provide visibility into the Rube Goldberg of micro-services. We take away the toil of managing a time series database by dramatically reducing your costs and improving developer productivity.

Levitate is our time series metrics & events warehouse designed for scale and high cardinality. Our warehousing capabilities provide necessary control levers to ensure cost-efficient data growth management, surpassing traditional storage solutions.

Start your observability journey today with Levitate. A Managed Time Series Data Warehouse that SREs trust.