Oct 5, 2018

Be ready for any fault in your server infrastructure. Monitoring thousands of server metrics, ready-made for you. So you won’t miss a thing.

Okmeter.io shows you what's going on with your server infrastructure — deep-dive statistics and comprehensible charts provide you with insight about behaviour of server-side processes. Also okmeter.io will alert you on any problems that affect end-users, like slow page loads or server errors. Okmeter.io will help you fix issues faster by showing related odd events and possible problems.

Why okmeter is perfect for you:

  • Right metrics out of the box: To know what’s happening, you must collect proper parameters. We collected other people’s experience of how things might go south. And we figured out what metrics to collect to get all the info you need for troubleshooting.
  • Auto-detect of common pitfalls: Okmeter has a large knowledge base of typical problems that regularly occur with commonly-used technologies. Okmeter will automatically run 100s of diagnostic checks for each of your servers and subsystems.
  • Auto-magical integration: Okmeter will automatically detect every service, process and technology in your cluster and collect all needed metrics. It’ll appear organized in meaningful chart dashboards. All that with no configuration.
  • Cluster overview with drill-down: Okmeter metrics engine allows combining 1000s of metrics in one chart to get an overview of your whole cluster. And still you can drill-down to any specific subset you want. That allows you to pinpoint perf issues and root causes easily.

Okmeter auto-magically collects 100s and even 1000s of detailed metrics about every part of your system so you won’t miss a thing.

Sep 25, 2018   |  By Pavel Trukhanov
Brendan Gregg’s USE (Utilization, Saturation, Errors) method for monitoring is quite known. There are even some monitoring dashboard templates shared on the Internet. There’s also Tom Wilkie’s RED (Rate, Errors, Durations) method, which is suggested to be better suited to monitor microservices than USE.
Sep 3, 2018   |  By Pavel Trukhanov
Any changes to a Postgresql database first of all are saved in Write-Ahead log, so they will never get lost. Only after that actual changes are made to the data in memory pages (in so called buffer cache) and these pages are marked dirty — meaning they need to be synced to disk later.
Aug 27, 2018   |  By Pavel Trukhanov
A year ago we’ve added SMART metrics collection to our monitoring agent that collects disk drive attributes on clients servers. So here a couple of interesting cases from the real world.
Aug 21, 2018   |  By Pavel Trukhanov
Recently there was a mini-incident in a data center where we host our servers. It did not affect our service after all. And thanks to the right operational metrics, we’ve been able to instantly figure our what’s happening.
Mar 27, 2018   |  By Pavel Trukhanov
This is the second part of our two-part article series devoted to Elasticsearch monitoring. The heading of this article refers to Dante Alighieri’s “Inferno”, in which Dante offers a tour through the nine increasingly terrifying levels of hell.