Operations | Monitoring | ITSM | DevOps | Cloud

New features + new CI: Metrics, Status Page Widget, PandoraFMS, Automation rules, Alert report export

This post highlights some of the features and improvements that we have released in the last month. If you want to submit your own ideas or vote on existing feature requests, you can now use our new public roadmap at roadmap.ilert.com. ‍

5 Tips For Consumers To Shop Safely This Black Friday

While it makes for bleak reading, the frenzy of sales and online shopping activity surrounding Black Friday, means this pre-holiday season is a key period for cybercriminals. And each year we see an increase in cyberattacks during what should be a feel-good time. The picture is all-the-more worrying in 2022, as this Black Friday weekend (25th-28th November) falls on the same date as the USA vs England World Cup game – a highly- anticipated day of betting for bookmakers.

Reduce Data Costs: Log Sampling with OpenTelemetry and BindPlane OP

Redundant logs are a common nuisance in observability pipelines of all kinds. In large environments, excess logs can multiply data costs to unsustainable amounts. Log sampling is the process of randomly sampling logs to produce the same valuable insight with dramatically reduced data flow. Configuring agents in a pipeline to appropriately sample logs can be a pain. Pipeline managers, like BindPlane OP, make that process simple and scalable.

How integrating AWS into Cortex augments visibility into your infrastructure

With AWS re:Invent right around the corner, infrastructure has been top of mind at Cortex. Earlier this year, we launched our revolutionary Resource Catalog, which integrates with AWS accounts to automatically ingest all infrastructure components, from s3 buckets to lambdas. Through this process, Cortex allows you to track everything in a single place, while augmenting the information that already exists in AWS. The Resource Catalog surfaces live information about your infrastructure assets.

How Do You Measure Application Performance?

Web performance isn’t just about how long a website needs to render all its page elements—it also covers techniques for monitoring an application’s runtime, user-defined transactions, component response times, and network requests. The important thing is using performance data to evaluate the success of your app or service, whether you’re trying to compare different versions or introduce new capabilities.

Reducing MTTR for DevOps and SREs with PagerDuty Process Automation and InfluxDB

Mean time to resolution (MTTR) is a metric that transcends industry and technology. It’s a measure of how quickly, on average, support teams identify, act, and resolve IT issues and incidents. Because MTTR directly relates to service quality, maintaining a low MTTR is a critical goal for DevOps and SRE teams. These teams have a vested interest in resolving issues quickly because escalating incidents to higher levels of the support team increases response and resolution times.

Relational Database vs. Non-Relational Database

Relational database or non-relational database: which should you use for your projects? It’s a common question. When choosing the database type that’s right for your requirements, it’s important to understand the differences between the two. Both database types are practical in different situations and use cases and have commonalities.

My Most Surprising Discoveries from The SRE Report 2023

I’ve had the honor and privilege of authoring The SRE Report for the last three years. For the 2023 version, this included working with some amazing individuals like Anna Jones, Kurt Andersen, and Steve McGhee. Download The SRE Report 2023 here (no registration required).