Latest posts

Featured

Exoprise

Jan 14, 2019

We help you find and fix issues with your cloud apps fast. Exoprise is the leading solution provider for monitoring SaaS services like Microsoft 365, Box, Dropbox, Salesforce.com and more.

Sematext

Nov 6, 2018

Monitoring, log management, transaction tracing, and real user monitoring. Finally together!

NiCE IT Mgmt

Oct 18, 2018

From databases to servers, communication systems, operating systems and custom applications, NiCE provides the right application monitoring solutions and services.

Monitive

Monitive is an uptime monitoring service. Users sign up and enter their website address, which we check every minute from a random location around the world, and we notify them instantly when their site is down.

EventSentry

Jul 28, 2018

EventSentry is an award-winning Hybrid SIEM which features real-time log, system health and network monitoring to proactively monitor networks and preemptively respond to threats.

ManageEngine

Jul 2, 2018

ManageEngine crafts comprehensive IT management software with a focus on making your job easier. Our 90+ products and free tools cover everything your IT needs, at prices you can afford.

Raygun

May 27, 2018

Raygun is a Software Intelligence Platform that gives companies visibility into software problems. Errors, crashes and slow loading pages and scripts affecting end users are automatically detected, enabling teams to build excellent user experiences.

10 Key Application Performance Metrics & How to Measure Them

Jul 14, 2023 By mwatson In Stackify

If you are trying to figure out how to measure the performance of your application, you are in the right place. We spend a lot of time at Stackify thinking about application performance, especially about how to monitor and improve it. In this article, we cover some of the most important application performance metrics you should be tracking.

Service Level Objectives: A Complete Overview for Beginners

Jul 14, 2023 By Justin Reynolds In Stackify

DevOps engineers are under intense pressure to provide reliable, high-quality services to teams and stakeholders. In large part, this is because end users today demand seamless access to software and a great user experience – a trend that will only increase as digital transformation accelerates and we move further into the future. DevOps professionals rely on various metrics to meet performance and reliability goals, one of the most important being service level objectives (SLOs).

How our engineering team uses Polish Parties to maintain quality at pace

Jul 14, 2023 By Leo Sjöberg In Incident.io

It’s fair to say that delivering software faster has never been more relevant. But in doing so, it’s easy to let your bar for quality slip. Often, the guardrail to avoid this is to hire dedicated QA Engineers, whose sole job is to ensure your software works as it should and to spot any issues that arise. Seems sensible, right? Well, at incident.io, we take a different approach.

What Is Site Reliability Engineering? Understanding the complexities of this crucial function

Jul 14, 2023 By incident.io In Incident.io

Site reliability engineers manage a lot, and often in incredibly high-stakes environments. Remember that scene from "The Matrix" where Neo dodges bullets in slow motion? Of course you do. As an SRE, it can feel like you're the person getting hit by those bullets, frantically trying to investigate performance issues, automate away toil, and support the engineers around you, all before the next wave of attacks.

Celebrating Grafana 10: Top 10 Grafana features you need to know about

Jul 14, 2023 By Michelle Tan In Grafana

Since Grafana started 10 years ago, there have been more than 43,000 commits to the open source project. Grafana founder Torkel Ödegaard has made more than 7,600 of those commits, and he recently reflected on some personal favorites he’s worked on, ranging from early query builders to the latest navigation updates. Torkel isn’t the only one who has strong feelings.

PagerDuty Runbook Automation

Jul 14, 2023 By PagerDuty In PagerDuty

Learn how PagerDuty Runbook Automation can replace manual procedures in your runbooks with automated self-service tasks for faster resolution, simplified security and compliance and reduced support costs.

Rundeck by PagerDuty + Ansible

Jul 14, 2023 By PagerDuty In PagerDuty

Rundeck by PagerDuty is open source software that provides a centralized platform to help you manage and automate operations tasks. When you integrate Ansible with Rundeck, you get even more benefits.

What is Business Continuity and Disaster Recovery (BCDR)?

Jul 14, 2023 By Team Ninja In NinjaOne

Perhaps the worst IT scenario an organization can face is an unexpected and forced suspension of all its operations. The downtime that’s experienced in such a situation can lead to financial damages that far exceed those from lost data or hits to reputation. While cyberattacks vary in intensity and approach, downtime and catastrophic loss of data come in many more forms and are equally, if not more, difficult to avoid.

Discover, Learn, and Experience: The Qovery Playground is Now Open!

Jul 14, 2023 By Romaric Philogène In Qovery

In the dynamic world of development and operations (DevOps), one thing is clear: there's always room for new, innovative platforms that make life easier for developers and platform engineers. And today, we're thrilled to introduce our latest contribution to this dynamic sphere – the Qovery Playground.

Platform Engineering: The Key to Successful Transformation for the Enterprise

Jul 14, 2023 By CircleCI In CircleCI

Discover how CircleCI empowers teams while meeting compliance and governance needs. See how our centralized system supports platform teams as a key driver of your internal development process.

CI CD
DevOps

Operations | Monitoring | ITSM | DevOps | Cloud
