Operations | Monitoring | ITSM | DevOps | Cloud

Efficient Incident Management with Catchpoint and PagerDuty

The ability to detect and alert performance issues quickly is key to reducing the Mean Time to Resolve (MTTR). Proactive monitoring will catch incidents early on but triggering the right alerts and notifying the relevant incident management team is just as critical. Enterprises rely on multiple disparate tools to monitor different systems so there is a lot of data and noise generated which can render incident management inefficient.

Refreshing PagerDuty's Navigation for Increased Efficiency and Simplification

We are super excited to share that we are currently testing and in the process of rolling out a new desktop global navigation to all of our users. Things that are clear in retrospect often emerge from ambiguous and humble beginnings. Initially built as a simple on-call management tool for IT responders, PagerDuty has evolved into an end-to-end, enterprise-grade digital operations platform.

Pandora FMS vs Centreon vs Nagios XI

Centreon is a solution for monitoring applications, systems and networks, based on Nagios source code. On 1st August, 2005 the company Merethis (now Centreon) was founded and began working on “their” Nagios version, calling it Oreon. In July 2007, the Oreon software changed its name to Centreon due to a name conflict with Orion (a component of the SolarWinds monitoring suite).

Train, evaluate, monitor, infer: End-to-end machine learning in Elastic

Machine learning pipelines have evolved tremendously in the past several years. With a wide variety of tools and frameworks out there to simplify building, training, and deployment, the turnaround time on machine learning model development has improved drastically. However, even with all these simplifications, there is still a steep learning curve associated with a lot of these tools. But not with Elastic.

Masaf Dawood on Google Cloud's Compelling Enterprise Story

Google Cloud has been gaining some noticeable traction in recent months: 43% growth in Q2 is nothing to sniff about, especially during a global recession. Masaf Dawood is director of Google Cloud services with SpringML, a premier Google Cloud partner with specialties in application development, data analytics, machine learning and marketing analytics. SpringML works exclusively with Google Cloud and has worked on 200 engagements with Google since the consultancy was founded in 2015.

AWS ECU vs vCPU-Everything You Need to Know

If you’ve deployed an application or service to the Amazon Web Service (AWS) cloud, you’ve probably made use of an EC2 instance. One of the decisions that you had to make before you could start a new instance, was which instance type to use. Choosing an EC2 instance type can be a complicated process. AWS organizes their instance types into instance families, and within an instance family, there are varying sizes from micro to 32xlarge.

New release: Incident Automation just got even better with conditions in FireHydrant Runbooks

The ability to automate your incident response process means you can start responding to incidents faster. So it’s easy to see why FireHydrant Runbooks is so popular within the platform. When you let automation take over, you can spend more focus fixing problems and keeping your customers happy. Now with the addition of conditions, you can create even more powerful automation.

How to: Email Incident Stakeholders with conditions in FireHydrant

Our release of conditions in FireHydrant Runbooks has made it easier for teams who rely on email to communicate with key stakeholders or a distribution list. 💡If your team uses Slack, and you haven’t already installed our Slack integration, you should definitely check it out as it’s the easiest way to automate updates to channels when the status of an incident changes.