Operations | Monitoring | ITSM | DevOps | Cloud

Latest Blogs

Tech Talk: Galileo Cloud Compass

A video chat with the developers who bring right-sized cloud costing information in real-time. It all started with a casual conversation about the cloud and infrastructure monitoring among the team at Galileo and our parent company, The ATS Group. From there, it quickly evolved into a discussion on how we can alleviate a source of frustration for many of our customers.

How to Build Grafana Dashboards with InfluxDB, Flux and InfluxQL

We’re excited about today’s release of Grafana 7.1, which extends Grafana’s built-in InfluxDB datasource to run queries in both the Flux language and InfluxQL. This means it’s super easy to connect Grafana to InfluxDB — whether you use InfluxDB 1.8 or 2.0, Flux or InfluxQL. Because this InfluxDB datasource is built into Grafana 7.1, there is no separate plugin to download and install.

Derek Saves the Day with Network Monitoring

Network Monitoring solutions, much like the diagnostic and surgical tools of a medical professional, make it easier for the IT team to discover and locate devices installed within the network or operated via the cloud. These systems make it easier for the IT operations team to understand the ongoing issues in real-time, as and when they occur. Whether it is uptime, disk space, or any other performance issues.

Leadership and Innovation with Instacart's VP of Infrastructure

Blameless CEO Ashar Rizqi recently had the pleasure of interviewing Dustin Pearce in a virtual executive fireside chat and AMA. Dustin is an experienced leader in scaling hyper-growth, cloud-native companies, as the VP of Infrastructure at Instacart and having previously served as Head of Service Engineering at Slack.

A developer's guide to optimizing PHP performance

With its open-source nature, PHP has evolved into one of the most popular languages among web developers. According to w3techs, 78 percent of websites across the globe use PHP as their server-side language. Even amongst the top 1,000 ranked sites, PHP is dominant, being used by more than 50 percent of them.

Prometheus vs Nagios

Production environment stability and high availability are the holy grail of every SaaS company. R&D organizations put a lot of effort into achieving these goals by implementing different monitoring and alert methodologies and by utilizing a variety of systems and tools. Mean-time-to-detect (MTTD) and mean-time-to-repair (MTTR) are two crucial KPIs that help R&D management personnel determine the efficiency and proficiency of their teams’ responses to production incidents.

Managing Docker Logs with ELK and Fluentd

This article provides an overview of managing and analyzing Docker logs and explores some of the complexities that may arise when looking through the log data. We will go through the default logging approach, as well as look at some more advanced configurations that will make diagnosing issues in your Docker-hosted applications much easier going forward.

Kibana Settings: Spaces, Export Dashboard, and more

Kibana is considered the “window” to Elasticsearch and indeed it’s a powerful UI for searching, filtering, analyzing, and visualizing Elasticsearch data, but Kibana settings are also used to configure, administer and monitor the Elasticsearch cluster. In this lesson, we’re going to explore how Kibana settings can be tweaked for collaborative teamwork. Without further ado let’s jump right into spaces!

Using Dynamic Thresholds for More Proactive Issue Detection

Have you ever been paged for a critical issue and started troubleshooting only to find an obvious drop in requests that weren’t caught by a static threshold? Or a significant increase in a metric that didn’t cross a static threshold? Or even, evidence of warning alerts triggered long ago that should have enabled someone to resolve the issue and prevent it from causing business impact, but instead was ignored in the massive alert volume received by the team?