The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Wi-Fi and Application Health with Endpoint Monitoring

For today’s tech tip, we’re going to focus on our Endpoint Monitoring Client and two specific use cases: Wi-Fi and application health. With so much of the workforce working remotely, Endpoint Monitoring is an essential tool to help smooth the transition. According to recent research from Gartner, nearly half of US employees will continue to work remotely for at least some of the time post-pandemic.

Incident Review - Microsoft Office 365 Outage

The internet spans many domains, but when we talk about the backbone suite of every organization, Microsoft Office 365 is surely one of the biggest contenders. Soon after the recent CenturyLink/Lumen outage, we witnessed another major outage, this time affecting Microsoft O365. This might well be considered a bad month for the internet, as many everyday consumer services, including Reddit, Pinterest, and Google services, have been impacted.

Logs and Traces: Two Houses Unalike in Dignity

Intelligent Medical Objects (IMO) and its clinical interface terminology form the foundation healthcare enterprises need, including effective management of Electronic Health Record (EHR) problem lists and accurate documentation. Over 4,500 hospitals and 500,000 physicians use IMO products on a daily basis. With Honeycomb, the engineering team at IMO was able to find hidden architectural issues that were previously obscured in their logs.

Backing SCCM With Smart IT Experience Automations

Like many in IT, I am a big fan of Microsoft System Center Configuration Manager (SCCM). It’s one of those tools that you can’t really go without: it can help locate your company’s servers, desktops and mobile devices; it helps install client software, patch updates (see Microsoft Patch Tuesday); and it protects your endpoints and access control tools. All good things, but… Sometimes our beloved SCCM needs a little backup—like Robin to Batman.

Managing Sensu Go 6 using Ansible

Earlier this year, we shared the certified Ansible Collection for Sensu Go, which makes it easy to automate your monitoring and achieve real-time visibility into auto-scaling infrastructure. Now that Sensu Go 6 has been released, we’ll share the latest updates on the Collection, including the management aspects of Sensu Go 6, with a focus on the structure of Ansible playbooks in the Sensu Go 6 world.

Microsoft Teams and OpManager: The perfect team for your remote IT management game

It seems almost everything is going digital during this pandemic: businesses, education, and medical consultations. This increased digital consumption is squeezing the juice out of the IT infrastructure of many organizations. On top of that, remote work policies are posing serious security issues. At times like these, IT infrastructure monitoring is like a football game for IT admins. So how do you navigate all these challenges and score a touchdown?

You spoke, Microsoft listened. Ignite 2020 SCOM takeaways

Have you ever looked at SCOM on UserVoice, Microsoft’s way of collecting feedback on what end users have to say about SCOM and its future? Well, good news: the top two items are going to be addressed! At the Ignite 2020 System Center session, Dianna Marks (SCOM Product Marketing Manager) told us Microsoft has heard your feedback and will be taking action.

New in Grafana 7.2: $__rate_interval for Prometheus rate queries that just work

What range should I use with rate()? That’s not only the title of a true classic among the many useful Robust Perception blog posts; it’s also one of the most frequently asked questions when it comes to PromQL, the Prometheus query language. I made it the main topic of my talk at GrafanaCONline 2020, which I invite you to watch if you haven’t already. Let’s break the good news first: Grafana 7.2, released only last Wednesday, introduced a new variable called $__rate_interval.
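
As a rough illustration of why the new variable helps, the sketch below reproduces the rule Grafana documents for $__rate_interval: the larger of the panel interval plus the scrape interval, and four times the scrape interval. The function names, the 15-second scrape interval, and the metric name http_requests_total are assumptions made for this example and do not come from the post itself.

```python
def rate_interval(interval_s: float, scrape_interval_s: float) -> float:
    """Range, in seconds, following Grafana's documented rule for $__rate_interval.

    interval_s        -- the panel's $__interval (step between data points)
    scrape_interval_s -- how often Prometheus scrapes the target (assumed known)
    """
    return max(interval_s + scrape_interval_s, 4 * scrape_interval_s)


def rate_query(metric: str, interval_s: float, scrape_interval_s: float) -> str:
    """Build the PromQL string that rate(metric[$__rate_interval]) would expand to."""
    window = int(rate_interval(interval_s, scrape_interval_s))
    return f"rate({metric}[{window}s])"


if __name__ == "__main__":
    # Zoomed in: the step equals the scrape interval, so the 4x floor applies.
    print(rate_query("http_requests_total", 15, 15))
    # Zoomed out: the step dominates, so the range grows with it.
    print(rate_query("http_requests_total", 120, 15))
```

The floor of four scrape intervals keeps enough samples in the range for rate() to return data, while the added scrape interval keeps adjacent steps from missing increases that happen between them.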

Building and Using a 2020 Status Page with Uptime.com

A hosted status page gives you the peace of mind that users can always answer one simple question: is it up or down? Hosted status pages work because they offer third-party confirmation that your services are up. If your site goes down, the third party is likely still up, and you can point users to it for your status. Status pages are your personal 24-hour news cycle. Whether you’re up or down, customer service fields fewer support tickets, and users praise your transparency.

Easily view your old queries with Cloud Logging recent queries

As you analyze your logs for application performance, infrastructure errors, system events, and more, you may sometimes need to look back at logs you were previously analyzing to help correlate events and identify the root cause of a problem. To help, we are excited to introduce Google Cloud Logging recent queries, which make it easy to track and rerun your past searches as you dive deep into your log data.