Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

How to monitor istiod

Istio is a service mesh that enables teams to manage traffic in distributed workloads without modifying the workloads themselves, making it easier to implement load balancing, canarying, circuit breakers, and other design choices. Versions of Istio prior to 1.5 adopted a microservices architecture and deployed each Istio component as an independently scalable Kubernetes pod. Version 1.5 signalled a change in course, moving all of its components into a single binary, istiod.

New Microsoft partnership embeds Datadog natively in the Azure portal

We are excited to announce a new partnership with Microsoft Azure, which has enabled us to build streamlined experiences for purchasing, configuring, and managing Datadog directly inside the Azure portal. This first-of-its-kind integration of a third-party service into a public cloud provider reduces the learning curve for using Datadog to monitor the health and performance of your applications in Azure—and sets you up for a successful cloud migration or modernization.

VMware Management Pack Update Release (20.9.2060.0)

Our fourth update release for 2020 of OpsLogix VMware Management Pack for Operations Manager is now released. Improvements includes existing features such as Host Ram Disk monitoring and Discovery of Tagging information for Hosts and Virtual machines. Important: In our previous release we also simplified the configuration and licensing experience and moved everything under the administration pane.

Securing and Monitoring AWS Container Services

Developers, operations, and security teams must work together to address key workflows to secure and monitor containers, Kubernetes and cloud services across the entire cloud-native lifecycle. By addressing mage scanning, runtime security, and compliance, along with monitoring for Kubernetes, container, applications, and cloud services you can automate protection and performance management to accelerate cloud adoption.

Top 3 Things to Consider When Selecting a Log Analysis Platform

Effective log analysis can help you significantly reduce the time spent investigating and troubleshooting incidents. With the many different log analysis platforms available, it can be overwhelming to choose and difficult to know what to look for. In this short guide, we’ll share the top three things you should consider when selecting a log analysis platform for your business.

Application Performance Monitoring - What is APM?

Software applications are increasingly critical for businesses today. They perform key customer-facing roles, power back-office activities, and help us gain greater insight into business activities. Using software gives us greater efficiency and leverage, but can come at a cost in terms of transparency. It can be hard to see how well customers are being served, where they are struggling, or understand why parts of the business aren't working as expected.

What Are SSL Certificate Errors: Causes & Best Practices on How to Prevent and Fix Them

What do you think of a website that displays SSL/TLS certificate errors when you visit it? Most people abandon it in disappointment. A certain amount of trust and respect for the service is lost. After investing a lot of effort and time in getting users to visit your site, and the user finds the site down or showing a warning, it will result in having dissatisfied users. Moreover, if the downtime or warning is due to a security issue, it will also hurt your brand image.

Vital Web Performance

I hate slow websites. They are annoying to use and frustrating to work on. But what does it mean to be “slow”? It used to be waiting for document load. Then waiting for page ready. But with so many asynchronous patterns in use today, how do we even define what “slow” is? The W3C has been working on this with the new Event Timing and Element Timing API, and has defined some new Web Vital metrics to describe the different ways that slow performance can impact a webpage.

Be the First to Know When Microsoft 365 Service Issues Arise

On Monday September 28 a multi-hour global Microsoft 365 outage brought down Teams, Office 365 and Outlook leaving many people disconnected. While Microsoft outages are rare, there are a range of possible issues on your network and in your user’s environment that can cause service issues at any time. Knowing quickly when these are happening, and what’s causing them is key to keeping users productive on Microsoft 365.

How to Boost User Experience With RUM Software

According to a report by Google, mobile users abandon webpages if the load time is more than three seconds. In other words, even a fractionally slower website will lead to a bad user experience and increased bounce rates. Today’s users are more demanding and expect near-instantaneous load times and seamless experiences. If you can’t meet their expectations, they’ll find other websites or applications capable of providing a better experience.