Operations | Monitoring | ITSM | DevOps | Cloud

December 2018

Using AWS or Azure? Why You Need a Website Monitoring Service

Already have a monitoring service with your current host? Does your plan include Amazon CloudWatch, Azure Monitor or another proprietary monitoring tool? You might want to check the fine print. These services often provide internal web monitoring only. While they may check HTTP availability from locations outside their network, HTTP checks alone are not sufficient.

2018 - Year in review! On a path to continued innovation

It was indeed an eventful year for us. With a slew of new capabilities launched, more than 23 global events attended, and 12 seminars conducted in multiple cities, 2018 was a blast. This year also made us one of the first in the monitoring industry to introduce AI-driven Azure monitoring, along with many other significant enhancements.

IT in 2018: A ManageEngine retrospective

2018 is coming to an end and you might already be looking back at how this year has been for you. In the IT world, 2018 started with the processor bugs Meltdown and Spectre. From there, the year was filled with data breaches and ransomware. But things weren’t all bad this year; the GDPR went into effect, which is a huge step forward in data privacy, and plenty of new technologies were released.

eG Enterprise APM Wins 'Great User Experience Award' on an Independent Review Platform for B2B Solutions

FinancesOnline, an independent B2B solutions review platform, recently conducted an independent assessment of eG Enterprise APM and published their review. This review provides an overview of the features, benefits, and problems solved by eG Enterprise APM. Also, the review analyzes eG Enterprise APM in relation to other APM solutions in the market.

Why will the IoT change the world?

You may have heard on many occasions that this technology will change the world forever. And if you’re old enough, you’ll have seen that on some occasions, indeed, it has been like that, but on others, things have gone on as usual and that technology that was going to wipe everything out has ended up in the drawer of forgetfulness.

Upgrading to Sensu Go: takeaways (and solutions) from Sensu Summit

Now that Sensu Go is out, I thought this would be a great time to circle back and follow up on the Sensu Summit 2018 breakout session concerning Sensu 1.x to Sensu Go workload migration challenges. That session had some great feedback from Sensu users; we’ve been heads down over the past few months putting the pieces together to make it easier to move your existing workloads when you upgrade to Sensu Go and keep your existing Sensu Plugins working while you transition.

Why Web Apps Are Exceptional Choice To Surpass Competitive Era?

It’s been a while since advanced applications were introduced to reduce business hassles. But one question still bothers people: how should they develop a website, should they move to a mobile application, or should they choose between promoting a website, a web app or a mobile application?

Happy Holidays from Pandora FMS!

Christmas is here! Have you decorated your Christmas tree yet? Have you made gingerbread cookies? The people behind Pandora FMS love Christmas! Merry Christmas Pandora FMS 2018! The truth is that we are more Christmassy than Santa Claus and, during these days, we become more tender than ever, and we love to look back and remember all the good things that have happened in this year 2018.

Simple Way to Add E-Mail Notifications

Foglight has a robust rules engine for alerting and notification. It's often the case that you can get to the same end zone in Foglight by many different plays. Using the Service Builder is an easy way to group "things" together. "Things" can be higher level objects like database instances or hosts, or very detailed objects like a set of disks or jobs that match a pattern.

May All of Your Lights be Green this Holiday Season

Creating your own dashboard in Foglight is as easy as putting up the Festivus pole. In this post, a service containing all MySQL instances was created. A great feature of Foglight is the ability to create your own custom dashboards and reports. Expand the right-hand panel, and select Create dashboard. I normally start with "Use All Data."

Detecting Slow Database Queries as the Root Cause of Application Performance Slowdowns

Performance problems with the database can cause up to 70% of all application performance issues in production. eG Enterprise is a converged application and infrastructure performance monitoring and troubleshooting solution that helps you understand how database issues are affecting application performance.

Detecting Code-Level Issues in Microsoft .NET Applications

Developers and application owners need application code-level insight, so they can pinpoint issues in the code and fix them before users notice. eG Enterprise is an application performance monitoring and troubleshooting tool that helps you diagnose code-level issues in Microsoft .NET applications in no time.

Detecting Code-Level Issues in Java Applications

Developers and application owners need application code-level insight, so they can pinpoint issues in the code and fix them before users notice. eG Enterprise is an application performance monitoring and troubleshooting tool that helps you diagnose code-level issues in Java applications in no time.

Are Your Customers Satisfied with Their Web Application Experience?

In today's digital business era, it's true when people say ‘The customer experience is even surpassing price and product as the biggest brand differentiator.’ eG Enterprise is a digital experience monitoring solution that measures the customer experience on web sites and web applications in real time.

Why is the Website Slow? What is the Cause of Slow Page Load Time?

User experience is the basis for competition in the digital services world. A slow web site means you're potentially losing money and customers. Watch this short video and learn how eG Enterprise uses Real User Monitoring (RUM) to diagnose the cause of slow web page load time.

Building confidence via automated container security scanning - Xavier Vello - DockerCon EU 2018

Container image security scanners are one of several tools we use in our development process to ensure the software that we ship to our customers is reliable and safe. In this talk, we’ll discuss our approach to continuous vulnerability monitoring (spoiler: it’s all automated), and how it increases our responsiveness while decreasing our operational cost.

2018 Main Top 50, and we are number six!

We are headed to the top! We’ve risen another three rankings to number six for the 2018 Main Software 50 awards. The Main Software Top 50 list is an award handed out to Dutch software companies by the investment firm of Main Capital Partners. Each year Main Capital picks the 50 best companies, and Uptrends has continued to climb in the rankings.

5 Steps to Prepare Your Web Infrastructure for 2019

At least 10 major data breaches occurred in 2018 including Facebook, Google Plus, WordPress, and healthcare sectors. The surge in breaches proves we need to do a better job detecting foul play. User data and our own IP are crucial assets to safeguard in this environment. We also observed major downtime incidents from companies like Facebook and Microsoft.

Running LogicMonitor API Scripts in AWS Lambda

Sometimes it's necessary to run a maintenance API script in your LogicMonitor portal. For example, I move decommissioned devices into a specific folder because I no longer want to receive any alerts on these devices. An API script helps automate the process by running once a day to disable alerts on any new devices added to this folder.

Monitoring errors in Xamarin apps

Xamarin is based on Mono, the open source implementation of Microsoft's .NET Standard. It allows us to create apps that easily run on multiple devices like phones and smart watches. It solves the difficulties many developers face when they’re developing cross-platform apps, such as different coding languages and UI paradigms. With Xamarin, you can use C# as a single language for iOS, Android, and Universal Windows apps.

Proactive ITSM: Staying Ahead of The Curve

Although technology continues to evolve, the processes that support Information Technology Service Management (ITSM) have remained relatively unchanged for several decades. One of the main challenges to delivering high-quality IT services in this long-established approach is reactivity – that is, focusing on incident management as a means to resolve something that should never have happened in the first place.

Domain Blacklists: How to Check and Remove Your Site

Are website visitors experiencing problems with accessing your website or email? Your domain may be blacklisted. If your website isn’t available, blacklists can wreak havoc on your web traffic potentially resulting in lost revenue. It’s crucial to remedy the problem immediately. Uptime.com includes a Domain Blacklist Check to continually monitor your site for issues.

OpsRamp Webinar - Preparing for a Cloud World - A review of the #CloudNative Skills report

OpsRamp recently released a report on how IT organizations are handling the massive gap in skills for moving to cloud-native technologies. Watch this webinar, Preparing Your Organization For a Cloud World, to review the key findings and insights from the Cloud Native Skills Crisis report and see a quick demo of the OpsRamp platform.

Gain End-to-End Intelligence Across Your Customer Experience and Application Performance with Latest Updates to AppDynamics

This month, we are excited to deliver additional AppDynamics functionalities across observability, intelligence, and usability that will help enterprise companies gain end-to-end intelligence across customer experience and application performance.

WAN monitoring: changes associated with the Internet-based model

When we think about WAN monitoring we usually start from the basics: the behaviour of remote communication links will directly affect the performance of our applications. Therefore, we understand that if traffic over the communications link experiences high levels of latency this will negatively impact the response time that our users observe when accessing the applications.

Monitoring Kubernetes, part 1: the challenges + data sources

Our industry has long been relying on microservice-based architecture to deliver software faster and safer. The advent and ubiquity of microservices naturally paved the way for container technology, empowering us to rethink how we build and deploy our applications. Docker exploded onto the scene in 2013, and, for companies focusing on modernizing their infrastructure and cloud migration, a tool like Docker is critical to shipping applications quickly, at scale.

Visualize your Thundra Monitoring data with Honeycomb

Identifying critical issues in your stateless serverless environments can be difficult. Often, you are left guessing at where the problems may lie. Learn how to pinpoint critical issues in your AWS Lambda environment with deep queries and end-to-end tracing.

How to Troubleshoot .NET Application Performance Problems

Ready to know why your Microsoft .NET applications are slow? What is causing performance problems? Is there an issue in the .NET code? Developers and application owners often get involved in long war room sessions to isolate the root cause of application performance problems. With the right know-how, you can triage problems faster.

How to Monitor CPU and Memory on Ubiquiti Unifi Devices

Retune AB manages a variety of Ubiquiti devices -- wireless data communication products for enterprise and wireless broadband providers. Naturally, we wanted to bring these in under monitoring. However, Ubiquiti does not expose real-time CPU or memory metrics through SNMP in a way that we found reliable and these are some of the key values needed to verify the health of the device.

Why Shift-Right is Essential for SaaS Applications

To many IT software teams, the mantra currently in vogue for team practice is “shift-left.” That refers to moving certain activities, such as code integration, build, and testing, earlier in the software development and delivery process. By shifting them to the left, the team knows more about code quality and performance earlier, allowing for corrective action and making them nimbler in response.

The Multi-faceted Use Cases and Benefits of Application Performance Monitoring Tools in Enterprise IT

Application performance monitoring (APM) solutions are among the most essential tools for IT today. As organizations undertake transformational initiatives such as cloud migration, container orchestration and microservices, they need to be able to manage performance of their business-critical applications and end-user experience across complex and sophisticated technology landscapes.

How Much Should My Observability Stack Cost?

What should one pay for observability? How much observability is enough? How much is too much, or is there such a thing? Is it better to pay for one product that claims (dubiously) to do everything, or twenty products that are each optimized to do a different part of the problem super well? It’s almost enough to make a busy engineer say “Screw it, I’m spinning up Nagios”. (Hey, I said almost.)

Whitelist Email Addresses: Thunderbird

TLDR; "If you expect to receive important emails from a trusted email address it is worth whitelisting the address to make sure that emails won't be accidentally blocked by an overzealous email client." In this post we show you how to do it in Mozilla Thunderbird by selecting the correct address book settings and adding the email address as a contact in your address book...

Forrester Insights: Powering Digital Transformation With Intelligent Monitoring & Analytics

During our recent webinar, Using the Right Data In Context to Prevent IT Outages, Forrester Principal Analyst Charles Betz shared his insights on the importance of leveraging the different types of machine data collected through intelligent monitoring and analytics solutions.

AWS monitoring 101: Metrics to watch out for

Amazon Web Services (AWS) is one of the most popular public cloud providers today. Over the years, AWS’ services have expanded from cloud computing to application development and security. To retain the reliability, availability, and performance of your AWS instances, an AWS cloud monitoring solution is a must. It’s critical for AWS monitoring tools to collect data from all parts of your AWS service, so that multi-point failure can be easily debugged.

Windows as a Service: Stay Ahead. Keep Control.

With technology’s unrelenting advance, the evolution of the digital workplace has unarguably entered the fast lane. Microsoft – which provides critical digital workplace solutions through their Windows OS – is no exception. Indeed, end users indispensably depend on Microsoft’s range of workplace services to achieve everyday tasks – from simple log-ins to advanced programming – putting them at the front lines of any new updates and modifications.

On the merits of pubsub & workflows (or, why Sensu over Nagios)

Not too long ago in the Sensu Community Slack, the question: “Why Sensu instead of Nagios?” arose. Specifically, “How do I convince my boss to choose Sensu over Nagios?” I responded to the thread, but decided it was worthwhile to share my response with the wider community. At Willis Towers Watson, we moved from Nagios to Sensu 1.2 almost a year ago (and now we’re upgrading to Sensu Go).

Health Check With "Applicare"

‘An ounce of prevention is better than a pound of cure.’ Because Ben Franklin knew a lot about not letting small problems become big headaches, he would have loved Applicare! Discovering trouble after it happens is too late. The only way to survive in business is to stay ahead of what tomorrow may bring. Arcturus Technologies’ health check service can keep your application environment tuned-up and running smoothly.

Applications Manager: A game changer for multiple businesses

To retain a competitive advantage in today’s fast-paced digital market, organizations need to focus on their application stack. Error-prone or bug-ridden applications may deter potential customers from visiting a site again. Organizations must monitor application performance at every level to ensure optimal performance of business applications.

AIOps: Building the Next Generation of Intelligent Infrastructure

As engineers, we are constantly bombarded with complex machine data. In order to better monitor and troubleshoot our environments, we must analyze and gain an accurate understanding of this data to best evaluate how our systems are performing and combat any issues that may occur. Yet, sifting through this data while managing the added intricacies of serverless architecture, microservices, containers, and other technologies makes our jobs increasingly difficult to navigate.

Citrix Cloud 101: Key Questions Every Citrix Admin Wants Answered

A few weeks back, eG Innovations collaborated with David Wilkinson and conducted a webinar on the topic “Is Citrix Cloud Enterprise Ready? Best Practices to get the Most Out of Citrix Cloud Deployments.” Citrix Cloud implementations are growing in the industry today, and as organizations begin evaluating their cloud options, Citrix administration teams want to understand how Citrix Cloud will sustain, scale and be supported in lieu of on-premises Citrix deployments.

BMW Gets Clarity to Quickly Solve for the Digital-First Driver, Leading the Way in Connectivity

BMW is a brand known for premier driving experiences. But it’s also a software company operating within the internet of things (IoT). Having grown its digital services team from 70 to 180 people in three years, BMW is serious about elevating the personalized driving experience.

SolarWinds Lab Episode #71- FestivOps for the Rest of Ops

Confused about the much-hyped DevOps? Curious if developers' monitoring tools are different from those made for operations? Wondering if there are hidden cloud tools in the Orion® Platform modules you already have? As always, SolarWinds Lab™ is here to help. In a SolarWinds Lab first, Head Geeks™ Thomas LaRock and Patrick Hubbard travel to Las Vegas for AWS re:Invent to interview technology pros and SolarWinds customers about how much or even *if* they’re happily using DevOps and cloud technologies in production.

Elixir Overview and Tutorial (as told in a Wizard fable)

Interested in learning the Elixir language? Join us in this entertaining Elixir tutorial and overview. This post will spin a yarn about an ambitious wizard, Alatar, and his quest to revamp a magic web storefront using Elixir. We will observe Alatar decide on Elixir as his development platform, and follow him on the journey of learning and implementation. Along the way, he will utilize several frameworks written for Elixir (including Phoenix, Ecto, and Poison).

Now Available: IBM Cloud Monitoring with Sysdig.

Today at Kubecon we announced the availability of IBM Cloud Monitoring with Sysdig. Together, IBM and Sysdig have launched this new offering to provide a fully managed enterprise-grade monitoring service for cloud-native applications on IBM Cloud. If you build, ship, and run applications on IBM Cloud, you now have direct, integrated access to Sysdig Monitor.

Moving Ahead: $85 Million in Funding and the Next Chapter in Our Journey

Patrick, Vincent and I founded Nexthink because we believed in a future in which IT departments and employees work together to have a great digital experience. At that time, we observed that IT departments were traditionally focused on servers, networks and applications and often neglected the employee experience as a key driver for success and productivity. At best, organizations were reactive in supporting users, instead of proactively fixing issues before employees were impacted.

8 Features your e-commerce website must have to make it big

If you are running an online business, your sales will greatly depend on the quality and availability of your e-commerce website. While the quality of the product is equally important, it is your website that tempts potential consumers to buy your product. In an age when the majority of customers search and buy products or services online before visiting a store, the quality of your e-commerce website becomes more important in shaping public perception about your product.

The Next Great Thing in Rubber Duck Debugging

There's a line in the second Harry Potter film where a wizard named Mr. Weasley asks “Tell me, what exactly is the function of a rubber duck?” It’s a good question. Some acceptable answers: rubber ducks are for singing to in the bath, floating down a river as part of a creative fundraiser, or entertaining your dog. Developers, however, have a completely different answer to Mr. Weasley's question.

Foglight Container Management is available!

We’re excited to announce the general availability of a brand new product: Foglight Container Management – Part of the Foglight for Performance Management suite. Foglight Container Management provides real-time and historical analytics of containers and their hosts, across physical, virtual and cloud environments. It identifies performance bottlenecks, failed containers and issues within the orchestration layer.

Monitor Postgres Disk Space Usage with Foglight

Many things can happen if the database runs out of disk space. None of them are good. DBAs understand that it is essential to monitor database disk space so that critical business processes are uninterrupted. Quest’s Foglight provides peace of mind by monitoring that space and alerting on the threshold well in advance of potential space issues.

Healthcare IoT: Monitoring Diabetes with Logz.io

Before I hop right in, it’s important to understand a bit about diabetes. Diabetes is what happens when your body cannot produce (type 1) or respond (type 2) to insulin effectively. The impact on the body is frequently quite severe — people who have difficulty controlling their blood sugar levels run the risk of losing feeling in their fingers and/or toes or even going into a coma if their blood sugar is either too high or too low.

Distrusted Symantec Certificates are Added to SSL Monitoring

Google announced that Chrome would begin distrusting certificates issued by Symantec Corporation’s PKI, and other major browsers have followed the decision. The affected certificates are those from Thawte, VeriSign, Equifax, GeoTrust, and RapidSSL that were issued before December 1, 2017.

2018 Website Outages: Key Lessons from Popular Website Downtime

Now that companies depend on the cloud for access to key services and business operations, downtime has a larger impact on productivity. Uptime is just as critical to small businesses as it is to major ecommerce retailers on Black Friday. Even the public relies on various services like Alexa and email to be available throughout the day. Looking at major outages over the past year provides insight into how companies prepare to handle these events.

Why MSPs Need Application Availability Monitoring

I’ve lived in the virtualization and cloud world for a really long time. During that time, I’ve also seen the impacts on users when you don’t have a good performance monitoring and troubleshooting system in place. That said, what if you have managed services? Do you understand what challenges managed services providers (MSPs) face when working without a monitoring and troubleshooting tool? How are you effectively resolving common VDI performance as well as connectivity issues?

CloudReady Dashboards Tips & Tricks Part #3

This is the third part of a series on the CloudReady dashboards and visualizations. The first part covered basics like Overviews, Refresh, and Layout settings. The second part covered more advanced dashboard usage like layout pinning, capture and embedding. In this third part we’ll cover key widgets, their usage and settings.

Part II: Anomaly detection within monitoring: how can you get started?

In a previous post we introduced anomaly detection as a group of techniques used to identify unusual behavior that does not comply with the expected data pattern. In this article we will find out how we can apply anomaly detection within monitoring.
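As a rough illustration of the idea (not the method from the linked article), one of the simplest anomaly detection techniques flags values that sit far from the mean of a metric, measured in standard deviations. The function name, the sample latencies, and the 2.5 threshold below are all assumptions chosen for the sketch.

```python
from statistics import mean, stdev


def find_anomalies(values, z_threshold=2.5):
    """Return (index, value) pairs whose z-score exceeds z_threshold.

    Hypothetical helper for illustration only. With n samples, the largest
    possible z-score is (n - 1) / sqrt(n), so small windows need a
    threshold below the textbook 3.0.
    """
    mu = mean(values)
    sigma = stdev(values)  # sample standard deviation
    if sigma == 0:
        return []  # all values identical, nothing stands out
    return [
        (i, v) for i, v in enumerate(values)
        if abs(v - mu) / sigma > z_threshold
    ]


# Made-up response times in milliseconds with one obvious spike.
latencies_ms = [100, 102, 98, 101, 99, 103, 97, 100, 102, 500]
anomalies = find_anomalies(latencies_ms, z_threshold=2.5)
print(anomalies)
```

A real monitoring setup would usually compute the baseline over a rolling window and account for seasonality; this sketch only shows the basic comparison against an expected pattern.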

9 Gifts for Your Stressed-Out MSP Colleagues

The sprint to the end of the year can be crazy for MSPs—new maintenance templates have to be made, calendars and expenses need to be updated in the PSA, and everyone has to mentally prepare for the in-laws to visit. Don’t let the Most Hectic Time of the Year affect your bottom line or the health of your team. These nine gadgets will help boost personal productivity, reduce stress, and eliminate distractions so you can help everyone stay focused and productive.

Tracking VueJS SPA user behaviour with Google Analytics

In the past I used to use the right tool for the right purpose. This led me to employ a lot of tools, and with most turning out to use subscription-based billing, our costs increased much more than I would have hoped for. So, I adopted a new strategy: Use as few tools as possible, but use them as much as possible.

Handling Sensu Plugin handlers in Sensu Go

In case you missed it, Sensu Go is here! And, as I wrote about previously, one of the hurdles with migrating workloads from the original version of Sensu to Sensu Go is the set of changes in the internal event data structure. The existing handlers and mutators in the community-maintained Sensu Plugins collection might not work as expected in Sensu Go because of these event data model changes. But friends, I’m here to tell you that we’ve got this problem licked.

Infrastructure UI for Kubernetes and Docker using Elasticsearch and Kibana

The Elastic Stack comes with powerful data visualization capabilities. Filebeat and Metricbeat modules, as well as Elastic APM, ship with pre-built Kibana dashboards that serve as a great starting point for exploring logs, metrics, and APM data in Kibana. On top of that, the Infrastructure, Logs, and APM UIs enable common workflows for correlating the data coming from different operational contexts.

NiCE VMware MP 5.00 Preview Release

Be the first to get a complete picture of the health and performance of your business-critical VMware environment using the new NiCE VMware Management Pack. The NiCE VMware Management Pack delivers first-rate monitoring for business-critical, highly dynamic virtualized environments. Leverage your existing investment and reduce costs, save time and build efficiencies now.

Key metrics for monitoring Tomcat

Apache Tomcat is a server for Java-based web applications, developed by the Apache Software Foundation. The Tomcat project’s source was originally created by Sun Microsystems and donated to the foundation in 1999. Tomcat is one of the more popular server implementations for Java web applications and runs in a Java Virtual Machine (JVM).

Analyzing Tomcat logs and metrics with Datadog

In Part 2 of this series, we showed you how to collect key Tomcat performance metrics and logs with open source tools. These tools are useful for quickly viewing health and performance data from Tomcat, but don’t provide much context for how those metrics and logs relate to other applications or systems within your infrastructure.

Part I: Anomaly Detection in monitoring: what can we really do?

In recent years we have frequently found the term anomaly detection in monitoring. In fact, some monitoring tools have introduced in their features the customized application of anomaly detection algorithms and some companies offer anomaly detection from data collected by monitoring tools.

Incident Communications - Get Ready for Black Friday/Cyber Monday 2019!

As the year draws to a close, for many of us this is a time to slow down, kick back and look forward to holiday time. For others, the work certainly isn’t done yet. The “S” word comes down to bear. Like it or not, this time of year – it’s all about the Shopping.

MySQL DBAs Are Obsessed with This Freebie for Five Surprising Reasons

Hearing you’re getting MySQL is like hearing you’ve been “volunteered” to pet sit your neighbor’s rabid ferret: There’s confusion, mild panic and the frustrating realization that someone else is saving money because you’re doing all the hard work. Open source: fantastic for IT budgets, not so fantastical for DBAs.

Sponsored Post

A guide to Apdex score: Calculations, improvements, and more

Apdex scores are a fantastic, simple tool you can use today to better understand how your development team is doing, how it can be improved, and the impact of almost every change to your service. They're also likely a part of your Service Level Agreements (SLAs) with your customers, which help them understand your platform's availability.
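For readers who want the calculation itself, here is a minimal sketch of the standard Apdex formula: samples at or below a target threshold T count as satisfied, samples between T and 4T count as tolerating (weighted by half), and anything slower counts as frustrated. The function name, sample values, and the 0.5 second threshold are assumptions for illustration and do not come from the linked guide.

```python
def apdex(response_times, threshold):
    """Apdex = (satisfied + tolerating / 2) / total samples.

    Hypothetical helper: response_times and threshold must use the same
    unit (seconds here).
    """
    if not response_times:
        raise ValueError("need at least one sample")
    satisfied = sum(1 for t in response_times if t <= threshold)
    tolerating = sum(
        1 for t in response_times if threshold < t <= 4 * threshold
    )
    # Frustrated samples (slower than 4 * threshold) contribute nothing.
    return (satisfied + tolerating / 2) / len(response_times)


# Made-up samples: 2 satisfied, 2 tolerating, 1 frustrated at T = 0.5 s.
samples = [0.2, 0.4, 0.6, 1.5, 3.0]
score = apdex(samples, threshold=0.5)
print(round(score, 2))  # 0.6
```

The score always falls between 0 (every request frustrated) and 1 (every request satisfied), which is what makes it convenient to quote in an SLA.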

The New and Improved Uptime.com Transaction Check Tool

The Uptime.com Transaction Check tool is evolving. It’s designed to mimic user interactions, and can interact with nearly every element on your website. The Transaction Check is an important monitor for those worried about conversions or signup forms. It can measure landing pages, shopping carts, and other interactive elements, mimicking the customer experience and providing important metrics about response time and errors along the way.

Sponsored Post

C# Logging best practices in 2019 with examples and tools

Applications that have been deployed to production must be monitored. One of the best ways to monitor application behavior is by emitting, saving, and indexing log data. Logs can be sent to a variety of applications for indexing, where they can then be searched when problems arise.

New generation of web servers based on HTTP/2 and with TLS by default: "Caddy Web Server".

As we all know, Pandora FMS allows the monitoring of practically any device or application. Let’s talk about web content servers. Even the very popular applications or “apps”, made for the Android operating system of our phones, generally use API commands, which are also hosted on web servers to take advantage of the secure protocol (HTTPS).

Cloudways - A Managed Cloud Hosting Platform that Facilitates Choice, Simplicity, and Performance

A reliable web host is unlike any other friend when you’re super monitoring your website. You should be able to spread your wings and expand those horizons without all the fuss. In our search of many web-hosting providers, we found one name that is powerful enough to scale your website effectively – Cloudways.

The Tool Sprawl Problem in Monitoring

One of the biggest KPIs in the DevOps space is monitoring. There are so many tools to help any organization to complete their monitoring picture, but no tool does everything and most organizations use many tools to help complete their monitoring solution. Mashing tools together often creates a problem of its own — the tool sprawl problem.

Faultd Update - Next Generation Alerting

Circonus will soon be releasing our next generation fault detection system, faultd (fault-dee). Faultd is an internal component of our infrastructure that has run alongside our existing fault detection system for several months, with outputs verified for accuracy. Additionally, it is in use by some of our enterprise customers, who have reported no issues with faultd.

How to Monitor Kubernetes Without an Agent on Every Node

LogicMonitor is an agentless monitoring solution. What we really mean by “agentless” is that we don’t require an agent on every monitored server (physical or virtual). One LogicMonitor Collector - a lightweight application that takes just seconds to install - can monitor hundreds or even thousands of devices, including servers, virtual machines, network switches, storage systems, cloud resources, containers, and more.

Site Reliability Engineering Meets Traditional Operations

Google has effectively made the discipline of site reliability engineering (SRE) a DevOps best practice by publishing two decades’ worth of lessons in keeping alive the most scalable apps on the planet. As more organizations make the shift (or “transformation,” as it were) to becoming IT organizations, the demand for reliability increases substantially for customer-facing services.

ActiveMQ architecture and key metrics

Apache ActiveMQ is message-oriented middleware (MOM), a category of software that sends messages between applications. Using standards-based, asynchronous communication, ActiveMQ allows loose coupling of the elements in an IT environment, which is often foundational to enterprise messaging and distributed applications.

Collecting ActiveMQ metrics

In Part 1 of this series, we looked at how ActiveMQ works, and the key metrics you can monitor to ensure proper performance of your messaging infrastructure. In this post, we’ll show you some of the tools that you can use to collect ActiveMQ metrics. This includes tools that ship with ActiveMQ, and some other tools that make use of Java Management Extensions (JMX) to monitor ActiveMQ brokers and destinations.

Stackdriver tips and tricks: Understanding metrics and building charts

Seeing what’s going on with your IT infrastructure, applications and services has always been critical to the success of modern businesses’ day-to-day operations. Google Stackdriver monitoring provides out-of-the-box visualizations and insights for Google Cloud Platform (GCP) users so you can easily understand your systems.

Safe Web Services with Actix and Sentry

Remember that time Mom told you that the internet is a dangerous place? No? Well, she did, but you weren’t listening. Jokes aside, we can probably all agree that there are many potential security risks in web services, with all their APIs and user-contributed content. Yet the internet is what defines our digital age, and barely any piece of technology can do without it. In the midst of this insecurity, Rust came along with its memory safety and zero-cost abstractions.

SolarWinds NPM: Your Complete Network Monitoring Solution

SolarWinds® Network Performance Monitor (NPM), created by network engineers for network engineers, is a complete monitoring solution designed to provide you with the tools you need to work smarter, improve visibility, and prevent downtime. See why SolarWinds is a worldwide leader in network monitoring.

Hands Off My Docker Containers: Dynamic Java Instrumentation in Three Easy Steps

Instrumenting your application with an APM tool is not always easy. Configuration is often complicated, and managing agent files can be daunting. AppDynamics has developed a three-step solution for automating Java agent deployment and infrastructure monitoring in a Docker environment.

Enterprise WLAN 101: The Basics of Big Wi-Fi

There are serious differences between smaller business and enterprise wireless environments. At the same time, defining “enterprise” can be tricky. For where we’re going in this piece, enterprise equals big as measured by client device counts and diversity, complicated when it comes to security, and critical when it comes to uptime and stability. That gets the conversation started in the right place.

Understanding Your Customer Should Be Your #1 Priority

Does anyone like receiving calls from telemarketers? Unless you use the opportunity to set up a prank and get a good laugh, odds are these calls annoy you just as much as me. These over-the-phone salespeople are frustratingly persistent as they interrupt my day, and they also missed one key step when they dialed my number: Researching whether there was any chance that I would want their product.

How a company might lose more than $7 billion in 30 minutes

I've been working at WebGazer for seven months. For every day I spend in this business, I feel like I'm trying to run on a tightrope. Since our job is website monitoring, I see similar downtime tragedies every day. I was reading some old downtime stories like "Amazon has lost $3.75 million in only 20 minutes!". Then I decided to research some downtime tragedies that might happen.
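To put the quoted figure in perspective, here is a rough sketch of the arithmetic in Python. It assumes losses scale linearly with downtime, which is a simplification, and the function name `loss_per_minute` is hypothetical and chosen only for illustration.

```python
def loss_per_minute(total_loss_usd, minutes):
    """Average revenue lost per minute of downtime (linear assumption)."""
    if minutes <= 0:
        raise ValueError("minutes must be positive")
    return total_loss_usd / minutes


# Figure quoted in the story: $3.75 million lost in 20 minutes.
quoted_rate = loss_per_minute(3_750_000, 20)
print(f"${quoted_rate:,.0f} per minute")  # $187,500 per minute

# Headline scenario: the per-minute rate implied by $7 billion in 30 minutes.
headline_rate = loss_per_minute(7_000_000_000, 30)
print(f"${headline_rate:,.0f} per minute")  # $233,333,333 per minute
```

The gap between the two rates shows how much larger the headline scenario is than the quoted Amazon outage.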

Monitoring Microservices: IT's Newest Hot Mess

In this THWACKcamp session, you’ll learn how microservices are different from other applications, when performance bottlenecks most often occur, how they tend to break, and where you can add monitoring to stay ahead of trouble. You’ll also see how to extend existing infrastructure dashboards to include microservice workloads, cut troubleshooting time, and include new business metrics that measure the business goals driving microservices in the first place.

Six Ways to Improve Your Security Posture Using Critical Security Controls

Security policies within organizations are under a lot of scrutiny today. Trying to stay up to date with these policies can create stress for users and for the IT staff managing the infrastructure. Just as network standardization is a must, so is security standardization.

"Observability": Just a Fancy Word for "Monitoring"? A Journey From What to Why

Too often, monitoring is a never-ending arms race. We keep adding more monitoring in response to new problems, but the cycle never seems to end. Humans (the business) drive new changes, which cause new problems, which in turn require more monitoring. And that’s where real, useful observability may finally be able to help identify root causes and break the cycle of reactive monitoring for novel issues.

Monitoring Like a Network Engineer When You're a SysAdmin

Last year, we showed network engineers how to monitor like sysadmins. This year, we're flipping the script and showing systems administrators that there's nothing to fear from those network devices, and that monitoring them won't steal precious time from ensuring business services are up and users are happy.

Ruby Agent 2.4.21 is out with a bug fix, a new configuration option, and a debug option

As reported on Issue #228, if scout_apm is disabled on a node via the configuration monitor = false, we don't intend to install any instruments, but a few snuck in anyway. Since the rest of the agent isn't running, they (slowly but steadily) built up recorded info, but didn't purge it, causing a slow memory leak that became clear over the course of a week or two. We've stopped the offending instruments from installing themselves when Scout is disabled.
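The failure mode described above can be illustrated with a toy model. This is a minimal sketch in Python, not the Scout agent's actual Ruby code: the `Agent` class, its methods, and the `respect_disabled` flag are all hypothetical names invented for this example. It shows how instruments that keep recording while nothing purges the data make memory grow steadily, and how skipping installation when monitoring is disabled avoids that.

```python
class Agent:
    """Toy stand-in for a monitoring agent (hypothetical, not Scout's code)."""

    def __init__(self, monitor):
        self.monitor = monitor              # mirrors a "monitor = false" setting
        self.buffer = []                    # recorded data awaiting reporting
        self.instruments_installed = False

    def install_instruments(self, respect_disabled):
        # The fix: skip installation entirely when monitoring is disabled.
        if respect_disabled and not self.monitor:
            return
        self.instruments_installed = True

    def record(self, event):
        # Installed instruments append to the buffer on every traced call.
        if self.instruments_installed:
            self.buffer.append(event)

    def flush(self):
        # Only a running agent reports and purges what was recorded.
        if self.monitor:
            self.buffer.clear()


def simulate(respect_disabled, requests=1000):
    """Return how many events remain buffered on a disabled agent."""
    agent = Agent(monitor=False)
    agent.install_instruments(respect_disabled)
    for i in range(requests):
        agent.record(f"request {i}")
        agent.flush()
    return len(agent.buffer)


print(simulate(respect_disabled=False))  # 1000: the buffer is never purged
print(simulate(respect_disabled=True))   # 0: nothing installed, nothing recorded
```

In the leaky case the buffer size tracks the request count, which matches the slow, steady growth described in the release note.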

Overcoming The Black Box Problem With Machine Learning in IT Operations

Chronically understaffed and constantly stressed-out IT Ops and NOC teams are overwhelmed by today’s IT noise. Artificial Intelligence (AI) and Machine Learning (ML) can help these teams because ML (and AI) are exceptionally good at processing enormous volumes of very complex data in real-time, or near real-time, and surfacing actionable insights. But ML successes in IT Ops are still hit-or-miss.

How to Enhance Your ServiceNow Investment with Nexthink

In describing why Nexthink is a critical partner in their value-offering Morten Grønnebæk, Chief Commercial Officer at BusinessNow, said “What we were experiencing as a consultancy company was that SLAs might be green out there, but your customers or end-users’ satisfaction was often red.” Indeed, prior to Nexthink, BusinessNow was increasingly faced with a major issue: clients’ IT departments were blind to IT issues at the end-user level, although data centers seemed oper

451 Research: Gain Intelligence Through SaaS-Based Monitoring and Machine Learning

The ability to analyze data across customers in order to inform their offerings is emerging as a potentially significant differentiator between monitoring vendors that have SaaS deployment offerings and those that don't. While some vendors pursue this opportunity, others are waiting on the sidelines, uncertain about privacy implications.