Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Is the role of ITSM only limited to IT? Understanding Enterprise Service Management

Enterprises are facing new challenges not just in terms of staying relevant in the market and customers but also in controlling internal organizational chaos. Over the years, there have been a number of frameworks and models that were consistently being rolled out to assist enterprises to declutter operational and organizational challenges, streamline enterprise services and service delivery, and identify loopholes and fix them.

Fault Monitoring vs. Performance Monitoring: What's The Difference? | Obkio

Fault Monitoring vs. Network Monitoring: What are the differences and when do you need either solution? Where do we start when users or employees complain about poor network performance? And what tools are available to help? Check out our video to learn about the differences between Fault Monitoring and Networking Monitoring and what may be the right solution for your needs, in under 2 minutes. Every IT professional knows that users typically complain about two things: Something doesn’t work. Something is slow slow.

6 Tips for Improving Drupal Performance

Providing a top digital user experience is critical if you want to grow the number of visitors and keep them engaged. Anything you can do to improve your Drupal website will have an impact on your business and ultimately revenue. Luckily, there’s much you can do to optimize your Drupal website, including implementing a range of tools and services, and installing extra Drupal modules. In this article, we consider how to improve Drupal site performance.

Best Log Management Tools in 2020, and How to Select One for Your Organization

In modern digital environments, logs are present everywhere. From networking devices, servers, and databases, to operating systems, cloud-based services, and applications, every component produces some form of digital records of events. These records or logs provide an audit trail for Security Information Event Management (SIEM) and help in performance monitoring of servers and applications.

HoneyByte: Incremental Instrumentation Beyond the Beeline

“It turns out,” said Liz, “it was not a giant pile of work to start adding those rich instrumentation spans as you need them.” Liz Fong-Jones was telling dev.to’s Molly Struve about an error she encountered while trying to update her dev.to profile. When she entered honeycomb.io into the Employer URL field, the app responded with an angry red box...

Debugging in production with Stackdriver Debugger - Stack Doctor

Did you know you can debug your code while it’s still in production? In this video, Yuri Grinshteyn speaks about the Stackdriver Debugger, and how you can use it with Node.js. More importantly, he talks about the two ways in which this tool can debug by creating snapshots, or logging in real-time. Product: Google Cloud Operation Suite; fullname: Yuri Grinshteyn;

The Uptime.com Report for 2019

Unplanned downtime can drive significant losses in the form of unrealized revenue. Teams may be caught off guard, or may face an outage outside their control, extending downtime hours unnecessarily. Without automated monitoring and alerting, teams face undetected outages that silently threaten SLA fulfillment. The recommendations in this report are best used as a guide on what trends may drive Site Reliability Engineering in the near term.

Webinar: Serverless At Scale: the Present and Future of Modern Cloud Architectures

In this webinar on 16 April 2020 we covered the following topics:

  • The main challenges of scaling modern cloud applications
  • Implementing well-architected best practices
  • Battle-tested architectural patterns
  • How to improve resilience and scalability