Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

The Hidden Costs of Website Outages and How Uptime.com Has Your Back

Businesses lose potential revenue, trust, and brand reputation every moment their website is down, and some of those losses can never be recovered. Website outages sting whether you’re a blossoming startup or a seasoned enterprise. How often do they happen, and what’s the actual cost? That is exactly what we will explore together today!

Evolution at the Core: LogicMonitor's Transformative AI Empowers the Future

New Integrations Provide Superior Digital Developer and User Experiences

Our most recent release of Dexda demonstrates our commitment to harnessing the transformational power of Artificial Intelligence (AI) for hybrid observability. Using a variety of advanced machine learning and AI techniques, Dexda dramatically reduces alert fatigue. Our use of intelligence and automation continues to change the way IT teams work.

AI is not intelligence: Bill Kennedy - The Reliability Podcast

The Reliability podcast aims to speak with engineers who have worked on large, complex systems and to glean lessons from their experience. What best practices should one adopt? What are the non-negotiable lessons for getting better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these questions and more by tracing the personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.

Predictive Network Technology in 2024

IT networks generate large volumes of information in the form of security, network, system, and application logs. The volume and variety of log data make traditional network monitoring capabilities ineffective, especially for monitoring use cases that require proactive decision making. These decisions are based on things like: All of this makes large-scale and complex enterprise IT networks a suitable use case for advanced AI and machine learning capabilities.

Bugs in NASA's codebase: Bill Kennedy - The Reliability Podcast

The Reliability podcast aims to speak with engineers who have worked on large, complex systems and to glean lessons from their experience. What best practices should one adopt? What are the non-negotiable lessons for getting better at a craft? What’s ‘engineering’ going to be like with the advent of AI? We answer these questions and more by tracing the personal journeys of engineers who have built stellar careers around decoding the innumerable intricacies of software engineering.

Prometheus Dashboards

Prometheus is a popular open-source monitoring and alerting toolkit originally built in 2012. Its main focus is to give insight into system performance by letting users monitor selected metrics of that system. Prometheus can display these metrics as graphs so that users can see their system’s performance at a glance.

Tutorial: Monitoring MySQL Server Performance with Prometheus and sql_exporter

Databases, in one form or another, are an almost inseparable part of modern applications. MySQL, the focus of this article, is one of the most popular. But how do you monitor MySQL? This article gives an introduction to the topic.

Monitor Azure Resource Events with LogicMonitor Logs

The integration of Azure’s event-driven model with LogicMonitor’s monitoring capabilities offers businesses a robust solution for real-time IT infrastructure monitoring. LogicMonitor’s cloud-based platform provides a comprehensive overview of an organization’s IT infrastructure, both in the cloud and on-premises.