Operations | Monitoring | ITSM | DevOps | Cloud

Monitoring

The latest News and Information on Monitoring for Websites, Applications, APIs, Infrastructure, and other technologies.

Reproducing and testing distributed system failures with xk6-disruptor

Distributed systems, such as modern microservices-based applications, are highly scalable, but also highly complex. Dependencies and unexpected interactions between services are a common cause of incidents, and these incidents are also notoriously hard to test for. xk6-disruptor — an extension that adds fault injection capabilities to Grafana k6, the open source reliability and load testing tool — can help overcome these challenges.

Microsoft Teams Issues: How to Pinpoint & Highlight Performance Problems

See the connection between your car's warnings and IT alerts? Just as your car warns you of problems, monitoring tools do the same for your business. But what about services you don't own, like Microsoft Teams? Discover how Vantage DX transforms Teams calls and meetings into monitoring insights, helping IT teams automatically detect and prevent issues, boosting user satisfaction and productivity.

Home Assistant Hardware: Requirements and Recommendations

With the smart home revolution in full swing, choosing the proper hardware for platforms like Home Assistant can be overwhelming. Whether you’re new to home automation or a seasoned pro, the hardware you select can make or break your experience. But fear not! This comprehensive guide will demystify the requirements, delve into the various options, and help you make an informed decision. From the compact Raspberry Pi to the powerful Intel NUC, we’ve got you covered.

Grafana and Graphite Best Practices

Efficient monitoring and visualization of performance metrics are paramount for ensuring seamless user experiences and reliable system operations. Grafana and Graphite, two powerful open-source tools, form an unbeatable combination when it comes to monitoring and analyzing time-series data. Grafana provides a robust and flexible platform for visualizing data, while Graphite acts as a scalable and efficient backend for storing and retrieving metric data.

Industry Cloud Platforms, Explained

Cloud computing changed the way enterprise IT works. Investments in public technologies are forecasted to grow by 21.7% to reach the $600 billion mark by the end of this year. The trend is driven by two major factors: Business organizations view these capabilities as an imperative for digital transformation — especially the domain-specific IT services that solve problems unique to their industry verticals.

Maturity Models for IT & Technology

Setting meaningful goals for your technology investment decisions requires an understanding of your requirements. Primarily, that’s… Measuring your IT maturity is one way to advance your IT performance — in a way that aligns with your organizational goals and minimizes the risk of failure. You can compare your current situation to a group of peers or competitors and also to industry benchmarks. Let’s take a look.

Partner Watch: CI/CD Build Systems for Embedded Development

To excel in embedded development in 2023, it is essential to have a solid understanding of build systems, continuous integration, and deployment strategies. This workshop by Percepio training partner Jacob Beningo aims to provide a comprehensive primer on these practices, equipping participants with the knowledge and skills necessary to tackle complex firmware projects with confidence.

Network Utilization Monitoring: How to Check Network Usage

Whether you're a small startup or a large enterprise, the ability to access and share information quickly and reliably is essential for productivity, customer satisfaction, and overall business performance. However, as networks become increasingly complex and interconnected, it's crucial for businesses to gain deep insights into their network utilization.

What is Prometheus Alertmanager?

Prometheus Alertmanager is a powerful tool designed to handle various alerts generated by Prometheus. It plays a vital role in the overall monitoring ecosystem, acting as a centralized hub for managing alert notifications. With Prometheus Alertmanager and its robust notification management capabilities, you can efficiently define alert routing and notification policies. This empowers you to take timely actions and mitigate potential issues before they impact your service availability.