
Co-Managed Services: A Powerful Way to Bridge the IT Talent Gap

In today’s tech world, finding skilled IT pros is like finding a needle in a haystack. This shortage of IT talent can make it tough for businesses to keep their IT systems running smoothly. That’s where managed service providers (MSPs) come in with their co-managed services. For many organizations, these services are like having a team of IT experts on speed dial, ready to help whenever they are needed.

How to make your services zone redundant

In January 2020, an entire availability zone (AZ) in AWS’s Sydney region suddenly went dark. Multiple facilities lost power, preventing customers from accessing EC2 instances and Elastic Block Store (EBS) volumes. Customers who didn’t have backup infrastructure in another zone had to wait nearly 8 hours before service was restored, and even then, some EBS volumes couldn’t be recovered. Major cloud provider outages are rare, but they happen nonetheless.

The New AWS Public Monitoring Agent: AWS Canada West

Exciting news! We're stoked to share that Obkio’s new AWS - Canada West Public Monitoring Agent is now live! Just a month after AWS announced its first-ever data center in Western Canada (and second in Canada), Obkio has launched a brand new Monitoring Agent, allowing current and future customers to monitor network performance from their network locations up to the AWS infrastructure in Calgary.

Monitor Windows Performance Counters with Datadog

The Windows operating system exposes metrics such as CPU, memory, and disk usage as built-in performance counters, which provide a unified way to observe performance, state, and other high-level facets of Windows subsystems, components, and native or third-party applications. As such, Windows Performance Counters can be invaluable for monitoring resource usage and the health of your infrastructure, as well as the systems your services rely on.

Top 8 IT Department Issues in 2024

From the proliferation of artificial intelligence and machine learning to an unpredictable economic and geopolitical landscape, 2023 was an eventful year. The top IT department issues have become even more diverse and complex. In 2024, businesses will need to address IT infrastructure and disaster recovery, cybersecurity threats, data privacy policies, and the effects of AI and automation on their industries to stay ahead of the curve.

The Key Role of Cloud Observability in Ensuring Security

Using cloud-based technologies to optimize and streamline business operations is far from a novel idea. In fact, research suggests at least 90% of modern organizations currently use cloud platforms and related technologies to oversee essential processes.

Understanding Linux File System: A Comprehensive Guide to Common Directories

Welcome to an in-depth exploration of the Linux file system! In this comprehensive guide, we'll demystify the various directories found in a typical Linux distribution, explaining their purposes and functionalities. Whether you're a seasoned sysadmin or a curious newcomer, this article will enhance your understanding of the backbone of Linux's structure and operation.

SRE Metrics: Availability

Understanding SRE metrics and how they impact your platform's availability is fundamental to Site Reliability Engineering. How available is your website, service, or platform? What must you monitor and measure to ensure availability? How do you translate uptime into availability? This chart has numbers that every Site Reliability Engineer (SRE) should know.
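To make the uptime-to-availability translation concrete, here is a minimal Python sketch. It is an illustration rather than the article's own method: the function names and the period lengths (a 30-day month, a 365-day year) are assumptions chosen for this example. Under those assumptions, a 99.9% target allows roughly 8.76 hours of downtime per year.

```python
# Period lengths in seconds. The 30-day month and 365-day year are
# simplifying assumptions made for this sketch.
PERIOD_SECONDS = {
    "day": 24 * 3600,
    "week": 7 * 24 * 3600,
    "month": 30 * 24 * 3600,
    "year": 365 * 24 * 3600,
}


def allowed_downtime_seconds(availability_percent: float, period_seconds: float) -> float:
    """Return the downtime budget for an availability target over a period."""
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability_percent must be between 0 and 100")
    return period_seconds * (1 - availability_percent / 100)


def measured_availability_percent(uptime_seconds: float, total_seconds: float) -> float:
    """Return availability as the percentage of the period the service was up."""
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    return 100 * uptime_seconds / total_seconds


if __name__ == "__main__":
    # Print the yearly downtime budget, in hours, for a few common targets.
    for target in (99.0, 99.9, 99.99, 99.999):
        budget = allowed_downtime_seconds(target, PERIOD_SECONDS["year"])
        print(f"{target}% -> {budget / 3600:.2f} hours of downtime per year")
```

The same two functions work for any window: pass `PERIOD_SECONDS["month"]` to get a monthly budget, or feed measured uptime into `measured_availability_percent` to check whether a target was met.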

Beyond deployment: The ongoing challenges in application performance monitoring implementation

In the age of digital acceleration, application performance monitoring (APM) acts as a sentinel, empowering organizations to maintain, analyze, and optimize the health of their digital ecosystems. However, as organizations navigate the intricacies of distributed architectures, hybrid cloud deployments, and dynamic workloads, they confront a complex terrain marked by data proliferation, siloed environments, and a scarcity of skilled personnel.