The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Ubuntu 23.10 Mantic Minotaur Mascot Animation

Just as the prophecy foretold, the Ubuntu 23.10 Mantic Minotaur has been revealed! Few creatures from mythology have captured the imagination of mankind quite like the legendary Minotaur. The word Mantic is used to describe one’s ability to prophesy or reach into the future. Our lead designer, Marcus Haslam, pulled inspiration from the ancient past when creating our iconic Minotaur Mascot and new official wallpapers.

Prometheus vs. Datadog

Before we take a deep dive into what Prometheus and Datadog are, let's look at the key comparison points. Both Prometheus and Datadog are monitoring tools, but Prometheus is open source and Datadog is proprietary. Prometheus is the de facto tool for time-series monitoring on Kubernetes, and Datadog is an all-around APM, logs, time-series, and tracing tool.

How To Monitor AWS EC2 With MetricFire

AWS EC2 (Elastic Compute Cloud) has revolutionized the way businesses operate in the cloud. With its scalable and flexible infrastructure, EC2 allows organizations to easily deploy virtual servers and manage their computing resources efficiently. However, as your EC2 environment grows, monitoring becomes crucial to ensure optimal performance, security, and cost optimization. One powerful solution for monitoring AWS EC2 is Hosted Graphite by MetricFire, a comprehensive graphing and monitoring service.

IT Operations: Making IT Work Better with IT Ops Automation

IT ops is often so focused on solving problems that it can be easy to miss how IT ops contributes to an organization’s overall success, even if you’ve been in IT for a long time. When things get tossed over the wall from development to ops, anyone not in IT ops starts to ask, “What is IT ops?” and “What does IT operations do?” So let’s set the record straight.

How to ensure your Kubernetes Pods have enough CPU

Gremlin's Detected Risks feature immediately detects any high-priority reliability concerns in your environment. These can include misconfigurations, bad default values, or reliability anti-patterns. A common risk is deploying Pods without setting a CPU request. While it may seem like a low-impact, low-severity issue, not using CPU requests can have a big impact, including preventing your Pod from running.
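As a rough illustration of the risk described above, the sketch below scans a Pod manifest (represented as a plain Python dictionary) and lists containers that declare no CPU request. This is not how Gremlin's Detected Risks feature works internally; the function name `containers_missing_cpu_request` and the sample manifest are hypothetical, and only the `spec.containers[].resources.requests.cpu` path comes from the standard Kubernetes Pod schema. The check is deliberately simple: it ignores init containers and any defaults the cluster might apply (for example from a LimitRange or from a CPU limit).

```python
def containers_missing_cpu_request(pod_manifest):
    """Return the names of containers that do not set resources.requests.cpu.

    Hypothetical helper for illustration only: it inspects the manifest as
    written and does not account for defaults applied by the cluster.
    """
    containers = pod_manifest.get("spec", {}).get("containers", [])
    missing = []
    for container in containers:
        # "resources" and "requests" may be absent or explicitly empty.
        requests = (container.get("resources") or {}).get("requests") or {}
        if "cpu" not in requests:
            missing.append(container.get("name", "<unnamed>"))
    return missing


# Hypothetical manifest: "web" sets a CPU request, "sidecar" does not.
example_pod = {
    "apiVersion": "v1",
    "kind": "Pod",
    "metadata": {"name": "demo"},
    "spec": {
        "containers": [
            {
                "name": "web",
                "image": "nginx",
                "resources": {"requests": {"cpu": "250m", "memory": "64Mi"}},
            },
            {
                "name": "sidecar",
                "image": "busybox",
                "resources": {"requests": {"memory": "32Mi"}},
            },
        ]
    },
}

missing = containers_missing_cpu_request(example_pod)
print(missing)
```

A check like this could run in a CI step against rendered manifests, so that Pods without CPU requests are flagged before they are deployed.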

Azure: The Ultimate Guide to Microsoft's Cloud Computing Platform

Cloud computing has revolutionized the way businesses operate and manage their data. With the vast amounts of information being generated daily, traditional on-premises infrastructure struggles to keep up with the demands of scalability, security, and cost-effectiveness. This is where Azure, Microsoft's cloud computing platform, comes into play. Azure provides a comprehensive set of tools and services that enable organizations to build, deploy, and manage applications and services on a global scale.

How to Monitor Multi-layer Huawei Switch with MetricFire

Monitoring your network infrastructure plays a pivotal role in identifying potential bottlenecks, optimizing performance, and ensuring seamless operations. By implementing a comprehensive monitoring solution like MetricFire, you gain access to a wide range of features and functionalities designed to simplify the process of monitoring and managing your Huawei switches.

Introduction to MLFlow

MLFlow is an open source platform used for managing machine learning workflows. It is a crucial component of the open source MLOps ecosystem, having passed 10 million monthly downloads at the end of 2022. It has four main components that cover experiment tracking, model registry, model deployment and code packaging. Join our webinar to learn more about MLFlow. During this webinar, Andreea Munteanu will discuss MLFlow and Charmed MLFlow, Canonical’s distribution of the open source platform.