Next-Gen Supply Chains Powered by Machine Learning

Machine learning is radically transforming how supply chains operate, pushing them towards unprecedented efficiency and responsiveness. This technology, powered by vast streams of data and sophisticated algorithms, is enabling businesses to anticipate needs, optimize operations, and adapt more swiftly to market changes. These advancements allow consumer packaged goods companies to enhance accuracy and efficiency, drive significant cost reductions, and better align themselves with consumer expectations.

FrogML SDK: the Gateway to Model Governance

Data-driven decisions are critical. To support high-stakes decision-making – from fraud detection in credit card transactions to demand forecasting in retail – organizations are increasingly relying on complex models. According to McKinsey, 78% of organizations report using AI in at least one business function, highlighting just how embedded AI and ML models have become in operational and strategic decision-making.

Amazon SageMaker Pricing Guide: 2025 Costs (And Savings)

Amazon SageMaker makes it easy to prepare data for machine learning (ML) and then train, deploy, and modify ML models. SageMaker is a fully managed service that automates much of the ML lifecycle. So, if you want a single partner to help you through all stages of your Artificial Intelligence (AI) lifecycle, SageMaker might be the answer. Perhaps more important for this post is the promise that Amazon SageMaker can reduce your machine learning model costs. But does SageMaker pricing reflect this?

Optimizing Legacy ML Systems with Real-World DevOps Practices

We chose to feature this article because it reflects exactly what OpsMatters stands for: practitioners solving real problems with practical DevOps thinking. When we came across Ashish's detailed breakdown of his experience modernizing a complex ML environment, it stood out for its clarity and actionable insights. We reached out to him to learn more about the work behind this case study, and with his permission, we are sharing it here so the broader community can benefit from these lessons in observability, cost optimization, and real-world DevOps execution.

Canonical announces Charmed Feast: A production-grade feature store for your open source MLOps stack

July 10, 2025: Today, Canonical announced the release of Charmed Feast, an enterprise solution for feature management that integrates seamlessly with Charmed Kubeflow, Canonical’s distribution of the popular open source MLOps platform. Charmed Feast provides the full breadth of the upstream Feast capabilities and adds multi-cloud capabilities and comprehensive support.

Automating machine learning security checks using CI/CD

Machine learning (ML) pipelines are increasingly being treated like software: built, tested, deployed, and monitored using automated tooling. But while infrastructure as code and microservices have matured with security best practices, ML systems often lag behind. The truth is, your ML pipeline is part of your software supply chain, and it is vulnerable.

Forecasting with InfluxDB 3 and HuggingFace

Machine learning models must do more than make accurate predictions; they also need to adapt as the world around them changes. In real-world systems, data distributions shift due to seasonality, equipment wear, user behavior changes, or other external forces. If your models can’t keep up, the result is poor predictions. This can lead to outages, inefficiencies, or missed opportunities. That’s why forecasting systems need to be monitored and resilient, not just accurate.

Best VPS for Machine Learning

When it comes to machine learning, choosing the right Virtual Private Server (VPS) can significantly impact your project's performance, scalability, and cost-effectiveness. A VPS provides dedicated resources that can handle the intensive computational tasks associated with training machine learning models. Here, we explore the best cheap GPU VPS options for machine learning, highlighting the importance of GPUs and introducing Cloudzy as a leading choice.

Announcing Charmed Kubeflow 1.10

We are thrilled to announce the release of Charmed Kubeflow 1.10, Canonical’s latest update to the widely adopted open source MLOps platform. This release integrates significant improvements from the upstream Kubeflow 1.10 project, while also bringing a suite of additional capabilities targeted towards enterprise deployments. Charmed Kubeflow 1.10 empowers machine learning practitioners and teams to operationalize machine learning workflows more efficiently, securely, and seamlessly than ever.

Machine learning vs AI: Key differences and how they work together

Machine learning (ML) and artificial intelligence (AI) are often used interchangeably in tech discussions, yet they represent distinct concepts with important differences. While AI refers to the broader field of creating machines capable of intelligent behavior that mimics human capabilities, machine learning is a specific subset of AI focused on developing algorithms that allow computers to learn from and make predictions based on data.