Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

ML for software engineers ft. Gideon Mendels of Comet ML

In this episode, Rob explores the fascinating crossroads of machine learning and software engineering with Gideon Mendels, the co-founder and CEO of Comet ML. Gideon navigates the often ambiguous world of training ML models, focusing on building a common language between software engineers and data science teams. Gain valuable insights into fostering mutual understanding between these two disciplines and aligning the possibilities of ML with organizational needs in this thought-provoking episode.

How to automate Snyk container scanning of your production environments

If you’re using containers to deploy your software, it is important to be aware of potential vulnerabilities within your container images. These may be introduced through dependencies in your built image, or perhaps through dependencies within the base image(s) used to build your image.

Orchestration as a Data Management Challenge-Part 2

In Part 1 of this blog series, I proposed that closed-loop orchestration (CLO) is a data management problem. Data extends beyond your medical history, social media, and other PII in the digital world. Configurations, inventories, and monitoring systems are based on managing, interpreting, and actioning data. CLO is a workflow about manipulating the data around the target environments. Even the policies and constraints need to be expressed as data.

Optimize your MLOps pipelines with inbound webhooks

In a traditional DevOps implementation, you automate the build, test, release, and deploy process by setting up a CI/CD workflow that runs whenever a change is committed to a code repository. This approach is also useful in MLOps: If you make changes to your machine learning logic in your code, it can trigger your workflow. But what about changes that happen outside of your code repository?

NVIDIA H200 Power Requirements: Can Your Racks Support Them?

This week, NVIDIA unveiled what they are calling “the world’s most powerful GPU for supercharging AI and HPC workloads,” the H200 Tensor Core GPU. There is much hype around the H200 as it is the first GPU with HBM3e. The larger and faster memory will further enable generative AI, large language models, and advance scientific computing for HPC workloads. Read the NVIDIA press release.

Goodbye, GitOps: Getting to green in an AI-powered world

The cognitive bias known as the streetlight effect describes our desire as humans to look for clues where it’s easiest to search, regardless of whether that’s where the answers are. For decades in the software industry, we’ve focused on testing our applications under the reassuring streetlight of GitOps. It made sense in theory: wait for changes to the codebase made by engineers, then trigger a re-test of your code. If your tests pass, you’re good to go.

How To Create a DevOps Implementation Roadmap

You’ve done your research and decided to use the DevOps approach for your software development process and IT operations. However, before you start tossing around terms like “continuous integration” and “containerization,” there’s an important starting point on your DevOps journey — creating a DevOps implementation roadmap.