Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

A network admin's guide to network diagram software

For organizations growing larger by the day, network management becomes increasingly complex, and scaling to meet this growth can be a major headache. To battle such complexity without a graphic representation of a network is a tiresome task, which is where network diagram software comes in. Network diagram software allows a network admin to portray the network clearly and legibly through detailed graphics.

5 ways incidents made me a better engineer

Incidents are a great opportunity to gather both context and skill. They take people out of their day-to-day roles, and force ephemeral teams to solve unexpected and challenging problems. In my career, I've found incidents can be a great accelerator - for both myself and others around me. It was after leading my first incident at GoCardless that I started to feel really comfortable in the codebase and the team.

A Simple Guide to Taming the Beast That Is Kubernetes

Containers are amazing. But when you start to orchestrate them in a complex environment, they can become quite the beast. Kubernetes is one of the best tools to tame that beast, but few resources exist to help you manage your big data workloads on Kubernetes. If you want to learn how you can optimize your big data workloads on Kubernetes, this is for you.

TensorFlow Python Code Injection: More eval() Woes

JFrog security research team (formerly Vdoo) has recently disclosed a code injection issue in one of the utilities shipped with Tensorflow, a popular Machine Learning platform that’s widely used in the industry. The issue has been assigned to CVE-2021-41228. This disclosure is hot on the heels of our previous, similar disclosure in Yamale which you can read about in our previous blog post.

Terraform and Shipa 101 - Your First Terraform and Shipa Cloud Integration

Leveraging Terraform, which is an infrastructure-as-code platform, is a great match. Using both technologies together is becoming more mature and there have been some great pieces around the art of the possible between the two platforms. Though if you are unfamiliar with both, this guide will get you up and started with both Terraform and Shipa together. In this example will be using Terraform to create all of the necessary Shipa resources to deploy to a Kubernetes cluster.

SRE Principles: The 7 Fundamental Rules

In one of our previous articles, we discussed what an SRE is, what they do, and some of the common responsibilities that a typical SRE may have, like supporting operations, dealing with trouble tickets and incident response, and general system monitoring and observability. In this article, we will take a deeper dive into the various SRE principles and guidelines that a site reliability engineer practices in their role.

DevOps State of Mind Podcast Episode 1: Trust, tooling, and a no-blame culture with LogDNA

Tucker Callaway is the CEO of LogDNA. He has more than 20 years of experience in enterprise software with an emphasis on developer and DevOps tools. Tucker fosters a DevOps culture at LogDNA by tying technical projects to business outcomes, practicing extreme transparency, and empowering every person in the company to contribute.