The latest News and Information on DevOps, CI/CD, Automation and related technologies.

What Is AWS Anomaly Detection? (And Is There A Better Option?)

To explain how cost anomaly detection works in AWS, let’s first look at an analogy. Imagine you strategically propagate, cultivate, harvest, and replenish trees for a thriving forest products company. In keeping with your eco-friendly policy, you only harvest trees with straight, tall trunks and leave irregularly shaped trees alone. Your company doesn’t harvest trees it won’t use.

Site Reliability Engineer (SRE) Roles and Responsibilities

Software development is getting faster and more complex, frustrating IT operations teams more than ever. DevOps gained popularity as a way to combat siloed workflows, decreased collaboration, and a lack of visibility. While establishing a culture of DevOps has helped teams collaborate better and deliver reliable software faster, DevOps teams don’t necessarily have someone specifically dedicated to developing systems that increase site reliability and performance.

Rollbar Pro Tips: Manage Rollbar automatically through the Rollbar Terraform Provider

Terraform is a multi-cloud provisioning product used to create, manage, and update infrastructure resources. The Rollbar Terraform Provider automates the creation, modification, and removal of resources within your Rollbar account, such as projects, users, and teams. Rollbar is the leading continuous code improvement platform that proactively discovers, predicts, and remediates errors with real-time AI-assisted workflows. With Rollbar, developers continually improve their code and constantly innovate rather than spending time monitoring, investigating, and debugging.

Scaling HashiCorp's Cloud Platform - Dash 2021 (HashiCorp)

Identifying bottlenecks during times of high load is critical to building a scalable software platform. Stress testing is one way to simulate high load on a system, and it lets you proactively catch potential bottlenecks before they impact customers. Once a solution is implemented to address a bottleneck, you need a way to measure success and find the new limit. See how HashiCorp Cloud Platform (HCP) developed a stress testing framework that relies heavily on Datadog’s custom metric capabilities, combined with some out-of-the-box integrations, to give HCP engineers a comprehensive view of their platform, and how they used these insights to scale their concurrent data-plane provisioning by 300%.

How to do serverless monitoring right #shorts

Monitoring CPU load and memory usage is common practice, but with serverless no action is required. In this video, we quickly explain that if your Cloud Run instances start hitting high CPU load, Google Cloud will automatically spin up new instances for you, and vice versa!

Config best practices: dependency caching

Let’s face it: Creating the optimal CI/CD workflow is not always a simple task. In fact, writing effective and efficient configuration code is the biggest hurdle that many developers face in their DevOps journey. But you don’t need to be an expert to set up a fast, reliable testing and deployment infrastructure. With a few straightforward techniques, you can optimize your config.yml file and unleash the full potential of your CI/CD pipelines.
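As a rough illustration of the idea behind dependency caching, the sketch below derives a cache key from a checksum of a lockfile, so the cached dependencies are reused until the lockfile changes. It is a plain Python sketch, not CI configuration; the names `cache_key` and `restore_or_install`, the `deps-v1` prefix, and the in-memory `cache` dictionary are all hypothetical and chosen only for this example.

```python
import hashlib
from pathlib import Path
from typing import Callable, Dict


def cache_key(lockfile: Path, prefix: str = "deps-v1") -> str:
    """Return a key that changes only when the lockfile content changes."""
    digest = hashlib.sha256(lockfile.read_bytes()).hexdigest()
    return f"{prefix}-{digest[:16]}"


def restore_or_install(
    lockfile: Path,
    cache: Dict[str, str],
    install: Callable[[], str],
) -> str:
    """Reuse cached dependencies on a key match, otherwise install and save.

    `cache` stands in for a CI cache store and `install` for the real
    dependency installation step (both are assumptions for this sketch).
    """
    key = cache_key(lockfile)
    if key in cache:
        return cache[key]  # cache hit: skip the slow install step
    cache[key] = install()  # cache miss: install once, then store the result
    return cache[key]
```

Bumping the prefix (for example from `deps-v1` to `deps-v2`) is a simple way to invalidate every existing cache entry at once in this sketch.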

Various policy engines for Kubernetes policies - Saiyam Pathak

Kubernetes configurations are complex to manage across developers and operators. External tools like Helm and Kustomize cannot enforce environment-specific configurations, but admission controllers provide a way to do this. Various tools have evolved over time to help solve this problem: OPA Gatekeeper, Kyverno, Kubewarden, and jsPolicy. In this talk from ContainerDays 2021, Saiyam Pathak from Civo goes through the need for a policy engine and discusses how each of the tools helps, the differences between them, and where they are headed.
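To make the general idea of a policy check at admission time concrete, here is a minimal Python sketch that evaluates a manifest (as a dictionary) against a list of rules and rejects it if any rule is violated. It does not reflect how OPA Gatekeeper, Kyverno, Kubewarden, or jsPolicy are implemented or configured; the functions `require_labels`, `disallow_latest_tag`, and `admit` are hypothetical names used only for illustration.

```python
from typing import Callable, List, Tuple

# A policy takes a manifest and returns a list of violation messages.
Policy = Callable[[dict], List[str]]


def require_labels(*names: str) -> Policy:
    """Build a policy that requires the given metadata labels to be present."""
    def check(manifest: dict) -> List[str]:
        labels = manifest.get("metadata", {}).get("labels", {})
        return [f"missing label: {n}" for n in names if n not in labels]
    return check


def disallow_latest_tag(manifest: dict) -> List[str]:
    """Flag containers whose image is pinned to the mutable :latest tag."""
    containers = manifest.get("spec", {}).get("containers", [])
    return [
        f"container {c.get('name')} uses the :latest tag"
        for c in containers
        if c.get("image", "").endswith(":latest")
    ]


def admit(manifest: dict, policies: List[Policy]) -> Tuple[bool, List[str]]:
    """Run every policy; admit the manifest only if none reports a violation."""
    violations = [msg for policy in policies for msg in policy(manifest)]
    return (not violations, violations)
```

Calling `admit(pod, [require_labels("app", "team"), disallow_latest_tag])` returns a pair of an allow/deny flag and the collected violation messages, which mirrors the accept-or-reject decision an admission controller makes.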

A CTO's View: Driving Continuous Alignment with Mattermost 6.0

The past few weeks have marked a real milestone for the Mattermost community. My co-founder and longtime colleague, Ian, shared his reflections on our huge v6.0 launch, and I echo his take on the magnitude of the launch and our new product capabilities. As CTO at Mattermost, I have the unique pleasure of leading product development efforts for an open source platform backed by an inspiring community of contributors and enthusiasts.

ChatOps and Mobile Adoption: The Power of Teams Working Where They Are

The way we socialize, learn, shop, and receive care has changed drastically over the last 18 months. For many of us, perhaps one of the most drastic changes was the way we work. While work from home (WFH) was an option before the pandemic, NCCI states, “only 6% of the employed worked primarily from home and about three-quarters of workers had never worked from home.” Fast forward to 2021, and according to NorthOne, here’s how much things have changed.