Operations | Monitoring | ITSM | DevOps | Cloud

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Gremlin User Newsletter: AWS App2Container, an update to the WAF, and what's new in Gremlin

As systems become increasingly complex, we’ve seen the growth of engineering tools to abstract away and manage the complexity. But often our tools are “opinionated” and the default actions or settings may not align with how our systems are intended to work or how we think they work. Chaos Engineering is a good way to not only test your applications, but also the tools you use to build them.

Kubernetes Log Management: The Basics

Log messages help us to understand data flow through applications, as well as spot when and where errors are occurring. There are a lot of resources for how to store and view logs for applications running on traditional services, but Kubernetes breaks the existing model by running many applications per server and abstracting away most of the maintenance for your applications. In this blog post, we focus on log management for applications running in Kubernetes by reviewing the following topics.

Prometheus Metric Federation with Thanos

Prometheus is a CNCF graduated project for monitoring and alerting. It is one of the most widely used monitoring and alerting tools in the Kubernetes ecosystem. Rancher users can leverage Prometheus quickly by using the built-in monitoring stack. Prometheus stores its metrics as a time series database on the local disk. Prometheus local storage is limited by the size of the disk and amount of metrics it can retain.

Why Netdata picked VerneMQ

In 2019, the Netdata team already knew that a Netdata Cloud solution in the form of an online platform would greatly complement Netdata’s distributed monitoring by making it much easier to organize large infrastructures and by enabling new ways for teams to collaborate. The old node registry available at the time wasn’t enough for Netdata’s users. Building an online platform, even one that does not directly process users’ metrics, is challenging.

How to Secure your WFH Environment

I am making a digital transformation during this novel work-from-home (WFH) era due to a COVID-19 quarantine. Many of you are going through the same and distractions abound while sharing a workspace with housemates, children, and pets. Moreover, we have to contend with an increased risk to cybersecurity, given recent attacks on work-related software such as Slack and Zoom.

Byte Down Too: Build Cost Effective Infrastructure Like Netflix

Think of orgs with lots of data and it’s impossible to not think of Netflix. In a new Netflix Technology Blog, titled "Byte Down: Making Netflix’s Data Infrastructure Cost-Effective", their Platform Data Science & Engineering team describe their data infrastructure "which is composed of dozens of data platforms, hundreds of data producers and consumers, and petabytes of data.” At this scale, cost-effectiveness is a critical matter of success and failure.

An effective hybrid cloud migration project can yield 145% ROI in three years, global study reveals

San Jose, CA, July 14, 2020 – Organizations that embrace hybrid cloud migration projects can yield a 145% return on investment within three years, a commissioned study conducted by Forrester Consulting on behalf of Virtana has revealed. The study, which reviewed customer projects completed in the last three years, found that a composite organization achieved benefits of $2 million by adopting planning and optimization platforms, such as as Virtana’s CloudWisdom and VirtualWisdom.

Using Observability as a Proxy for Customer Happiness

Today, users and customers are driven by response rates to their online requests. It’s no longer good enough to just have a request run to completion, it also has to fit within the perceived limits of “fast enough”. Yet, as we continue to build cloud-native applications with microservice architectures, driven by container orchestration like Kubernetes in public clouds, we need to understand the behavior of our system across all aspects, not just one.

Event Chaos or Enrichment? BigPanda's CTOs Can Help You Decide

In our recent “IT Ops Demystified – Event Chaos or Enrichment?” webinar our field CTOs discuss how enrichment can help reduce operational costs by an order of magnitude. Here is a quick overview of all the goodness that you’ll be watching.