Operations | Monitoring | ITSM | DevOps | Cloud

Integrate Datadog Compliance Monitoring with your AWS Well-Architected workloads

Many of our customers rely on the Amazon Web Services (AWS) Well-Architected Framework as a guide to build safe, secure, and performant applications in the cloud. AWS offers the Well-Architected Review (WAR) Tool as a centralized way to track and trend adherence to Well-Architected best practices. It allows users to define workloads and answer a set of questions regarding operational excellence, security, reliability, performance efficiency, and cost optimization.

Introducing MinIO Support in Sematext

Sematext Logs is a Log Management-as-a-service. Think of it as your own central location for logs in the cloud. If you prefer or need to keep logs in your own environment instead of shipping it to the cloud Sematext Enterprise, designed to run on your own infrastructure, makes that possible. You can collect logs from any part of your software stack or infrastructure, IoT devices, network hardware, and much more.

From SysAdmin to SRE: How to evolve your skillset

Are you wondering what it takes to become an SRE from a SysAdmin background? Our latest blog, covers the growth areas and technical skills needed to successfully transition to an SRE role. The last decade has seen widespread adoption of SRE practices based on the best practices laid out by Google. Many SysAdmins have observed this trend and are now evaluating becoming SREs. Which gives rise to the question how much of a skills overlap is there between an SRE and a SysAdmin?

CFEngine 2020 Retrospective

2020 is nearly over, and we’d like to take a couple of minutes to reflect on our year as well as provide a sneak peek into what you can expect from us in 2021. Although it has been a year full of distractions, the CFEngine team has continued to make significant strides when it comes to product improvements and new features that help our users.

How to monitor an Oracle database with Prometheus

In this article, we will explain how to monitor an Oracle Database with Prometheus using an exporter to generate metrics. Also, we will review the main metrics that you should monitor on resource usage and performance, and what to alert on to detect issues and incidents in your Oracle Database.

With the new AWS IoT SiteWise plugin, you can visualize and monitor your equipment data in Grafana dashboards

In collaboration with the AWS team, we have just launched another AWS integration, the AWS IoT SiteWise plugin for Grafana. AWS IoT SiteWise is a managed service that helps users collect, store, organize, and monitor data from industrial equipment, across facilities, at scale.

What the Google Outage Can Teach Us About Our Own Services

This week, the world stopped for a few hours as Google users experienced an outage on a massive scale. The outage affected ALL services which require Google account authentication. This includes the Google Cloud Platform (Cloud Console, Cloud Storage, BigQuery, etc.), Google Workspace (Gmail, Calendar, Docs, Drive, etc.) and Google Classroom. With the myriad of affected platforms, this particular outage was far from passing by unnoticed by users.

Manual steps in parallel groups available for Pipelines

Bitbucket Pipelines now allows steps with a manual trigger to be used in parallel groups, satisfying one of the highest voted feature requests. This feature provides more flexibility in Pipelines, allowing teams to configure pipelines with multiple options and then only run the steps they actually need to run, at the time they want. For example you can choose which environments should be deployed for individual developers, giving them different environments to test and do their work.

An Open Source Prometheus Tutorial for System and Docker Monitoring

Prometheus is an essential element in DevOps. It has one of the largest contributor communities in the open source world and is the favorite database for time-series metrics data. It is also part of Logz.io’s complete observability package, which integrates open source grafana, Jaeger, and ELK Stack into one platform. It’s a favorite for system and container monitoring. This tutorial will show you how Prometheus works with Docker and your local machine.

Automated, Simplified DNS Troubleshooting for Kubernetes: Only in Calico Enterprise

The Domain Name System (DNS) is a naming system for computers, services, or other resources connected to the Internet or a private network. DNS translates domain names to the numerical IP addresses needed for locating and identifying computer services and devices. For decades It’s been an essential component of the Internet. It’s an essential part of Kubernetes as well, and is used to determine how workloads connect to Kubernetes services as well as resources outside the cluster.