Operations | Monitoring | ITSM | DevOps | Cloud

Latest posts

What is DNS? Some basic concepts

What is DNS? DNS is the Domain Name System, or the hierarchical system of nomenclature that orders the names of members who connect to IP networks, such as the Internet. In this article we will briefly learn what DNS is, how it works, what it is used for and some of its advantages and disadvantages. What is DNS? Shall we begin?

Reduce Incident Downtime and Fix Outages Sooner with Policy-Based Alert Escalation Management

Proactive Incident Analysis, Diagnosis, and Resolution with Service-Centric AIOps. Alerts define the state of an infrastructure resource, application, or any other IP discoverable device. Organizations take action on alerts based on business impact and priority and ensure that IT service performance meets the required standards for availability, usability, and security.

Sysdig: Monitoring Modern Cloud-Native Workloads

If you're involved with developing applications, chances are you're being tasked with adapting to cloud-native infrastructure and being able to support cloud-native workloads with solutions like containers and Kubernetes. Join this discussion with Sysdig and IBM Cloud, who have recently built a partnership focused on improving monitoring and troubleshooting for your modern DevOps services.

Sysdig: 4 Best Practices to Meet Compliance Requirements in Kubernetes Environments

With the rise of technologies like Docker, Kubernetes, and underlying cloud infrastructures, enterprises are moving towards cloud-native architectures that leverage microservices to run their critical business applications. While this shift has tremendous business advantages, it also creates significant security and compliance challenges.

Key ECS metrics to monitor

Amazon Elastic Container Service (ECS) is an orchestration service for Docker containers running within the Amazon Web Services (AWS) cloud. You can declare the components of a container-based infrastructure, and ECS will deploy, maintain, and remove those components automatically. The resulting ECS cluster lends itself to a microservice architecture where containers are scaled and scheduled based on need.

Tools for ECS monitoring

In Part 1, we introduced a number of key metrics that you can use for ECS monitoring. Monitoring ECS involves paying attention to two levels of abstraction: the status of your services, tasks, and containers, as well as the resource use from the underlying compute and storage infrastructure, monitored per EC2 host or Docker container. In this post, we’ll survey some techniques you can use to monitor both levels of your ECS deployment.

Monitoring ECS with Datadog

As we explained in Part 1, it’s important to monitor task status and resource use at the level of ECS constructs like clusters and services, while also paying attention to what’s taking place within each host or container. In this post, we’ll show you how Datadog can help you: Automatically collect metrics from every layer of your ECS deployment, Track data from your ECS cluster, plus its hosts and running services in dashboards, and more.