DevOps

The latest News and Information on DevOps, CI/CD, Automation and related technologies.

Securing Access to Cloud Native Resources with Certificates - Civo Navigate NA 2023

In this talk, Alan Vailliencourt, a Senior Solutions Engineer with Teleport, discusses the importance of moving away from passwords and securing access to cloud-native resources using short-lived certificates. He highlights the risks associated with passwords and showcases the benefits of identity-native access, incorporating proof of presence, mutual authentication, and device security. The talk provides practical steps for adopting certificate-based authentication and improving security posture for Kubernetes, databases, and other cloud resources.

CD for machine learning: Deploy, monitor, retrain

While there are an increasing number of off-the-shelf machine learning (ML) solutions that promise to adapt to your specific requirements, organizations that are serious about investing in ML for the long term are building their own workflows tailored exactly to their data and the outcomes they expect. To make full use of this investment, ML models must be kept up to date and working from the freshest available data.

How we use trace-based alerts to reduce MTTR

On-call shifts are part of every developer’s job, and we’ve all been there. It’s 3 a.m. and suddenly you get an alert for an issue occurring in production. The microservices landscape is complicated, and finding the root cause of an issue is like looking for a needle in a haystack. How can you get to the root of what’s happening in the system so you can analyze and resolve the issue quickly and effectively?

Understanding Blameless Postmortems

Progress in organizations is often accompanied by unforeseen challenges and mishaps. Traditionally, these setbacks led to finger-pointing, which hindered progress and created a negative work atmosphere. However, a "Blameless Postmortems" approach transforms how organizations respond to failure. In this blog, we will delve into the importance of cultivating a blameless postmortem culture when faced with setbacks.

The broader approach on Azure monitoring

This episode of the Azure On Air podcast tackles the challenges of IT infrastructure monitoring and of transitioning from on-premises to the cloud. Pedro Sousa, Microsoft Azure MVP, advocates for a shift from traditional monitoring to a holistic observability approach, starting with an understanding of business needs and working down to infrastructure details. He also offers advice on migrating from on-premises to Azure, emphasizing the consistency of observability principles across environments.

Using Grafana and Graphite to monitor server load

Server outages can lead to lost customers, reputational damage, and other problems, so it is important to get timely information on server status. MetricFire's Hosted Grafana and Graphite will help you monitor server load in a timely and efficient manner. Servers generate a large number of metrics, and it is essential not only to track their values but also to observe how they change over time. You can also correlate app statistics with server load metrics.
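As a rough illustration of what tracking server load over time can look like, the sketch below formats the system load averages as lines in Graphite's plaintext protocol (`<metric path> <value> <timestamp>`) and sends them over TCP. It is not taken from the linked article: the metric prefix `servers.example`, the function names, and the host you pass in are hypothetical, and a hosted service may require a different endpoint or an account-specific prefix.

```python
import os
import socket
import time


def format_graphite_line(path: str, value: float, timestamp: int) -> str:
    """Build one line of Graphite's plaintext protocol: '<path> <value> <timestamp>'."""
    return f"{path} {value} {timestamp}\n"


def build_load_payload(prefix: str, loads: tuple, timestamp: int) -> str:
    """Turn the (1, 5, 15 minute) load averages into three Graphite lines."""
    names = ("1min", "5min", "15min")
    return "".join(
        format_graphite_line(f"{prefix}.load.{name}", value, timestamp)
        for name, value in zip(names, loads)
    )


def send_load_average(host: str, port: int = 2003,
                      prefix: str = "servers.example") -> None:
    """Send the current load averages to a Graphite plaintext listener.

    Port 2003 is the conventional plaintext port; the host and prefix are
    placeholders chosen for this sketch. os.getloadavg() is Unix-only.
    """
    payload = build_load_payload(prefix, os.getloadavg(), int(time.time()))
    with socket.create_connection((host, port), timeout=5) as sock:
        sock.sendall(payload.encode("ascii"))
```

Calling `send_load_average("graphite.example.internal")` from a scheduled job (with your own host substituted) would produce one data point per metric per run, which a Grafana panel can then plot as a time series.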

Using Helm Dashboard and Intents-Based Access Control for Pain-Free Network Segmentation

Helm Dashboard is an open-source project which graphically shows installed Helm charts, revisions, and changes to their Kubernetes resources. The intents operator is an open-source Kubernetes operator which makes it possible to roll out network policies in a Kubernetes cluster, chart by chart, and gradually achieve zero trust or network segmentation.

AzCopy Installation: Simplifying Data Transfers to the Cloud

Data management and transfer are essential components of the digital era. Whether you are an IT professional, a developer, or simply someone looking to move large amounts of data to the cloud, the efficiency and reliability of the process are paramount. That’s where AzCopy comes into the picture.

Leveraging AWS EventBridge to stay ahead of spot instance interruptions

Amazon EC2 Spot Instances can help you save significantly on your compute costs. However, you should also be aware that Amazon can take them back with a two-minute notice if the demand for the instance type goes up. Fortunately, AWS EventBridge, along with Spot by NetApp, can help you automate the process of detecting and reacting faster to these interruptions.
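For orientation, here is a minimal sketch of how such an interruption notice might be recognized in code. EC2 publishes the two-minute notice as an event with source `aws.ec2` and detail type `EC2 Spot Instance Interruption Warning`; the handler functions and the sample event below are illustrative assumptions, not the integration described in the linked article.

```python
# Event pattern that an EventBridge rule could use to select interruption notices.
SPOT_INTERRUPTION_PATTERN = {
    "source": ["aws.ec2"],
    "detail-type": ["EC2 Spot Instance Interruption Warning"],
}


def is_spot_interruption(event: dict) -> bool:
    """Return True if the event matches the interruption-warning pattern."""
    return (
        event.get("source") in SPOT_INTERRUPTION_PATTERN["source"]
        and event.get("detail-type") in SPOT_INTERRUPTION_PATTERN["detail-type"]
    )


def interrupted_instance_id(event: dict):
    """Return the affected instance ID, or None for unrelated events.

    Hypothetical handler: a real one would go on to drain or replace the
    instance within the two-minute window.
    """
    if not is_spot_interruption(event):
        return None
    return event.get("detail", {}).get("instance-id")


# Made-up sample event, trimmed to the fields used above.
sample_event = {
    "source": "aws.ec2",
    "detail-type": "EC2 Spot Instance Interruption Warning",
    "detail": {"instance-id": "i-0123456789abcdef0", "instance-action": "terminate"},
}
```

In practice the pattern would be attached to an EventBridge rule whose target (for example, a Lambda function) runs logic along the lines of `interrupted_instance_id`.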

Azure Distributed Transaction Performance Monitoring

In this article, we will explore Azure Distributed Transaction Performance Monitoring using Serverless360’s new feature called BAM Duration Monitoring. Our primary focus will be on effectively monitoring a long-running business process implemented using a combination of Logic Apps and Data Factory.