DevOps

The latest news and information on DevOps, CI/CD, automation, and related technologies.

How we Went From Two Major Outages to 99.98% Reliability in Just 6 Months with Eran Kampf

Discover Twingate's incredible journey from facing major outages to achieving 99.98% reliability within six months. At Navigate NA 24, hear firsthand about the challenges, solutions, and innovations that transformed their operations. Learn about their approach to architecture, incident management, and customer communication that not only restored trust but also turned reliability into a competitive advantage.

The clock is ticking: Azure VM RIs exchange opportunity ends July 1

Did you know that companies typically waste as much as 35% of their budgets on unused compute resources? Often, these idle resources are commitment waste: reserved commitments purchased at a discount but with a financial lock-in of one or three years. Commitments such as Azure Compute Reservations (VM RIs) offer significant savings compared to on-demand pricing.

Introducing Playbooks automation

We're rolling out Playbooks, our latest step toward fully automating the incident response process. Imagine that every action you, as an incident responder, had to take manually is now fully automated with Playbooks. Steps like initiating a war room (video conference), logging incidents, sending out alerts, and running diagnostic scripts are now executed with precision, every single time, without you lifting a finger.

The next buzz in the city of bees: digital infrastructure, AI, and Manchester

Manchester has come a long way, from pioneering the world’s first stored-program digital computer to becoming the top tech city in the UK outside of London. The MCC 2021-2026 Digital Strategy now guides a £5bn digital economy, with more than 10,000 businesses employing over 96,000 people. It has seen the development of five unicorns and is still home to three of these billion-pound businesses. So, the city of bees is buzzing.

How to standardize resiliency on Kubernetes

There’s more pressure than ever to deliver high-availability Kubernetes systems, but a combination of organizational and technological hurdles makes this easier said than done. Technologically, Kubernetes is complex and ephemeral, with deployments that span infrastructure, cluster, node, and pod layers. As with any complex and ephemeral system, the large number of constantly changing parts opens the possibility of sudden, unexpected failures.

Understanding Monitoring Tools

If you care about operational excellence in your IT infrastructure, monitoring systems play a pivotal role. As we navigate the myriad available monitoring tools, it becomes essential to understand the distinct architectures, styles, and focal points of the various solutions, as well as the time-to-value they offer.

#024 - Kubernetes for Humans Podcast with Gabriele Bartolini [EDB]

A long-time open-source programmer and entrepreneur, Gabriele has a degree in Statistics from the University of Florence. After having consistently contributed to the growth of 2ndQuadrant and its members through nurturing a lean and DevOps culture, he is now leading the Cloud Native initiative at EDB.